Xiaohan Yan  颜小涵

Algorithm Engineer @ AgiBot

Research interests: Multimodal Large Models, 3D Computer Vision, and Reinforcement Learning.
Feel free to drop me an email if we share any interests.

Experience
Algorithm Engineer
2025 – Present
Research Intern
Apr 2024 – Mar 2025
Research Intern
Dec 2023 – Apr 2024
Education
M.S. in Computer Science
2022 – 2025
B.S. in Computer Science
Hohai University · ACM Team Captain
2018 – 2022

* indicates equal contribution

ALOE
ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training
Rushuai Yang*, Hecheng Wang*, Chiming Liu*, Xiaohan Yan, Yunlong Wang, Xuan Du, Shuoyu Yue, Yongcheng Liu, Chuheng Zhang, Lizhe Qi, Yi Chen, Wei Shan, Maoqing Yao
arXiv 2026 · arXiv · Project
We propose ALOE, an action-level off-policy evaluation framework for VLA post-training that enables fine-grained credit assignment and stable policy improvement in real-world robotic manipulation.
HOLO
HOLO: Holistic Lightweight Optimization for Scene Understanding with Auto-Annotation and Multimodal Learning
Xiaoyun Hu*, Xiaohan Yan*, Nan Wang, Xiaowei Song, Gang Wei, Zhicheng Wang
WACV 2026
We propose HOLO, which includes a large-scale scene description dataset and a lightweight 3D-LLM.
RE0
RE0: Recognize Everything with 3D Zero-shot Instance Segmentation
Xiaohan Yan*, Zijian Jiang*, Yinghao Shuai*, Nan Wang, Xiaowei Song, Wenbo Ji, Ge Wu, Jinyu He, Gang Wei, Zhicheng Wang
ICRA 2025 · IEEE · Code · Project
Given 3D point clouds and multi-view RGB-D images with poses, RE0 leverages the 3D geometric information, projection relationships and CLIP semantic features for 3D zero-shot instance segmentation.
SGGS
Semantic-Guided Gaussian Splatting with Deferred Rendering
Nan Wang, Xiaohan Yan, Xiaowei Song, Zhicheng Wang
ICASSP 2025 · IEEE · Code · Poster
We use semantic features derived from a 2D foundation model to guide material property optimization for 3DGS.
AttenPoint
AttenPoint: Exploring Point Cloud Segmentation through Attention-Based Modules
Xiaohan Yan, Nan Wang, Xiaowei Song, Gang Wei, Zhicheng Wang
PRCV 2024 · ACM
We combine local and global structural features to perform few-shot point cloud semantic segmentation.
GreedyAgent
GreedyAgent: A Simple yet Efficient Approach for Meta Learning from Learning Curves
Jinyu He, Xiaowei Song, Xiaohan Yan, Nan Wang, Yuqi Miao, Zijian Jiang, Fei Chao, Yan Zhang, Shengchuan Zhang, Rongrong Ji
ICIC 2024 · ACM · Code
Meta-learning from learning curves, a key sub-problem within meta-learning, is a mature area that is gradually gaining attention.
ASGMVLP
Anatomical Structure-Guided Medical Vision-Language Pre-training
Qingqiu Li, Xiaohan Yan, Jilan Xu, Runtian Yuan, Yuejie Zhang, Rui Feng, Quanli Shen, Xiaobo Zhang, Shujun Wang
MICCAI 2024 · arXiv · Code · Project
We propose an anatomical structure-guided framework for medical vision-language pre-training that improves cross-modal alignment.
Kaggle
LLM Science Exam — Use LLMs to Answer Difficult Science Questions
Xiaohan Yan, Nan Wang, Xiaowei Song, Jinyu He
Kaggle 2023 · Code
We fine-tuned large language models on private datasets, scoring 0.905 and ranking in the top 3% worldwide (Silver Medal).
eScape
eScape — A Geometry Storm Game
Origami-hui, Xiaohan Yan
GameJam 2023 · Play
Scale your device and escape from this geometry storm. Ranked 1st in Innovation and 2nd in Theme Interpretation.
Most of the awards I won during my student years. 2018 – 2024
The 2019 ICPC Asia-East Continent Final — Bronze Medal 2019 – 2020
Jiangsu Collegiate Programming Contest — Silver Medal 2nd place 2019 – 2020
[09/2025] Our paper HOLO has been accepted to WACV 2026.
[03/2025] I graduated from Tongji University with a Master's degree in Computer Science!
[01/2025] Our paper SGGS has been accepted to ICASSP 2025.
[01/2025] Our paper RE0 has been accepted to ICRA 2025.

3D Modeling

  • Some stuff I have modeled and rendered.

Sports

  • Swimming
  • Flying Disc

Games

  • Pokemon
  • Digimon

Language

  • Japanese