Mingtong Zhang


I am building general purpose robots.

My research interests are in computer vision and robotics. I focus on generalization capability and scalability of AI.

I explore how to represent our physical world and develop general models and algorithms to empower artificial intelligence to perceive and interact with it.

Email  /  GScholar  /  Github  /  Twitter

profile photo
Research
Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling
Mingtong Zhang*, Kaifeng Zhang*, Yunzhu Li
Conference on Robot Learning (CoRL), 2024
[website] [paper] [code] [demo]

D3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
Yixuan Wang*, Mingtong Zhang*, Zhuoran Li*, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li
Conference on Robot Learning (CoRL), 2024
Oral
[website] [paper] [code]

KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
Zixian Liu*, Mingtong Zhang*, Yunzhu Li
Conference on Robot Learning (CoRL), 2024 @ LangRob Spotlight
In submission to International Conference on Robotics and Automation (ICRA), 2025
[website]

Neural Dynamics Augmented Diffusion Policy
Ruihai Wu*, Haozhe Chen*, Mingtong Zhang*, Haoran Lu, Yitong Li, Yunzhu Li
In submission to International Conference on Robotics and Automation (ICRA), 2025
[website]

Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Open X-Embodiment Collaboration
International Conference on Robotics and Automation (ICRA), 2024
Best Paper Award
[project] [paper] [blogpost] [code] [data]

Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields
Mingtong Zhang*, Shuhong Zheng*, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang
European Conference on Computer Vision Workshop, 2022
[arXiv]

Service
  • Reviewer: CoRL, ICLR
Simulately: Handy information and resources for physics simulators for robot learning research.
Haoran Geng, Yuyang Li, Yuzhe Qin, Ran Gong, Wensi Ai, Yuanpei Chen, Puhao Li, Junfeng Ni, Zhou Xian, Songlin Wei, Yang You, Yufei Ding, Jialiang Zhang, Mingtong Zhang
Open-source Project
Selected into CMU 16-831
[website]


Template adapted from Jon Barron