MolmoAct 2: An open foundation for robots that work in the real world
Haoquan Fang, Jiafei Duan, Donovan Clay, Sam Wang, Shuo Liu, Weikai Huang, Xiang Fan, Wei-Chuan Tsai, Shirui Chen, Yi Ru Wang, Shanli Xing, Jaemin Cho, Jae Sung Park, Ainaz Eftekhar, Peter Sushko, Karen Farley, Angad Wadhwa, Cole Harrison, Winson Han, Ying-Chun Lee, Eli VanderBilt, Rose Hendrix, Suveen Ellawela, Lucas Ngoo, Joyce Chai, Zhongzheng Ren, Ali Farhadi, Dieter Fox, Ranjay Krishna
May 2026
Abstract
MolmoAct2 is an open family of action reasoning models for robot control and real-world deployment. It builds on an embodied reasoning VLM backbone, integrates action modeling with a flow-matching action expert, and is released with open checkpoints and datasets for training, fine-tuning, and deployment.
Publication
arXiv preprint
