Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models
CVPR 2026 MUSI Workshop