Jaemin Cho
Vision and Language
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
Multimodal Transformers can paint when reading text - *[EMNLP 2020](https://2020.emnlp.org/)*