Text-to-Image Generation

X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers

Multimodal Transformers can paint when reading text - *[EMNLP 2020](https://2020.emnlp.org/)*