Text-to-Image Generation

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Probing the Reasoning Skills and Social Biases of Text-to-Image Models

X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers

Generate image from text by predicting masked patches with multi-modal transformers - *[EMNLP 2020](https://2020.emnlp.org/)*