Text-to-Image Generation

DOCCI: Descriptions of Connected and Contrasting Images
High-quality, long, human-annotated descriptions of 15K images
DOCCI: Descriptions of Connected and Contrasting Images