Text-to-Image Evaluation

DOCCI: Descriptions of Connected and Contrasting Images
High-quality, long, human-annotated descriptions of 15K images
DOCCI: Descriptions of Connected and Contrasting Images