Tags

CLIP
FineCapEval
Image captioning
Adapter
VL Adapter
Multi-hop
Multi-modal
Distillation