Tags

LST
CLIP
FineCapEval
Image captioning
Adapter
VL Adapter
Multi-hop