Retrieval-Enhanced Contrastive Vision-Text Models paper page: https://
huggingface.co/papers/2306.07
196
… Contrastive image-text models such as CLIP form the building blocks of many state-of-the-art systems. While they excel at recognizing common generic concepts, they still struggle on
Retrieval-Enhanced Contrastive Vision-Text Models for Improved Concept Recognition
By
–
