KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models Image ad understanding is a crucial task with wide real-world applications. Although highly challenging with the involvement of diverse atypical scenes, real-world
KAFA: Knowledge-Augmented Vision-Language Models for Image Ad Understanding
By
–
Leave a Reply