CulturalGround data construction: We designed a scalable pipeline to create culturally grounded multilingual VQA data from Wikidata, a structured knowledge base. Our data curation pipeline:
– Cultural Entity Selection: Extract 3M+ culturally relevant entities from Wikidata
CulturalGround: Scalable Multilingual VQA Data Pipeline from Wikidata
By
–
