They first used the INQUIRE dataset to test whether vision-language models (VLMs) could narrow a pool of five million images down to the 100 most relevant results, a task known as ranking. For straightforward search queries like "a reef with manmade structures and debris," relatively large models like SigLIP found matching
VLMs Ranking Images: INQUIRE Dataset Tests SigLIP Performance