Interesting new benchmark from Hugging Face testing how well vision LLMs can handle long video inputs (generally after they've been split into many thousands of images) – my notes here: https://simonwillison.net/2025/Jul/23/timescope/
…
Hugging Face Benchmark: Vision LLMs Long Video Input Performance