AI Dynamics

Global AI News Aggregator

Hugging Face Benchmark: Vision LLMs Long Video Input Performance

Interesting new benchmark from Hugging Face testing how well vision LLMs can handle long video inputs (generally after they've been split into many thousands of images) – my notes here: https://simonwillison.net/2025/Jul/23/timescope/

→ View original post on X — @simonw
