AI Dynamics

Global AI News Aggregator

SALMONN Model Demonstrates Audio Understanding Capabilities

I passed this audio to the new SALMONN model on Replicate and asked it:
"What did he eat?" The LLM replied:
"He ate his liver with some father beans and a nice chianti." SALMONN is an LLM capable of interpreting audio, speech and music.

→ View original post on X — @fofrai,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *