AI Dynamics

Global AI News Aggregator

About

SALMONN Model Demonstrates Audio Understanding Capabilities

I passed this audio to the new SALMONN model on Replicate and asked it:
"What did he eat?" The LLM replied:
"He ate his liver with some father beans and a nice chianti." SALMONN is an LLM capable of interpreting audio, speech and music.

→ View original post on X — @fofrai