Nope, RAG is still 100s of time faster and cheaper than long-context
@aymericroucher
-
Qwen2.5-Coder-32B first sub-70B model with great agentic abilities
By
–
Spreading awareness: Qwen models are fire.
Qwen2.5-Coder-32B is the first model below 70B that I find to have really good agentic capabilities! -
Use Spaces as Tools for Agentic Workflows
By
–
Breaking: You can now use a Space as a tool for your transformers.agent! 🛠️🔥🔥
— m_ric (@AymericRoucher) 19 novembre 2024
This lets you take the coolest spaces, like FLUX.1-dev, and use them in agentic workflows with a few lines of code! 🧑💻
On the video below, I set up my fake vacation pictures where I'm awesome at… pic.twitter.com/rzHFFWjvzaBreaking: You can now use a Space as a tool for your transformers.agent! This lets you take the coolest spaces, like FLUX.1-dev, and use them in agentic workflows with a few lines of code! On the video below, I set up my fake vacation pictures where I'm awesome at
-

Meta’s New Unbreakable Watermarking Model: A Shield Against Deepfakes and Stolen Art
By
–
Meta team just dropped the first Watermarking model that not edit can break! Ever heard of watermarking? It's a technique that allows you to mark in an image its original source. It's our best shield against AI-generated deepfakes, or content stolen from artists!
-

Qwen2.5-Coder-32B: first open-source model to match GPT-4o
By
–
> Qwen2.5-Coder-32B: new best-in-class open coding model, beats GPT-4o on most coding benchmarks! It's the first time Open-Source coding model of this size class that clearly matches GPT-4o's coding capabilities! Completes the previous two Qwen 2.5 Coder release with 4
-

Scaling laws diminishing returns for OpenAI GPT models
By
–
> Are scaling laws over? A report from the Information announced that @OpenAI is seeing diminishing returns from scaling up the next GPT models. What are scaling laws? These are empiric laws that say "Every time you increase compute spent in training 10-fold, your LLM's
-
HfApiEngine improvement simplifies open-LLM agent creation
By
–
A nice PR from @BruleNaudet in transformers.agents just improved HfApiEngine. This makes the creation of open-LLM-powered agents even easier with our free Inference API!
-
Autogen-based agent team tops GAIA benchmark submissions
By
–
It's not a new framework, it's simply a team of agent based on Microsoft's autogen framework! And submissions of this structure from Microsoft have long been near the top of the GAIA benchmark.
-

AndroidLab benchmark shows small fine-tuned models can power JARVIS
By
–
> AndroidLab: First ever systematic benchmark for Android mobile agents shows that small, fine-tuned open models can power a JARVIS system on your smartphone A team from @Tsinghua_Uni just released AndroidLab, the first systematic framework to evaluate and train Android