๐ฆ๐ต๐ผ๐๐จ๐: ๐ฎ ๐๐บ๐ฎ๐น๐น ๐ฒ๐ป๐ฑ-๐๐ผ-๐ฒ๐ป๐ฑ ๐ฎ๐ด๐ฒ๐ป๐ ๐๐ต๐ฎ๐ ๐ฐ๐ฎ๐ป ๐ป๐ฎ๐๐ถ๐ด๐ฎ๐๐ฒ ๐ฎ๐ป๐ ๐จ๐ ๐ฎ๐ป๐ฑ ๐ผ๐๐๐ฝ๐ฒ๐ฟ๐ณ๐ผ๐ฟ๐บ๐ ๐บ๐๐ฐ๐ต ๐ฏ๐ถ๐ด๐ด๐ฒ๐ฟ ๐๐๐๐๐ฒ๐บ๐! A team from NUS and Microsoft just released an agent that can act on any UI (Desktop, Android, Web)
@aymericroucher
-

Adobe’s Code-Generating Agent Tops GAIA Leaderboard
By
–
๐๐ฑ๐ผ๐ฏ๐ฒ'๐ ๐ฐ๐ผ๐ฑ๐ฒ-๐ด๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐ป๐ด ๐ฎ๐ด๐ฒ๐ป๐ ๐ฟ๐ฒ๐ฎ๐ฐ๐ต๐ฒ๐ ๐๐ต๐ฒ ๐๐ผ๐ฝ ๐ผ๐ณ ๐๐๐๐ ๐น๐ฒ๐ฎ๐ฑ๐ฒ๐ฟ๐ฏ๐ผ๐ฎ๐ฟ๐ฑ – and they cite "Roucher, 2024" in the paper! Reminder:ย Broadly defined, an "Agent" is a system where a LLM is augmented with the ability to run
-
Release of Qwen-QwQ-32B Preview Model on Hugging Face
By
–
https://
huggingface.co/chat/models/Qw
en/QwQ-32B-Preview
โฆ -
Original MNIST dataset updated on Hugging Face
By
–
MNIST original dataset updated by the himself on Hugging Face!
-
The State of Generative AI in the Enterprise
By
–
https://
menlovc.com/2024-the-state
-of-generative-ai-in-the-enterprise/
โฆ -

State of Enterprise AI 2024: Market Trends and Agent Adoption
By
–
๐ฆ๐๐ฎ๐๐ฒ ๐ผ๐ณ ๐๐ป๐๐ฒ๐ฟ๐ฝ๐ฟ๐ถ๐๐ฒ ๐๐ ๐ฎ๐ฌ๐ฎ๐ฐ: ๐๐ป๐๐ต๐ฟ๐ผ๐ฝ๐ถ๐ฐ ๐ฒ๐ฎ๐๐ถ๐ป๐ด ๐๐ฝ ๐ข๐ฝ๐ฒ๐ป๐๐, ๐๐ด๐ฒ๐ป๐๐ ๐ฟ๐ฎ๐บ๐ฝ ๐๐ฝ ๐๐ผ ๐ญ๐ฎ% ๐ผ๐ณ ๐๐๐ฒ-๐ฐ๐ฎ๐๐ฒ๐, ๐ผ๐ฝ๐ฒ๐ป ๐บ๐ผ๐ฑ๐ฒ๐น๐ ๐บ๐ฎ๐ธ๐ฒ ๐ญ๐ต% ๐ผ๐ณ ๐๐๐ฎ๐ด๐ฒ @MenloVentures surveyed 600 enterprise IT decision-makers
-
Discussion on fine-tuning Llama-based models
By
–
Yes it's a finetune of llama, thus I hesitated to include it, but they do build great models
-

New app shows no European company in top 10 LLM rankings
By
–
Made a new app to visualize the LLM race โ ๐ก๐ผ ๐๐๐ฟ๐ผ๐ฝ๐ฒ๐ฎ๐ป ๐ฐ๐ผ๐บ๐ฝ๐ฎ๐ป๐ ๐ถ๐ป ๐๐ต๐ฒ ๐๐ผ๐ฝ ๐ญ๐ฌ I've adapted an app by @andrewrreed that tracks progress of LLMs on the Chatbot Arena leaderboard, to compare companies from different countries. The outcome is quite
-

New leaderboard ranks LLMs for LLM-as-a-judge; Llama-3.1-70B tops
By
–
New leaderboard ranks LLMs for LLM-as-a-judge: Llama-3.1-70B tops the rankings! Evaluating systems is critical during prototyping and in production, and LLM-as-a-judge has become a standard technique to do it. First, what is "LLM-as-a-judge"? It's a very useful technique

