Daredevil-8B is a mega-merge composed of 9 models using DARE TIES. Like the original Daredevil-7B, the model recipe was designed to maximize the MMLU score. The model family tree is already quite wild. Daredevil-8B: https://
huggingface.co/mlabonne/Dared
evil-8B
…
@maximelabonne
-

Daredevil-8B: Mega-Merge Model Using DARE TIES
By
–
-

NeuralDaredevil-8B Achieves Top MMLU Scores Among 8B Models
By
–
NeuralDaredevil-8B Daredevil-8B has the highest MMLU score among 8B models on the Open LLM Leaderboard. Thanks to abliteration and DPO fine-tuning, I managed to create an uncensored version that outperforms Llama 3 Instruct 8B on every benchmark (9 tested).
-
AI Model Performance Regression Discussed in Technical Community
By
–
Yeah it's broken, the model even recovered the MMLU loss
-
Building Large Language Models From Scratch Repository
By
–
Thanks Sebastian, I really enjoy Build a Large Language Model From Scratch. Amazing repo too!
-

Models Show Refusal Behavior on Harmless Instructions
By
–
Yes they also show it in the blog post. In this figure, they've added the refusal direction and you see models refusing harmless instructions
-
New Abliterated Models Released for Larger AI Systems
By
–
They also released a collection of abliterated models. I recommend giving it a try (especially for larger models).
-

Weight Modification Jailbreak Technique for Large Language Models
By
–
Abliterating LLMs is the most interesting trend I've seen in months A simple weight modification can jailbreak models without any retraining. Here's how it works: Identification – Run model on harmful & harmless prompts
– Capture activations at the last token position
– -
Phi Models Overrated: Scaling Remains Safest Path Beyond GPT-4o
By
–
I'm slightly skeptical about Phi models, which are very good but overrated in benchmarks imo I'd say it's not necessary, but scaling is definitely the safest way of surpassing GPT-4o
-
Code Generation Advancement in AI Development
By
–
Hey that's cool. I worked a lot in code gen, it's great to see this
-
Gemini Performance Improvement on MMLU Benchmark
By
–
Agreed, MMLU is not a great benchmark and Gemini has improved quite a lot