@maximelabonne - AI Dynamics

Daredevil-8B: Mega-Merge Model Using DARE TIES

By

@maximelabonne

–

29 May 2024 17h08

Daredevil-8B is a mega-merge composed of 9 models using DARE TIES. Like the original Daredevil-7B, the model recipe was designed to maximize the MMLU score. The model family tree is already quite wild. Daredevil-8B: https://huggingface.co/mlabonne/Daredevil-8B
…
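
To make the merge method more concrete, here is a minimal sketch of a DARE TIES style merge on flat lists of parameters. It is an illustration only, not the recipe or tooling behind Daredevil-8B: the post does not give the actual configuration, and the function names, `drop_rate`, and `seed` are assumptions made for this example. In this sketch, DARE's random drop-and-rescale plays the role of sparsifying each model's delta from the base, and a TIES-style sign election decides which deltas get averaged.

```python
import random


def dare_drop_and_rescale(delta, drop_rate, rng):
    """Zero each delta entry with probability drop_rate and rescale survivors by 1 / (1 - drop_rate)."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    return [0.0 if rng.random() < drop_rate else d * scale for d in delta]


def ties_sign_merge(deltas):
    """Per parameter, elect the sign of the summed deltas, then average only the deltas that agree with it."""
    merged = []
    for values in zip(*deltas):
        elected_sign = 1.0 if sum(values) >= 0 else -1.0
        # Zeroed (dropped) entries and entries with the opposite sign are excluded.
        agreeing = [v for v in values if v * elected_sign > 0]
        merged.append(sum(agreeing) / len(agreeing) if agreeing else 0.0)
    return merged


def dare_ties_merge(base, finetuned_models, drop_rate=0.5, seed=0):
    """Merge several fine-tuned parameter vectors into one, relative to a shared base (hypothetical helper)."""
    rng = random.Random(seed)
    # Task vectors: what each fine-tuned model changed relative to the base.
    deltas = [[f - b for f, b in zip(model, base)] for model in finetuned_models]
    sparse_deltas = [dare_drop_and_rescale(d, drop_rate, rng) for d in deltas]
    merged_delta = ties_sign_merge(sparse_deltas)
    return [b + d for b, d in zip(base, merged_delta)]


if __name__ == "__main__":
    base = [1.0, 1.0, 1.0]
    models = [[2.0, 0.0, 3.0], [4.0, -2.0, 0.0]]
    print(dare_ties_merge(base, models, drop_rate=0.5, seed=0))
```

A real merge would operate on full weight tensors per layer and usually adds per-model weights and densities, which are left out here to keep the idea visible.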

→ View original post on X: @maximelabonne, 29 May 2024

NeuralDaredevil-8B Achieves Top MMLU Scores Among 8B Models

By

@maximelabonne

–

29 May 2024 17h08

NeuralDaredevil-8B: Daredevil-8B has the highest MMLU score among 8B models on the Open LLM Leaderboard. Thanks to abliteration and DPO fine-tuning, I managed to create an uncensored version that outperforms Llama 3 Instruct 8B on every benchmark (9 tested).

→ View original post on X: @maximelabonne, 29 May 2024

AI Model Performance Regression Discussed in Technical Community

By

@maximelabonne

–

27 May 2024 17h07

Yeah it's broken, the model even recovered the MMLU loss

→ View original post on X: @maximelabonne, 27 May 2024

Building Large Language Models From Scratch Repository

By

@maximelabonne

–

27 May 2024 17h06

Thanks Sebastian, I really enjoy Build a Large Language Model From Scratch. Amazing repo too!

→ View original post on X: @maximelabonne, 27 May 2024

Models Show Refusal Behavior on Harmless Instructions

By

@maximelabonne

–

26 May 2024 18h00

Yes, they also show it in the blog post. In this figure, they've added the refusal direction, and you can see models refusing harmless instructions.
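
As a rough illustration of what "adding the refusal direction" could look like, the sketch below shifts an activation vector along a unit direction. This is an assumption-laden toy, not the blog post's code: the function names and the `scale` parameter are hypothetical, and a real experiment would apply the shift to a model's hidden states at chosen layers and token positions.

```python
import math


def normalize(vector):
    """Return the vector scaled to unit length."""
    norm = math.sqrt(sum(x * x for x in vector))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vector]


def projection_onto(activation, direction):
    """Scalar component of the activation along the (normalized) direction."""
    unit = normalize(direction)
    return sum(a * u for a, u in zip(activation, unit))


def add_direction(activation, direction, scale=1.0):
    """Shift the activation by `scale` units along the direction (hypothetical steering helper)."""
    unit = normalize(direction)
    return [a + scale * u for a, u in zip(activation, unit)]


if __name__ == "__main__":
    activation = [1.0, 2.0]
    refusal_dir = [0.0, 3.0]  # made-up direction, for illustration only
    steered = add_direction(activation, refusal_dir, scale=2.0)
    print(projection_onto(activation, refusal_dir), projection_onto(steered, refusal_dir))
```

The shift raises the activation's component along the direction by exactly `scale`, which is the sense in which a harmless prompt can be pushed toward refusal-like internal states.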

→ View original post on X: @maximelabonne, 26 May 2024

New Abliterated Models Released for Larger AI Systems

By

@maximelabonne

–

26 May 2024 13h54

They also released a collection of abliterated models. I recommend giving it a try (especially for larger models).

→ View original post on X: @maximelabonne, 26 May 2024

Weight Modification Jailbreak Technique for Large Language Models

By

@maximelabonne

–

26 May 2024 13h51

Abliterating LLMs is the most interesting trend I've seen in months. A simple weight modification can jailbreak models without any retraining. Here's how it works:

Identification
– Run model on harmful & harmless prompts
– Capture activations at the last token position
–
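
The identification steps listed above can be sketched on toy data as follows. The post is cut off after the second step, so everything past "capture activations" is an assumption here: the sketch takes the difference of mean last-token activations as the candidate direction, and then removes that direction from a weight matrix as one possible form of the "simple weight modification" the post mentions. All names are hypothetical, and the activations are plain lists rather than real model outputs.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def refusal_direction(harmful_acts, harmless_acts):
    """Assumed step: unit vector from the harmless mean to the harmful mean of last-token activations."""
    diff = [h - n for h, n in zip(mean_vector(harmful_acts), mean_vector(harmless_acts))]
    norm = math.sqrt(sum(x * x for x in diff))
    if norm == 0.0:
        raise ValueError("harmful and harmless means coincide; no direction to extract")
    return [x / norm for x in diff]


def ablate_direction_from_weights(weights, unit_direction):
    """Assumed step: W' = W - r r^T W, so outputs of W' have no component along r.

    `weights` is a list of rows with shape (d_model, d_in); `unit_direction` has length d_model.
    """
    columns = list(zip(*weights))
    # Component of each weight column along the direction.
    coeffs = [sum(r * w for r, w in zip(unit_direction, column)) for column in columns]
    return [
        [w - r * c for w, c in zip(row, coeffs)]
        for row, r in zip(weights, unit_direction)
    ]


if __name__ == "__main__":
    harmful = [[2.0, 0.0], [4.0, 0.0]]   # toy last-token activations
    harmless = [[1.0, 0.0], [1.0, 0.0]]
    direction = refusal_direction(harmful, harmless)
    print(ablate_direction_from_weights([[1.0, 2.0], [3.0, 4.0]], direction))
```

In practice the activations would be collected per layer from the model's residual stream, and the choice of layer and of which matrices to modify is a tuning decision that this sketch does not address.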

→ View original post on X: @maximelabonne, 26 May 2024

Phi Models Overrated: Scaling Remains Safest Path Beyond GPT-4o

By

@maximelabonne

–

24 May 2024 18h10

I'm slightly skeptical about Phi models, which are very good but overrated in benchmarks imo. I'd say it's not necessary, but scaling is definitely the safest way of surpassing GPT-4o.

→ View original post on X: @maximelabonne, 24 May 2024