AI Dynamics

Global AI News Aggregator

About

Daredevil-8B: Mega-Merge Model Using DARE TIES

Daredevil-8B is a mega-merge composed of 9 models using DARE TIES. Like the original Daredevil-7B, the model recipe was designed to maximize the MMLU score. The model family tree is already quite wild. Daredevil-8B: https://
huggingface.co/mlabonne/Dared
evil-8B

→ View original post on X — @maximelabonne,