Model Merging and Safety Alignment
A new paper examines how poorly model merging preserves safety alignment. The authors modify EvoMM and LM-Cocktail to balance performance on safety data against performance on domain-specific data. They show that this safety-aware merging approach can
Model Merging Preserves Safety Alignment Better
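
To make the idea of balancing safety data against domain-specific data concrete, here is a minimal sketch of loss-weighted parameter averaging. It is not the paper's method and not the actual EvoMM or LM-Cocktail code. It only assumes that each candidate model has been scored with one loss on a small safety set and one on a small domain set, and that lower combined loss should earn a larger merging weight. All function names, the `safety_share` parameter, and the toy parameter dictionaries are hypothetical.

```python
import math


def combined_losses(domain_losses, safety_losses, safety_share=0.5):
    """Blend per-model domain and safety losses into one score per model.

    safety_share is a hypothetical knob: 0.0 ignores safety data,
    1.0 ignores domain data.
    """
    return [
        (1.0 - safety_share) * d + safety_share * s
        for d, s in zip(domain_losses, safety_losses)
    ]


def softmax_weights(losses, temperature=1.0):
    """Map losses to merging weights that sum to 1 (lower loss, larger weight)."""
    scaled = [-loss / temperature for loss in losses]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def merge_parameters(models, weights):
    """Weighted average of models given as dicts: name -> list of floats."""
    merged = {}
    for name, values in models[0].items():
        merged[name] = [
            sum(w * model[name][i] for w, model in zip(weights, models))
            for i in range(len(values))
        ]
    return merged


# Toy example: two "models" with a single two-element parameter each.
domain_expert = {"w": [0.0, 2.0]}   # low domain loss, high safety loss
safety_tuned = {"w": [2.0, 4.0]}    # high domain loss, low safety loss

losses = combined_losses(
    domain_losses=[1.0, 3.0],
    safety_losses=[3.0, 1.0],
    safety_share=0.5,
)
weights = softmax_weights(losses)
merged = merge_parameters([domain_expert, safety_tuned], weights)
```

With `safety_share=0.5` the two toy models end up with equal combined loss, so they are averaged evenly. Raising `safety_share` shifts weight toward the safety-tuned model, which is the kind of trade-off a safety-aware merge has to tune.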