Excited to feature Tamper-Resistant Safeguards for Open-Weight LLMs from @lapisrocks! Introducing the first safeguards for LLMs that resist fine-tuning attacks, showing the power of tamper-resistance to make open-weight LLMs safer. @rishub_t is here to answer your questions!
Tamper-Resistant Safeguards for Open-Weight LLMs
