In red teaming, the surface area of potential risks is extremely large. Our approach therefore combines extensive threat modeling with automated evaluations to identify risks. The process then enables capable experts to identify the key risks and report them back to model developers.
Red Teaming Methodology: Threat Modeling and Automated Risk Evaluation
