Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models discuss: https://huggingface.co/papers/2408.08926
… Language Model (LM) agents for cybersecurity that are capable of autonomously identifying vulnerabilities and executing exploits have the potential to
Cybench: Framework for Evaluating Language Models Cybersecurity Capabilities
