Meta is releasing a new suite of security benchmarks for LLMs, CyberSecEval 3
Meta is releasing a new suite of security benchmarks for LLMs, CyberSecEval 3, to continue the conversation on empirically measuring LLM cybersecurity risks and capabilities. CyberSecEval 3 assesses 8 different risks across two broad categories: risk to third parties, and risk to application developers and end users.
"Compared to previous work, we add new areas focused on offensive security capabilities: automated social engineering, scaling manual offensive cyber operations, and autonomous offensive cyber operations. In this paper we discuss applying these benchmarks to the Llama 3 models and a suite of contemporaneous state-of-the-art LLMs, enabling us to contextualize risks both with and without mitigations in place."
Marcio Pacheco