
Cybersecurity AI Benchmark - A Meta-Benchmark for Evaluating Cybersecurity AI Agents
Top Score
N/A
Models Evaluated
0
Dataset Size
10,000 samples
Last Updated
October 28, 2025
Availability
Modular meta-benchmark with 10,000+ instances across 5 evaluation categories including RCTF2 robotics challenges and CyberPII-Bench privacy assessment
Number of Tasks
5
No verified public primary numeric leaderboard/result table has been extracted into the catalog yet; metadata and source links were refreshed during the 2026-05-12 audit.