
Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems
Top Score
0.0%
Models Evaluated
0
Dataset Size
40 samples
Last Updated
July 10, 2025
Title
BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems
Authors
Stanford CRFM
Published
July 10, 2025
arXiv ID
2505.1521625 systems with complex real-world codebases and 40 bug bounties covering 9 of OWASP Top 10 Risks
Number of Tasks
vulnerability-detectionexploit-generationpatch-generationdefense-evaluation
Dataset Size
40 samples