2026-03-20 20:17:53

BlockSec researchers re-evaluated the EVMBench AI audit benchmark and found that its claimed 72% AI exploit success rate may be overly optimistic. In a paper titled "Re-evaluating EVMBench," they tested 26 agent configurations and analyzed 22 real security incidents that occurred after February 2026, with results showing a 0% end-to-end exploit success rate. While AI detection results for known vulnerability patterns were consistent with the initial research findings, BlockSec co-founder (Yajin Zhou) believes that fully automated auditing is not yet practical, advocating for a human-AI collaboration approach in which AI handles breadth analysis while humans handle depth analysis.

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

2 Likes