Sonnet 4.5 can now autonomously find the vulnerability behind the Equifax breach and write an exploit
Using only a Bash shell on a Kali Linux host.
Anthropic published the report "AI models are showing a greater ability to find and exploit vulnerabilities on realistic cyber ranges," where Sonnet 4.5 identified the vulnerability exploited in the Equifax breach and generated an exploit without looking up the publicized CVE details.
What can we learn from this?
- The Equifax breach remains a highly relevant incident. The lessons learned paper that Stuart and I published four years ago
- Frontier labs continue to invest heavily in foundational model cybersecurity capabilities, making models more self-sufficient at executing cyber tasks with reduced reliance on assistive tooling. See my earlier post on why frontier labs invest in cybersecurity
- Did I already mention that prompt patching matters? As LLMs become capable of identifying zero-days, it matters even more.