A great story from Sean Heelan who challenged Opus 4.5- and GPT-5.2-powered agents to write exploits for a zeroday in the QuickJS JavaScript interpreter.

My take:

  1. If you're building a general-purpose security vulnerability discovery startup, it's a good time to pivot.
  2. Frontier models are becoming very capable at finding non-obvious 0-days at scale, lightning fast. They are also getting better at writing exploits without additional scaffolding.
  3. Automated patching is a significantly more difficult challenge, because you need to fix a bug without breaking functionality. Benchmarking there is also incredibly hard. We're not there yet, so you have a little bit of time ahead of you.

Comments:

On the Coming Industrialisation of Exploit Generation with LLMs

Anthropic just launched Claude Opus 4.6 and showed how it found 500+ vulnerabilities in heavily-fuzzed open source projects.

AI agents writing 40+ exploits for QuickJS zero-day at $30 per run in under an hour