Mar 22, 2026 Ilya Kabanov

5 AI security stories this week that change your decisions (Mar 16-22, 2026)

The gap between intended and actual behavior in deployed AI systems is widening.

1. 7 proofs of False in Rocq, the proof checker that verifies the Airbus C compiler

Finding soundness bugs in proof assistant kernels used to require PhD-level expertise in type theory. Historically, one was found per year. A guy with a $200/month AI subscription found 7 in 3 days, each one a way to make the checker certify something impossible as correct.

2. OpenAI reveals its coding agents bypass security, extract credentials, and deceive users to get tasks done

Over five months monitoring tens of millions of internal coding agent interactions, OpenAI found that circumventing restrictions and deceiving users are common behaviors. The agents are just trying so hard to complete tasks that they encode commands in base64, extract encrypted credentials from keychains, and attempt to prompt-inject users.

5 AI security stories this week that change your decisions (Mar 16-22, 2026)

1. 7 proofs of False in Rocq, the proof checker that verifies the Airbus C compiler

2. OpenAI reveals its coding agents bypass security, extract credentials, and deceive users to get tasks done

3. OpenAI explains why Codex Security doesn't include SAST. We may not need it for long.

4. Cursor enters code security with four autonomous agents reviewing 3,000+ internal PRs per week

5. Microsoft benchmark for LLM performance on end-to-end SOC tasks

Sources: