Mathematics had the same problem. Terence Tao solved it. We're applying the same approach to software.
AI compressed every "generative" phase of academic research. What remained? The tasks that require trust and verification.
AI made generating research cheap. The bottleneck shifted to verification — peer review.
AI has driven the cost of idea generation down to almost zero, in a very similar way to how the internet drove the cost of communication down to almost zero. Now the bottleneck is different. People can generate thousands of theories for a given scientific problem. Now we have to verify them, evaluate them.
— Terence Tao, Fields Medal laureate
His solution: replace social trust with mechanical verification. Here's how it works:
Every function of peer review has a formal-methods equivalent. The difference: one scales, the other doesn't.
| What Peer Review Does | Traditional Approach | Formal Methods Approach |
|---|---|---|
| Correctness guarantee | Reviewer re-derives the argument | Proof assistant checks every step mechanically |
| Reproducibility | Other researchers replicate results | Proof compiles deterministically from source |
| Error catching | Human spot-checking, intuition | Static analysis across ALL possible cases |
| Building on prior work | Citation + trust chain | Import proven theorem, compose with new proof |
| Social trust / reputation | "Published in Nature" | "Verified by Lean" — trust is mathematical, not social |
| Scalability | ~5 trusted collaborators max | Unlimited — anyone can contribute, system verifies |
The parallels aren't metaphorical. They're structural. Every concept maps 1-to-1.
| Mathematics | Software |
|---|---|
| Mathematical proof | Property / invariant over code |
| Proof in Lean | Property verified by AG framework |
| Peer review | Code review / security audit / compliance check |
| Theorem | Invariant (e.g., "no SQL injection possible") |
| Proof assistant (Lean) | AG framework + abstract interpreter |
| Formalization (annotating a proof) | Annotation / attribute declaration in code |
| Crowdsourced contributors | AI agents generating annotations |
| Proof compilation (Lean checks) | AG evaluation (framework verifies) |
| Library of proven theorems | Library of proven code properties |
Formal verification isn't theoretical anymore. It's shipping in production.
Formalized in Lean, verified mechanically. What would have taken years of peer review was checked in half an hour.
22 million implications, 99.9963% verified in 19 days. Crowdsourced from contributors worldwide.
AI-human collaboration formally verified sphere-packing proof for the first time — a landmark in mathematical trust.
The compiler proves memory safety. No garbage collector, no manual review needed. Formal verification going mainstream.
Fully formally verified OS kernel. Zero exploitable bugs found in production.
Formally verified C compiler. Proved to never miscompile. Used in safety-critical aviation and nuclear systems.
Formal verification isn't theoretical anymore. It's shipping in production. The question isn't will it come to mainstream software development — it's who builds the infrastructure.
Tao's formalization projects required skilled humans to write the Lean proofs. That's still a bottleneck — just a different one.
The breakthrough for software: AI agents can generate the annotations. They do the "creative" part (deciding what to assert) cheaply and at scale. The framework does the "verification" part (checking the assertion holds) mechanically.
This is the missing piece that makes formal methods practical for everyday software development.
Want the deep technical version? See The Sudoku Insight →