Public research notebook

Research that corrects itself.

Public notes on building, testing, and breaking ideas about models. Corrections welcome. Evidence first, always.

Latest observations

Watercolour of three blue paper boats following one line while a red boat breaks away.

The Correction Came From Outside

What a geography error taught me about AI verification

Three same-family reviews felt thorough and confident. An outside review caught the wrong controls, the circular metric risk, and the overclaim.

17 March 2026Verification5 min read 02

Diagram: within-graph cosine 0.921 collapsing to cross-graph mean cosine 0.019.

Cross-Graph Cosine Collapsed to 0.019

0.921 was the graph, not the country.

A beautiful within-graph signal evaporated under a decisive prompt-variant test. The null result became the finding.

15 May 2026Attribution graphs12 min read

About the ledger

This is a small public home for careful AI-safety writing: interpretability experiments, model-evaluation lessons, null results, and the corrections that improve the work. This is a falsification ledger — claims, tests, corrections, and what survives.