The Observatory Ledger: AI safety & mechanistic interpretability, by Pano Pouroullis
Public research notebook

Research that corrects itself.

Public notes on building, testing, and breaking ideas about models. Corrections welcome. Evidence first, always.

By Pano Pouroullis

Latest observations

About the ledger

This is Pano Pouroullis's public home for AI safety notes: interpretability experiments, model evaluation lessons, null results, and the corrections that improve the work. It is a falsification ledger: a record of claims, the tests run against them, the corrections that followed, and what survives.