Writing & Essays

I write on LessWrong as azsantosk — I've been there since 2018, with a dozen posts and a few hundred karma.

The piece closest to what I do now is "The Market Singularity", which argues for securing human influence over AI through market and decentralized mechanisms rather than centralized control.¹ In "Does AI governance need a Federalist Papers debate?" I make the case for adversarial institution design — governance with real power-limiting mechanisms.² My most developed alignment piece, "An AI-in-a-box success model", argues that a safely boxed Oracle AGI is both easier to build and time-competitive with the unsafe kind.³

I've also written "Cruxes in Katja Grace's Counterarguments", decomposing the case for AI existential risk,⁴ and "Optimization happens inside the mind, not in the world", on why model-based agents optimize the map rather than the territory.⁵

All posts

The Market Singularity: A New Perspective · 2024
Does AI governance need a "Federalist Papers" debate? · 2023
Contra Kevin Dorst's Rational Polarization · 2023
Optimization happens inside the mind, not in the world · 2023
I bet $500 on AI winning the IMO gold medal by 2026 · 2023
Cruxes in Katja Grace's Counterarguments · 2022
Pivotal acts from Math AIs · 2022
An AI-in-a-box success model · 2022
Reverse (intent) alignment may allow for safer Oracles · 2022
Strategies for differential divulgation of key ideas in AI capability · 2022
We cannot directly choose an AGI's utility function · 2022
Why will an AGI be rational? · 2022