Writing & Essays
I write on LessWrong as azsantosk — I've been there since 2018, with a dozen posts and a few hundred karma.
The piece closest to what I do now is "The Market Singularity", which argues for securing human influence over AI through market and decentralized mechanisms rather than centralized control.[1] In "Does AI governance need a 'Federalist Papers' debate?" I make the case for adversarial institution design: governance with real power-limiting mechanisms.[2] My most developed alignment piece, "An AI-in-a-box success model", argues that a safely boxed Oracle AGI is both easier to build and time-competitive with the unsafe kind.[3]
I've also written "Cruxes in Katja Grace's Counterarguments", decomposing the case for AI existential risk,[4] and "Optimization happens inside the mind, not in the world", on why model-based agents optimize the map rather than the territory.[5]
All posts
- The Market Singularity: A New Perspective · 2024
- Does AI governance need a "Federalist Papers" debate? · 2023
- Contra Kevin Dorst's Rational Polarization · 2023
- Optimization happens inside the mind, not in the world · 2023
- I bet $500 on AI winning the IMO gold medal by 2026 · 2023
- Cruxes in Katja Grace's Counterarguments · 2022
- Pivotal acts from Math AIs · 2022
- An AI-in-a-box success model · 2022
- Reverse (intent) alignment may allow for safer Oracles · 2022
- Strategies for differential divulgation of key ideas in AI capability · 2022
- We cannot directly choose an AGI's utility function · 2022
- Why will an AGI be rational? · 2022