“Omniscaling to MNIST” by cloud

November 8th, 2025 · 20 min

Episode Synopsis

In this post, I describe a mindset that is flawed, and yet helpful for choosing impactful technical AI safety research projects. The mindset is this: future AI might look very different than AI today, but good ideas are universal. If you want to develop a method that will scale up to powerful future AI systems, your method should also scale down to MNIST. In other words, good ideas omniscale: they work well across all model sizes, domains, and training regimes. (The Modified National Institute of Standards and Technology database, MNIST: 70,000 images of handwritten digits, 28x28 pixels each (source: Wikipedia). You can fit the whole dataset and many models on a single GPU!)

Putting the omniscaling mindset into practice is straightforward. Any time you come across a clever-sounding machine learning idea, ask: "can I apply this to MNIST?" If not, then it's not a good idea. If so, run an experiment to see if it works. If it doesn't, then it's not a good idea. If it does, then it might be a good idea, and you can continue as usual to more realistic experiments or theory. (A minimal sketch of this loop appears after the outline below.)

In this post, I will: Share how MNIST experiments have informed my [...]

---

Outline:

(01:58) Applications to MNIST
(02:42) Gradient routing
(04:43) Distillation robustifies unlearning
(08:39) Subliminal learning
(10:37) Why you should do it on MNIST
(11:30) MNIST is not sufficient (and other tips)
(14:25) The omniscaling assumption is false
(17:09) Code and more ideas
(18:40) Closing thoughts

The original text contained 7 footnotes which were omitted from this narration.
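To make the "can I apply this to MNIST?" loop concrete, here is a minimal, hypothetical testbed sketch (not code from the post): it assumes PyTorch and torchvision, downloads MNIST, and trains a tiny MLP on a single GPU or CPU in minutes. Any idea you want to stress-test can be dropped into this training loop.

```python
# Minimal MNIST testbed: a tiny MLP trainable in minutes on one GPU (or CPU).
# A generic harness for quickly trying out an idea, not code from the post.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader
from torchvision import datasets, transforms


def get_loaders(batch_size=256):
    tfm = transforms.ToTensor()  # 28x28 grayscale images scaled to [0, 1]
    train = datasets.MNIST("data", train=True, download=True, transform=tfm)
    test = datasets.MNIST("data", train=False, download=True, transform=tfm)
    return (DataLoader(train, batch_size=batch_size, shuffle=True),
            DataLoader(test, batch_size=1024))


class TinyMLP(nn.Module):
    def __init__(self, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),
            nn.Linear(28 * 28, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 10),
        )

    def forward(self, x):
        return self.net(x)


def run(epochs=2, device="cuda" if torch.cuda.is_available() else "cpu"):
    train_loader, test_loader = get_loaders()
    model = TinyMLP().to(device)
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)

    # Train: this is the place to splice in whatever idea you want to test.
    for _ in range(epochs):
        model.train()
        for x, y in train_loader:
            x, y = x.to(device), y.to(device)
            loss = F.cross_entropy(model(x), y)
            opt.zero_grad()
            loss.backward()
            opt.step()

    # Evaluate on the held-out test split.
    model.eval()
    correct = 0
    with torch.no_grad():
        for x, y in test_loader:
            x, y = x.to(device), y.to(device)
            correct += (model(x).argmax(dim=1) == y).sum().item()
    print(f"test accuracy: {correct / len(test_loader.dataset):.4f}")


if __name__ == "__main__":
    run()
```

The point of the sketch is that the whole loop (data, model, training, evaluation) fits in a few dozen lines and a few minutes of compute, so a clever-sounding idea can be checked cheaply before moving on to more realistic experiments.

---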
First published:
November 8th, 2025

Source:
https://www.lesswrong.com/posts/4aeshNuEKF8Ak356D/omniscaling-to-mnist
---
Narrated by TYPE III AUDIO.