When AI Cannibalizes Its Data

18/02/2025 13 min Episodio 1224

Listen "When AI Cannibalizes Its Data"

Descargar episodio Ver en sitio original

Episode Synopsis

Asked ChatGPT anything lately? Talked with a customer service chatbot? Read the results of Google's "AI Overviews" summary feature? If you've used the Internet lately, chances are, you've consumed content created by a large language model. These models, like DeepSeek-R1 or OpenAI's ChatGPT, are kind of like the predictive text feature in your phone on steroids. In order for them to "learn" how to write, the models are trained on millions of examples of human-written text. Thanks in part to these same large language models, a lot of content on the Internet today is written by generative AI. That means that AI models trained nowadays may be consuming their own synthetic content ... and suffering the consequences.View the AI-generated images mentioned in this episode.Have another topic in artificial intelligence you want us to cover? Let us know my emailing [email protected]!Listen to every episode of Short Wave sponsor-free and support our work at NPR by signing up for Short Wave+ at plus.npr.org/shortwave.Learn more about sponsor message choices: podcastchoices.com/adchoicesNPR Privacy Policy

More episodes of the podcast Short Wave

Behold a T-Rex holotype, paleontology's "gold standard" 06/01/2026

Did Earth’s Water Come From Space? 05/01/2026

The trouble of zero 02/01/2026

Science In 2025 Took A Hit. What Does It Mean? 31/12/2025

Climate Anxiety Is Altering Family Planning 30/12/2025

Why Kratom Is At The Heart Of A Big Public Health Debate 29/12/2025

Why Drones Are Catching Whale Breaths 26/12/2025

Drinking Turns Some Red With Asian Glow—And May Fight Tuberculosis 24/12/2025

Why Suicide Prevention is 'Everyone's Business' 23/12/2025

No, Raccoons Aren’t Pet-Ready (Yet) 22/12/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

When AI Cannibalizes Its Data

Listen "When AI Cannibalizes Its Data"

Episode Synopsis

More episodes of the podcast Short Wave

Deep web or Invisible Internet

Information Technology (IT)

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD