Data Augmentation in Natural Language Processing
Episode Synopsis
This week’s guests are Steven Feng, a graduate student, and Ed Hovy, a research professor, both at the Language Technologies Institute at Carnegie Mellon University. We discussed their recent survey paper on Data Augmentation Approaches in NLP (GitHub), which covers an active field of research on techniques for increasing the diversity of training examples without explicitly collecting new data. One key reason such strategies matter is that augmented data can act as a regularizer, reducing overfitting when training models.
Detailed show notes can be found on The Data Exchange web site.