Start / The Data Exchange with Ben Lorica / Data augmentation in natural language processing

Data Augmentation in Natural Language Processing

52 min • 29 juli 2021

This week’s guests are Steven Feng, Graduate Student and Ed Hovy, Research Professor, both from the Language Technologies Institute of Carnegie Mellon University. We discussed their recent survey paper on Data Augmentation Approaches in NLP (GitHub), an active field of research on techniques for increasing the diversity of training examples without explicitly collecting new data. One key reason why such strategies are important is that augmented data can act as a regularizer to reduce overfitting when training models.

Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.

Detailed show notes can be found on The Data Exchange web site.

Subscribe to The Gradient Flow Newsletter.

Kategorier

Förekommer på

00:00 -00:00