52 - Sequence-to-Sequence Learning as Beam-Search Optimization, with Sam Wiseman

15/03/2018 23 min

Listen "52 - Sequence-to-Sequence Learning as Beam-Search Optimization, with Sam Wiseman"

Descargar episodio Ver en sitio original

Episode Synopsis

EMNLP 2016 paper by Sam Wiseman and Sasha Rush.

In this episode we talk with Sam about a paper from a couple of years ago on bringing back some ideas from structured prediction into neural seq2seq models. We talk about the classic problems in structured prediction of exposure bias, label bias, and locally normalized models, how people used to solve these problems, and how we can apply those solutions to modern neural seq2seq architectures using a technique that Sam and Sasha call Beam Search Optimization.

(Note: while we said in the episode that BSO with beam size of 2 is equivalent to a token-level hinge loss, that's not quite accurate; it's close, but there are some subtle differences.)

https://www.semanticscholar.org/paper/Sequence-to-Sequence-Learning-as-Beam-Search-Optim-Wiseman-Rush/28703eef8fe505e8bd592ced3ce52a597097b031

More episodes of the podcast NLP Highlights

Are LLMs safe? 29/02/2024

"Imaginative AI" with Mohamed Elhoseiny 08/01/2024

142 - Science Of Science, with Kyle Lo 28/12/2023

141 - Building an open source LM, with Iz Beltagy and Dirk Groeneveld 29/06/2023

140 - Generative AI and Copyright, with Chris Callison-Burch 06/06/2023

139 - Coherent Long Story Generation, with Kevin Yang 24/03/2023

138 - Compositional Generalization in Neural Networks, with Najoung Kim 20/01/2023

137 - Nearest Neighbor Language Modeling and Machine Translation, with Urvashi Khandelwal 13/01/2023

136 - Including Signed Languages in NLP, with Kayo Yin and Malihe Alikhani 19/05/2022

135 - PhD Application Series: After Submitting Applications 02/03/2022

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

52 - Sequence-to-Sequence Learning as Beam-Search Optimization, with Sam Wiseman

Listen "52 - Sequence-to-Sequence Learning as Beam-Search Optimization, with Sam Wiseman"

Episode Synopsis

More episodes of the podcast NLP Highlights

Googling with breathtaking tricks you ignore

Email on your own domain, luxury or need?

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD