67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman

27/08/2018 39 min

Listen "67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman"

Descargar episodio Ver en sitio original

Episode Synopsis

Paper by Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R. Bowman.

Sam comes on to tell us about GLUE. We talk about the motivation behind setting up a benchmark framework for natural language understanding, how the authors defined "NLU" and chose the tasks for this benchmark, a very nice diagnostic dataset that was constructed for GLUE, and what insight they gained from the experiments they've run so far. We also have some musings about the utility of general-purpose sentence vectors, and about leaderboards.

https://www.semanticscholar.org/paper/GLUE%3A-A-Multi-Task-Benchmark-and-Analysis-Platform-Wang-Singh/a2054eff8b4efe0f1f53d88c08446f9492ae07c1

More episodes of the podcast NLP Highlights

Are LLMs safe? 29/02/2024

"Imaginative AI" with Mohamed Elhoseiny 08/01/2024

142 - Science Of Science, with Kyle Lo 28/12/2023

141 - Building an open source LM, with Iz Beltagy and Dirk Groeneveld 29/06/2023

140 - Generative AI and Copyright, with Chris Callison-Burch 06/06/2023

139 - Coherent Long Story Generation, with Kevin Yang 24/03/2023

138 - Compositional Generalization in Neural Networks, with Najoung Kim 20/01/2023

137 - Nearest Neighbor Language Modeling and Machine Translation, with Urvashi Khandelwal 13/01/2023

136 - Including Signed Languages in NLP, with Kayo Yin and Malihe Alikhani 19/05/2022

135 - PhD Application Series: After Submitting Applications 02/03/2022

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman

Listen "67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman"

Episode Synopsis

More episodes of the podcast NLP Highlights

Preparing for a Hacker Threat

Localhost, there’s no place like 127.0.0.1

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD