How large language models work, a visual intro to transformers

27/10/2024 15 min
Episode Synopsis

This episode explores the inner workings of large language models (LLMs) like ChatGPT, focusing on the transformer architecture. The speaker starts by defining what LLMs are and how they use pre-trained transformers to generate text. The main focus is the attention mechanism, which lets an LLM learn the relationships between words in a sentence and understand their context. The video takes a visual approach and uses simple analogies to explain complex concepts. It also briefly covers the embedding process, which maps words to numerical representations, and the softmax function, which normalizes scores into a probability distribution.

Become a supporter of this podcast: https://www.spreaker.com/podcast/youtube-deepdive--6348983/support.
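For readers who want to see the two ideas mentioned above in code, here is a minimal sketch (not from the episode) of scaled dot-product attention with a softmax, using NumPy and toy matrix sizes chosen for illustration:

```python
import numpy as np

def softmax(x, axis=-1):
    # Shift by the max for numerical stability, then normalize so values sum to 1
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    # Scaled dot-product attention: scores measure how strongly each token
    # relates to every other token; softmax turns them into attention weights
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ V

rng = np.random.default_rng(0)
Q = rng.standard_normal((4, 8))  # 4 tokens, 8-dimensional embeddings (toy sizes)
K = rng.standard_normal((4, 8))
V = rng.standard_normal((4, 8))

out = attention(Q, K, V)
print(out.shape)  # (4, 8): one context-aware vector per token
```

Real transformers add learned projection matrices for Q, K, and V, multiple attention heads, and masking, but the core computation is this softmax-weighted mixture of value vectors.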