Build a Large Language Model (From Scratch)

27/04/2025 22 min Season 1 Episode 3

Episode Synopsis

This compilation of excerpts focuses on the practical implementation of large language models (LLMs), particularly GPT-style architectures, built from foundational concepts upward in PyTorch. It explains key components such as tokenization, embeddings, attention mechanisms, and transformer blocks, and how each contributes to the finished model. It also covers the main stages of LLM development, including pretraining and fine-tuning for tasks such as text classification and instruction following, with attention to practical concerns: handling datasets, working within hardware limits, and loading pre-trained weights. Finally, it introduces methods for evaluating model performance and generating text, including greedy decoding and probabilistic sampling, and touches on advanced training techniques such as parameter-efficient fine-tuning.

Book: Build a Large Language Model (From Scratch) - https://amzn.to/42uzzZR
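For listeners who want a concrete picture of the two text-generation strategies named in the synopsis, here is a minimal sketch contrasting greedy decoding with probabilistic sampling. It is not taken from the episode or the book: the toy vocabulary, the scores, the function names, and the temperature value are all assumptions made for illustration, and it uses plain Python rather than PyTorch to stay dependency-free.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; temperature rescales the scores first."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_decode(logits):
    """Greedy decoding: always pick the index with the highest score."""
    return max(range(len(logits)), key=lambda i: logits[i])


def sample_decode(logits, temperature=1.0, rng=random):
    """Probabilistic sampling: draw an index in proportion to its softmax probability."""
    probs = softmax(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


# Hypothetical toy vocabulary and next-token scores (assumed, not from the source).
vocab = ["the", "cat", "sat", "mat"]
logits = [1.0, 3.0, 0.5, 2.0]

greedy_token = vocab[greedy_decode(logits)]  # deterministic: the top-scoring token
sampled_token = vocab[sample_decode(logits, temperature=0.8, rng=random.Random(0))]
```

Greedy decoding returns the same token every time for the same scores, while sampling can return any token with nonzero probability. In this sketch, lowering the temperature concentrates probability on the top-scoring token, and raising it spreads probability more evenly.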

More episodes of the podcast Hidden State

Situational Awareness: Counting Orders of Magnitude in AI Progress 01/05/2025

AI Engineering with Foundation Models 24/04/2025

Understanding Deep Learning 21/04/2025
