#80- Layer pruning and Mixture of Depths.

18/04/2024 13 min

Episode Synopsis

Hey guys, continuing the series of episodes about PEFT, in this episode I talk about inference optimization techniques for LLMs.

I talk about layer pruning, where we remove a block of consecutive layers from the LLM with almost no loss in model performance.
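
To make the idea concrete, here is a minimal sketch of pruning a block of consecutive layers. It assumes the block is chosen by finding where the hidden state changes least across `n` layers, measured by angular distance; that selection rule, the toy hidden states, and the names `angular_distance`, `pick_block_start`, and `prune_consecutive` are all illustrative assumptions, not something stated in this synopsis.

```python
import math

def angular_distance(u, v):
    """Angle between two vectors, scaled to the range [0, 1]."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    cos = max(-1.0, min(1.0, dot / norm))  # clamp against rounding error
    return math.acos(cos) / math.pi

def pick_block_start(hidden_states, n):
    """Pick the start of the n-layer block whose input and output are most similar.

    hidden_states[i] is the input to layer i; the last entry is the final output.
    """
    num_layers = len(hidden_states) - 1
    candidates = range(num_layers - n + 1)
    return min(
        candidates,
        key=lambda s: angular_distance(hidden_states[s], hidden_states[s + n]),
    )

def prune_consecutive(layers, start, n):
    """Return a new layer list with layers[start:start + n] removed."""
    return layers[:start] + layers[start + n:]

# Toy example (made-up numbers): 4 layers, so 5 hidden states for one token.
hidden_states = [[1.0, 0.0], [0.0, 1.0], [0.1, 1.0], [0.2, 1.0], [-1.0, 0.5]]
layers = ["L0", "L1", "L2", "L3"]

start = pick_block_start(hidden_states, n=2)
pruned = prune_consecutive(layers, start, n=2)
print(start, pruned)
```

In a real model the hidden states would be averaged over many tokens from a calibration dataset, and the pruned model is typically fine-tuned a little afterwards to recover quality.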

I also talk about Mixture of Depths, a technique similar to Mixture of Experts, where a router chooses which tokens are processed in each layer of the LLM.
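
The routing idea can be sketched in a few lines. This is a simplified illustration, assuming a linear router that scores each token and a fixed `capacity` of tokens that the layer processes, while the remaining tokens skip the layer through the residual path. The function `mod_layer`, the toy `block`, and all numbers are hypothetical and only meant to show the control flow.

```python
def mod_layer(tokens, router_weights, block, capacity):
    """Apply `block` only to the `capacity` tokens with the highest router score.

    Tokens that are not selected pass through unchanged (residual path only).
    """
    # Scalar router score per token: dot product with the router weights.
    scores = [sum(w * x for w, x in zip(router_weights, tok)) for tok in tokens]
    # Indices of the top-`capacity` tokens by score.
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    selected = set(ranked[:capacity])

    out = []
    for i, tok in enumerate(tokens):
        if i in selected:
            update = block(tok)
            # Residual update scaled by the router score, so the router
            # influences the output of the tokens it selects.
            out.append([x + scores[i] * u for x, u in zip(tok, update)])
        else:
            out.append(list(tok))  # skipped: identity
    return out, sorted(selected)

# Toy example: 4 tokens of dimension 2; the "block" just doubles its input.
tokens = [[1.0, 0.0], [0.0, 1.0], [3.0, 1.0], [0.5, 0.5]]
router_weights = [1.0, 0.5]
block = lambda tok: [2.0 * x for x in tok]

out, processed = mod_layer(tokens, router_weights, block, capacity=2)
print(processed)
print(out)
```

With `capacity=2`, only half of the tokens pay the compute cost of the block in this layer, which is where the inference savings come from.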

Paper MoD: https://arxiv.org/pdf/2404.02258.pdf
Paper layer pruning: https://arxiv.org/pdf/2403.17887v1.pdf
Instagram of the podcast: https://www.instagram.com/podcast.lifewithai
LinkedIn of the podcast: https://www.linkedin.com/company/life-with-ai

More episodes of the podcast Life with AI

#99- GraphRAG. 05/12/2024

#98- On-device AI with SmolLM. 07/11/2024

#97- Brazilian grammarly with Felipe from Clarice AI. 31/10/2024

#96- Maritaca AI, the brazilian LLM company. 24/10/2024

#95- Why Chain of Thought works? 26/09/2024

#94- OpenAI o1 19/09/2024

#93- Different types of AI. 12/09/2024

#92- Llama3 benchmarks, vision and speech. 22/08/2024

#91- Llama 3 training. 15/08/2024

#90- Llama 3 paper overview. 25/07/2024

