Listen "Efficient Compression of Large Language Models using LLM-Pruner"
Episode Synopsis
This episode discusses a paper that introduces LLM-Pruner, a task-agnostic framework for compressing Large Language Models (LLMs) through structural pruning. The framework operates in three stages: Discovery, Estimation, and Recovery, enabling efficient compression with minimal loss in model performance.
LLM-Pruner identifies coupled, non-critical structures in the model, removes them based on gradient information, and then uses LoRA (Low-Rank Adaptation) as a fast post-training step to recover performance, avoiding task-specific retraining of the full model. The framework maintains most of the original model's performance even with up to 20% of parameters pruned.
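To make the Estimation and Recovery stages concrete, here is a minimal PyTorch sketch, not the paper's implementation: the toy two-layer MLP, the layer sizes, the `LoRALinear` class, and the rank are illustrative assumptions, and the coupled groups are hard-coded rather than discovered via the dependency graph the paper builds over full transformer blocks.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Discovery (hard-coded in this toy): hidden unit i couples row i of fc1
# with column i of fc2. The real framework finds such coupled groups
# automatically by tracing dependencies through the network.
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 16))
fc1, fc2 = model[0], model[2]

# Estimation: first-order Taylor importance |gradient * weight|,
# aggregated over every parameter in a coupled group.
x = torch.randn(8, 16)
model(x).pow(2).mean().backward()  # stand-in for the language-modeling loss
importance = (
    (fc1.weight.grad * fc1.weight).abs().sum(dim=1)    # rows of fc1
    + (fc2.weight.grad * fc2.weight).abs().sum(dim=0)  # columns of fc2
)

# Prune the 20% least important hidden units (structural: whole units,
# not individual weights, so the pruned model stays dense and fast).
keep = importance.argsort(descending=True)[: int(32 * 0.8)].sort().values
fc1_pruned = nn.Linear(16, len(keep))
fc2_pruned = nn.Linear(len(keep), 16)
with torch.no_grad():
    fc1_pruned.weight.copy_(fc1.weight[keep])
    fc1_pruned.bias.copy_(fc1.bias[keep])
    fc2_pruned.weight.copy_(fc2.weight[:, keep])
    fc2_pruned.bias.copy_(fc2.bias)

# Recovery: a LoRA-style adapter. The frozen pruned weight gets a
# trainable low-rank update B @ A, tuned briefly to regain performance.
class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, rank: int = 4):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only the adapter is trained
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))

    def forward(self, x):
        return self.base(x) + x @ self.A.T @ self.B.T

pruned = nn.Sequential(LoRALinear(fc1_pruned), nn.ReLU(), LoRALinear(fc2_pruned))
print(pruned(x).shape)  # torch.Size([8, 16])
```

Note that `B` is zero-initialized, so the adapter starts as a no-op and recovery fine-tuning begins exactly from the pruned model's behavior; only the small `A` and `B` matrices receive gradients, which is what keeps the recovery stage cheap.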
Read full paper: https://arxiv.org/abs/2305.11627
Tags: Artificial Intelligence, Natural Language Processing, Model Compression