Listen "Meta's Stock Plunge 💸 // TSMC's A16 Process 🚀 // Instruction Hierarchy Boosting LLMs 📈"
Episode Synopsis
Meta's aggressive AI investments have caused a 13% plunge in their stock, threatening to wipe out almost $163 billion from their market value.
TSMC's new A16 manufacturing process promises to outperform its predecessor, N2P, by a significant margin, with an up to 10% higher clock rate at the same voltage and a 15% - 20% lower power consumption at the same frequency and complexity.
The Instruction Hierarchy proposes a data generation method to demonstrate hierarchical instruction following behavior, which drastically increases robustness for LLMs against attacks.
SPLATE is a lightweight adaptation of the ColBERTv2 model that improves the efficiency of late interaction retrieval, particularly for running ColBERT on CPU environments.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:27 Meta’s stock plunges on ‘aggressive’ AI spending plans
02:49 TSMC unveils 1.6nm process technology with backside power delivery, rivals Intel's competing design
04:48 tiny-gpu
05:59 Fake sponsor
07:35 The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
08:43 A Reproducibility Study of PLAID
10:18 SPLATE: Sparse Late Interaction Retrieval
12:00 Outro
TSMC's new A16 manufacturing process promises to outperform its predecessor, N2P, by a significant margin, with an up to 10% higher clock rate at the same voltage and a 15% - 20% lower power consumption at the same frequency and complexity.
The Instruction Hierarchy proposes a data generation method to demonstrate hierarchical instruction following behavior, which drastically increases robustness for LLMs against attacks.
SPLATE is a lightweight adaptation of the ColBERTv2 model that improves the efficiency of late interaction retrieval, particularly for running ColBERT on CPU environments.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:27 Meta’s stock plunges on ‘aggressive’ AI spending plans
02:49 TSMC unveils 1.6nm process technology with backside power delivery, rivals Intel's competing design
04:48 tiny-gpu
05:59 Fake sponsor
07:35 The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
08:43 A Reproducibility Study of PLAID
10:18 SPLATE: Sparse Late Interaction Retrieval
12:00 Outro
More episodes of the podcast GPT Reviews
OpenAI's 'Strawberry' AI 🚀 // World's Fastest AI Inference ⚡ // Photo-realistic 3D Avatars 🎨
28/08/2024
Grok-2's Speed & Accuracy 🚀 // OpenAI's Transparency Push 🗳️ // LlamaDuo for Local LLMs 🔄
27/08/2024
Amazon Cloud Chief Spicy Takes 🚀 // Zuckerberg's AI Vision 📈 // Multimodal Models for Safety 🔒
23/08/2024
Grok-2 Beta Release 🚀 // Apple's $1,000 Home Robot 🏡 // ChemVLM Breakthrough in Chemistry 🔬
15/08/2024
Gemini Live AI Assistant 📱 // OpenAI’s Coding Benchmark ✅ // LongWriter’s 10K Word Generation ✍️
14/08/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.