Latest episodes of the podcast Mechanical Dreams
Mostrando página 3 de 5
Multi-Token Attention
03/04/2025
From Style to Facts
02/04/2025
Compute Optimal Scaling of Skills
21/03/2025
Predictive Data Selection
15/03/2025
Continual Pre-training of MoEs
12/03/2025
s1 - Simple test-time scaling
06/03/2025
Phi 4 Multimodal Instruct
04/03/2025
Claude 3.7 Sonnet System Card
24/02/2025
Over-Tokenized Transformer
29/01/2025
From Tokens to Words
14/01/2025
DeepSeek V3
07/01/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.