Listen "OpenAI Drama Continues 😈 // Shallow Networks as Transformer Alternative 🤔 // SelfEval for Generative Model Evaluation ✅"
Episode Synopsis
The drama at OpenAI with Sam Altman trying to return as CEO and staff threatening to quit unless the board resigns. We also explore the potential of using shallow neural networks as an alternative to attention layers in transformers, and a paper that proposes a method called SelfEval for evaluating generative models. Additionally, we discuss a paper that explores the effectiveness of using shallow feed-forward networks as an alternative to the attention mechanism in the Transformer model.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:39 Sam Altman is still trying to return as OpenAI CEO
02:52 OpenAI Staff Threaten to Quit Unless Board Resigns
04:34 Large Language Models and Lost in the Middle
06:09 Fake sponsor
07:33 LLMs cannot find reasoning errors, but can correct them!
09:00 Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
10:41 SelfEval: Leveraging the discriminative nature of generative models for evaluation
12:28 Outro
Contact: [email protected]
Timestamps:
00:34 Introduction
01:39 Sam Altman is still trying to return as OpenAI CEO
02:52 OpenAI Staff Threaten to Quit Unless Board Resigns
04:34 Large Language Models and Lost in the Middle
06:09 Fake sponsor
07:33 LLMs cannot find reasoning errors, but can correct them!
09:00 Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
10:41 SelfEval: Leveraging the discriminative nature of generative models for evaluation
12:28 Outro
More episodes of the podcast GPT Reviews
OpenAI's 'Strawberry' AI 🚀 // World's Fastest AI Inference ⚡ // Photo-realistic 3D Avatars 🎨
28/08/2024
Grok-2's Speed & Accuracy 🚀 // OpenAI's Transparency Push 🗳️ // LlamaDuo for Local LLMs 🔄
27/08/2024
Amazon Cloud Chief Spicy Takes 🚀 // Zuckerberg's AI Vision 📈 // Multimodal Models for Safety 🔒
23/08/2024
Grok-2 Beta Release 🚀 // Apple's $1,000 Home Robot 🏡 // ChemVLM Breakthrough in Chemistry 🔬
15/08/2024
Gemini Live AI Assistant 📱 // OpenAI’s Coding Benchmark ✅ // LongWriter’s 10K Word Generation ✍️
14/08/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.