Chinese AI Model Beats GPT-4 🇨🇳 // OpenAI on iOS 18 🍎 // Data-Efficient LLMs 🤖
Episode Synopsis
SenseTime's new AI model, SenseNova 5.0, beats GPT-4 Turbo across key benchmarks, suggesting China's AI may be closer to competing with the US than previously thought.
Apple is in talks with OpenAI about integrating OpenAI's generative AI features into iOS 18, a move that could trigger a new era of AI adoption.
"Toward Inference-optimal Mixture-of-Expert Large Language Models" proposes a new scaling law for MoE-based LLMs to efficiently scale without sacrificing performance.
"How to Train Data-Efficient LLMs" investigates data-efficient pre-training approaches that can significantly reduce the amount of data needed to train LLMs.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:30 Chinese AI model bests GPT-4 Turbo
02:35 Apple Intensifies Talks With OpenAI for iPhone Generative AI Features
04:17 OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
05:33 Fake sponsor
07:55 Toward Inference-optimal Mixture-of-Expert Large Language Models
09:21 Scaling Laws For Dense Retrieval
11:01 How to Train Data-Efficient LLMs
12:50 Outro