Chinese AI Model Beats GPT-4 🇨🇳 // OpenAI on iOS 18 🍎 // Data-Efficient LLMs 🤖

29/04/2024 14 min

Episode Synopsis

SenseTime's new AI model, SenseNova 5.0, beats GPT-4 Turbo across key benchmarks, suggesting China's AI may be closer to competing with the US than previously thought.
Apple is in talks with OpenAI to potentially integrate OpenAI's generative AI features into iOS 18, which could trigger a new era of AI adoption.
"Toward Inference-optimal Mixture-of-Expert Large Language Models" proposes a new scaling law for MoE-based LLMs to efficiently scale without sacrificing performance.
"How to Train Data-Efficient LLMs" investigates data-efficient approaches to pre-training LLMs that can significantly reduce the amount of training data required.
Contact:  [email protected]
Timestamps:
00:34 Introduction
01:30 Chinese AI model bests GPT-4 Turbo
02:35 Apple Intensifies Talks With OpenAI for iPhone Generative AI Features
04:17 OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
05:33 Fake sponsor
07:55 Toward Inference-optimal Mixture-of-Expert Large Language Models
09:21 Scaling Laws For Dense Retrieval
11:01 How to Train Data-Efficient LLMs
12:50 Outro

More episodes of the podcast GPT Reviews