Listen "Bing Chat on Mobile 📱 // Zoom's Privacy Policy Update 🔒 // AgentBench for LLMs 🚀"
Episode Synopsis
Microsoft's AI-powered Bing Chat now available on all mobile browsers, Zoom's updated privacy policy, the introduction of AgentBench for evaluating LLMs as agents, and the Flows framework for modeling complex interactions between AI systems and humans. These developments have the potential to lead to more robust and reliable AI models that can perform well in complex, real-world scenarios.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:29 Microsoft’s AI-powered Bing Chat is coming to mobile browsers
02:50 Zoom says its new AI tools aren’t stealing ownership of your content
04:30 Kubernetes Exposed: One Yaml away from Disaster
05:39 Fake sponsor
07:30 AgentBench: Evaluating LLMs as Agents
09:03 Studying Large Language Model Generalization with Influence Functions
10:31 Flows: Building Blocks of Reasoning and Collaborating AI
12:21 Outro
Contact: [email protected]
Timestamps:
00:34 Introduction
01:29 Microsoft’s AI-powered Bing Chat is coming to mobile browsers
02:50 Zoom says its new AI tools aren’t stealing ownership of your content
04:30 Kubernetes Exposed: One Yaml away from Disaster
05:39 Fake sponsor
07:30 AgentBench: Evaluating LLMs as Agents
09:03 Studying Large Language Model Generalization with Influence Functions
10:31 Flows: Building Blocks of Reasoning and Collaborating AI
12:21 Outro
More episodes of the podcast GPT Reviews
OpenAI's 'Strawberry' AI 🚀 // World's Fastest AI Inference ⚡ // Photo-realistic 3D Avatars 🎨
28/08/2024
Grok-2's Speed & Accuracy 🚀 // OpenAI's Transparency Push 🗳️ // LlamaDuo for Local LLMs 🔄
27/08/2024
Amazon Cloud Chief Spicy Takes 🚀 // Zuckerberg's AI Vision 📈 // Multimodal Models for Safety 🔒
23/08/2024
Grok-2 Beta Release 🚀 // Apple's $1,000 Home Robot 🏡 // ChemVLM Breakthrough in Chemistry 🔬
15/08/2024
Gemini Live AI Assistant 📱 // OpenAI’s Coding Benchmark ✅ // LongWriter’s 10K Word Generation ✍️
14/08/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.