Listen "TIME-100 AI List 🕰️ // Open-vocabulary Vision Models 👀 // Retrieval-Augmented Generation 📚"
Episode Synopsis
The episode covers cutting-edge AI research on vision and language models, including a new pretraining methodology for open-vocabulary object detection and a physically grounded VLM for robotic manipulation tasks. The show also features two interesting papers on DSPy, a framework for working with language models and retrieval models, and Verba, an open-source initiative for retrieval-augmented generation applications. The crew discusses the TIME100 Most Influential People in AI, highlighting the significance of generative AI and the ethical questions surrounding its development.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:38 How We Chose the TIME100 Most Influential People in AI
03:16 DSPy: Programming—not prompting—Foundation Models
04:28 Verba Retrieval Augmented Generation from Weaviate
05:26 Fake sponsor
07:28 Contrastive Feature Masking Open-Vocabulary Vision Transformer
09:24 Physically Grounded Vision-Language Models for Robotic Manipulation
11:27 Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
13:40 Outro
Contact: [email protected]
Timestamps:
00:34 Introduction
01:38 How We Chose the TIME100 Most Influential People in AI
03:16 DSPy: Programming—not prompting—Foundation Models
04:28 Verba Retrieval Augmented Generation from Weaviate
05:26 Fake sponsor
07:28 Contrastive Feature Masking Open-Vocabulary Vision Transformer
09:24 Physically Grounded Vision-Language Models for Robotic Manipulation
11:27 Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
13:40 Outro
More episodes of the podcast GPT Reviews
OpenAI's 'Strawberry' AI 🚀 // World's Fastest AI Inference ⚡ // Photo-realistic 3D Avatars 🎨
28/08/2024
Grok-2's Speed & Accuracy 🚀 // OpenAI's Transparency Push 🗳️ // LlamaDuo for Local LLMs 🔄
27/08/2024
Amazon Cloud Chief Spicy Takes 🚀 // Zuckerberg's AI Vision 📈 // Multimodal Models for Safety 🔒
23/08/2024
Grok-2 Beta Release 🚀 // Apple's $1,000 Home Robot 🏡 // ChemVLM Breakthrough in Chemistry 🔬
15/08/2024
Gemini Live AI Assistant 📱 // OpenAI’s Coding Benchmark ✅ // LongWriter’s 10K Word Generation ✍️
14/08/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.