Twitter Will Train on Your Data 🏋️ // Teaching with AI 🧑‍🏫 // LLMs for Speech and Language 🗣️

05/09/2023 15 min

Listen "Twitter Will Train on Your Data 🏋️ // Teaching with AI 🧑‍🏫 // LLMs for Speech and Language 🗣️"

Episode Synopsis

Twitter's updated privacy policy and how they plan to use public data to train their AI models. We also dive into OpenAI's Guide to Teaching with AI and explore the potential benefits and limitations of using AI in education. Additionally, we highlight some cutting-edge research papers on large language and speech models, a unified speech tokenizer, and a multimodal wine dataset. And, for a bit of fun, we have an entertaining ad for SplashTech's SuperSoak water gun.
Contact:  [email protected]
Timestamps:
00:34 Introduction
01:41 Twitter’s privacy policy confirms it will use public data to train AI models
02:56 OpenAI's Guide to Teaching with AI
04:33 Introducing Refact Code LLM: 1.6B State-of-the-Art LLM for Code that Reaches 32% HumanEval
05:48 Fake sponsor
08:20 LLaSM: Large Language and Speech Model
09:50 SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models
11:52 Learning to Taste: A Multimodal Wine Dataset
13:36 Outro

More episodes of the podcast GPT Reviews