Ep 24: The Future of Speech Recognition: How Transformer is Transforming the Game

22/04/2024 25 min


Episode Synopsis

Summary:
Exploring ASR Technologies: Dive into the world of Automatic Speech Recognition (ASR) and its evolving landscape.
Comparative Analysis: Unpack the differences and relative advantages of wav2vec 2.0 and OpenAI's Whisper models in speech processing.
Technical Insights: Understand the role of waveform encoding and Hidden Markov Models (HMMs) in enhancing speech recognition systems.
Tune in to discover how these technologies are changing speech recognition, with a focus on what matters to AI professionals and industry experts. Subscribe now to keep up with the latest advancements!

AI News:

Introducing Meta Llama 3: The most capable openly available LLM to date
https://www.microsoft.com/en-us/research/project/vasa-1/
AI System Can Detect Parkinson's Disease from Brain Waves, Study Finds
[2404.10981] A Survey on Retrieval-Augmented Text Generation for Large Language Models
[2404.11584] The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey

References for main topic:

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations (https://arxiv.org/abs/2006.11477v3)
Robust Speech Recognition via Large-Scale Weak Supervision


More episodes of the podcast Machine Learning Made Simple