Listen "Unlocking Unstructured Data with LLMs"
Episode Synopsis
Shreya Shankar is a PhD student at UC Berkeley in the EECS department. This episode explores how Large Language Models (LLMs) are revolutionizing the processing of unstructured enterprise data like text documents and PDFs. It introduces DocETL, a framework using a MapReduce approach with LLMs for semantic extraction, thematic analysis, and summarization at scale.Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.
More episodes of the podcast The Data Exchange with Ben Lorica
Teaching AI How to Forget
15/01/2026
The Junior Data Engineer is Now an AI Agent
08/01/2026
The Truth About Agents in Production
31/12/2025
The best books we read this year 📚
24/12/2025
The Developer’s Guide to LLM Security
18/12/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.