Benchmarking Domain Intelligence | Data Brew | Episode 45

24/04/2025 31 min

Episode Synopsis

In this episode, Pallavi Koppol, Research Scientist at Databricks, explores the importance of domain-specific intelligence in large language models (LLMs). She discusses how enterprises need models tailored to their unique jargon, data, and tasks rather than relying solely on general benchmarks.

Highlights include:
- Why benchmarking LLMs for domain-specific tasks is critical for enterprise AI.
- An introduction to the Databricks Intelligence Benchmarking Suite (DIBS).
- Evaluating models on real-world applications like RAG, text-to-JSON, and function calling.
- The evolving landscape of open-source vs. closed-source LLMs.
- How industry and academia can collaborate to improve AI benchmarking.

More episodes of the podcast Data Brew by Databricks

Reinforcement Fine-Tuning and the Future of Specialized AI Models 05/08/2025

SWE-bench & SWE-agent | Data Brew | Episode 44 17/04/2025

Enterprise AI: Research to Product | Data Brew | Episode 43 10/04/2025

Multimodal AI | Data Brew | Episode 42 07/04/2025

Age of Agents | Data Brew | Episode 41 27/03/2025

Reward Models | Data Brew | Episode 40 20/03/2025

Retrieval, rerankers, and RAG tips and tricks | Data Brew | Episode 39 20/02/2025

The Power of Synthetic Data | Data Brew | Episode 38 04/02/2025

Secret to Production AI: Tools & Infrastructure | Data Brew | Episode 37 22/01/2025

Mixture of Memory Experts (MoME) | Data Brew | Episode 36 10/01/2025
