AI Testing and Evaluation: Reflections

21/07/2025 29 min

Episode Synopsis

In the series finale, Amanda Craig Deckard returns to examine what Microsoft has learned about testing as a governance tool. She also explores the roles of rigor, standardization, and interpretability in testing and what’s next for Microsoft’s AI governance work.

Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-reflections/

More episodes of the Microsoft Research Podcast

Ideas: Community building, machine learning, and the future of AI 01/12/2025

Ideas: More AI-resilient biosecurity with the Paraphrase Project 06/10/2025

Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education 21/08/2025

Reimagining healthcare delivery and public health with AI 07/08/2025

Navigating medical education in the era of generative AI 24/07/2025

AI Testing and Evaluation: Learnings from cybersecurity 14/07/2025

How AI will accelerate biomedical research and discovery 10/07/2025

AI Testing and Evaluation: Learnings from pharmaceuticals and medical devices 07/07/2025

AI Testing and Evaluation: Learnings from genome editing 30/06/2025

AI Testing and Evaluation: Learnings from Science and Industry 23/06/2025
