Hypothesis vs. Hallucinations: Property Testing AI-Generated Code

10/12/2025 1h 18min Temporada 1 Episodio 11

Listen "Hypothesis vs. Hallucinations: Property Testing AI-Generated Code"

Descargar episodio Ver en sitio original

Episode Synopsis

Large Language Models can generate code in a flash, but that code is notoriously unreliable. Traditional unit tests often can’t put enough guardrails in place to ensure correctness… even if they’re written by the LLM itself.This is where property-based testing (PBT) becomes essential.Today, we're joined by David R. MacIver, creator of the PBT library Hypothesis, and now an Antithesis employee! We discuss how to build robust feedback loops that are needed to make AI-generated code trustworthy.We'll cover why standard AI coding benchmarks are flawed, how Hypothesis makes PBT approachable, and the challenge of getting developers to think in "invariants." David also shares his perspective on the future of AI in software engineering.If you want to build a reliability backstop for your code, vibed or otherwise, stick around.

More episodes of the podcast The BugBash Podcast

From the Lab to Production: Making Cutting-Edge Testing Practical 26/11/2025

Ergonomics, reliability, durability 12/11/2025

No actually, you can property test your UI 30/10/2025

Slow down to go fast: TDD in the age of AI with Clare Sudbery 15/10/2025

Fixing five "two-year" bugs per day 01/10/2025

No really, some bugs aren’t real 18/09/2025

Every map is wrong, but we made one anyway 03/09/2025

Fail loudly, fail fast, fail in production 20/08/2025

Scaling Correctness: Marc Brooker on a Decade of Formal Methods at AWS 06/08/2025

FoundationDB: From Idea to Apple Acquisition 23/07/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Hypothesis vs. Hallucinations: Property Testing AI-Generated Code

Listen "Hypothesis vs. Hallucinations: Property Testing AI-Generated Code"

Episode Synopsis

More episodes of the podcast The BugBash Podcast

7 Advices to Prevent Identity Theft

Increase the rate of email delivery

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD