Hypothesis vs. Hallucinations: Property Testing AI-Generated Code

10/12/2025 1h 18min Temporada 1 Episodio 11

Listen "Hypothesis vs. Hallucinations: Property Testing AI-Generated Code"

Episode Synopsis

Large Language Models can generate code in a flash, but that code is notoriously unreliable. Traditional unit tests often can’t put enough guardrails in place to ensure correctness… even if they’re written by the LLM itself.This is where property-based testing (PBT) becomes essential.Today, we're joined by David R. MacIver, creator of the PBT library Hypothesis, and now an Antithesis employee! We discuss how to build robust feedback loops that are needed to make AI-generated code trustworthy.We'll cover why standard AI coding benchmarks are flawed, how Hypothesis makes PBT approachable, and the challenge of getting developers to think in "invariants." David also shares his perspective on the future of AI in software engineering.If you want to build a reliability backstop for your code, vibed or otherwise, stick around.