Episode Synopsis "Episode 1: JD Long - The Big Join: Testing pipelines to join 30 billion rows of data… quickly"
JD Long is a veteran Quantitative Risk Analyst. He builds stochastic models to predict losses during catastrophic events like hurricanes, earthquakes, or droughts. He shares his data engineering team's painful experience standing up tooling pipelines to load tens of billions of rows into multiple distributed systems for imbalanced queries. He's the perfect first guest because he covers multiple tools and techniques and is not shy about sharing his team's mistakes. He calls it "learning out loud," and I enjoyed every minute of it.