Listen "S2E5: Crosswords"
Episode Synopsis
This episode’s guests are George Ho and Saul Pwanson, whose crossword datasets were featured in the Data Is Plural newsletter in 2021 and 2016, respectively. Saul and George explain the difference between American-style and cryptic crosswords, how they collected their datasets, and what they learned along the way.Relevant and mentioned links:Saul’s xd archive, grid comparison, and .xd file formatFiveThirtyEight’s coverage of the plagiarism scandal Saul’s analysis unearthed and Saul’s csv,conf talk, “How a File Format Led to a Crossword Scandal”George’s dataset of cryptic crossword cluesGeorge’s datasheet for the datasetTimnit Gebru et al.’s “Datasheets for Datasets”XWord Info, from which Saul gathered New York Times crossword dataDavid Steinberg’s Pre-Shortzian Puzzle Project, with “litzing” contributions from Barry Haldiman and othersTheme music by Nikhil Sonnad.
More episodes of the podcast Data Is Plural
S2E4: Canadian Wildfires
13/12/2023
S2E3: Missing Migrants
06/12/2023
S2E2: Income Patterns
29/11/2023
S2E1: Jeans Pockets
22/11/2023
S1E1: Giant Pumpkins
29/03/2023
S1E2: The London Stage
29/03/2023
S1E3: Roadkill in the Andes
29/03/2023
S1E4: Pathogen Genetics
29/03/2023
S1E5: Atari Emails
29/03/2023
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.