108: PySpark - Jonathan Rioux
Episode Synopsis
Apache Spark is a unified analytics engine for large-scale data processing. PySpark combines the Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task.

Jonathan Rioux, author of "PySpark in Action", joins the show and gives us a great introduction to Spark and PySpark to help you decide how to get started and whether Spark and PySpark are right for you.

Special Guest: Jonathan Rioux.

Links:
PySpark in Action
Spark
PySpark documentation
Joel Grus, livecoding