Apache Beam with Kenn Knowles and Pablo Estrada

06/07/2022 38 min

Episode Synopsis

Eric Anderson (@ericmander) reunites with old colleagues Kenn Knowles (@KennKnowles) and Pablo Estrada (@polecitoem) for a conversation on Apache Beam, the open-source programming model for data processing. The trio once worked together at Google, where Beam was a turning point in the history of open source. Today, both Kenn and Pablo are members of the Beam PMC, and they join the show with the inside scoop on Beam’s past, present, and future.

In this episode, we discuss:

Transitioning Beam to the Apache Way
How “inner source” works at Google
Thoughts on the relationship between batch processing and streaming
Some ways that community “power users” have contributed to Beam
Information on Beam Summit 2022, the first onsite summit since COVID began

The first few people to register can use code BEAM_POD_INV for a discount on tickets!



Links:

Apache Beam
Apache Spark
Apache Flink
Apache Nemo
Apache Samza
Apache Crunch
MapReduce paper
MillWheel paper
FlumeJava paper
Dataflow paper
Beam Summit 2022 Website

Other episodes:

TensorFlow with Rajat Monga