How Computers See
Episode Synopsis
This episode explores the two defining eras of computer vision, the field concerned with how machines learn to interpret the visual world. We start with classical computer vision, a "human-guided" approach in which experts design algorithms to detect explicit features such as edges and corners, exemplified by SIFT, SURF, and HOG. We then turn to the deep learning paradigm, notably Convolutional Neural Networks (CNNs), which are "data-driven" and learn salient features directly from massive datasets: a shift from programming to training. We discuss this philosophical change from a deductive to an inductive approach and the key trade-offs in data requirements, computational cost, and interpretability, contrasting the transparent "white box" nature of classical algorithms with the often uninterpretable "black box" of deep learning models. Finally, we look at how both paradigms show up in daily life, from SIFT-powered panorama stitching and early HOG-based pedestrian detection to CNNs driving facial recognition, autonomous vehicles, and medical image analysis. The choice between them is a strategic one, and the future is likely to be dominated by hybrid models.

Please see https://tinyurl.com/SM-S1E3
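
To make the "programming versus training" contrast a little more concrete, here is a minimal sketch of the classical side: a hand-written Sobel kernel slid over a toy image to detect a vertical edge. The names `SOBEL_X` and `filter2d` and the tiny image are illustrative assumptions, not material from the episode. A CNN layer performs the same sliding-window operation, except that the kernel values are learned from data rather than chosen by an expert.

```python
# Hand-designed horizontal-gradient kernel (Sobel). In a CNN, a kernel of the
# same shape would be learned from training data instead of written by hand.
SOBEL_X = [
    [-1, 0, 1],
    [-2, 0, 2],
    [-1, 0, 1],
]


def filter2d(image, kernel):
    """Slide a kernel over a 2D list of numbers (valid region, no padding).

    This computes cross-correlation, which is the operation CNN
    "convolution" layers apply.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            total = 0
            for i in range(kh):
                for j in range(kw):
                    total += image[r + i][c + j] * kernel[i][j]
            row.append(total)
        output.append(row)
    return output


# Toy 4x6 grayscale image: dark on the left, bright on the right.
image = [
    [0, 0, 0, 9, 9, 9],
    [0, 0, 0, 9, 9, 9],
    [0, 0, 0, 9, 9, 9],
    [0, 0, 0, 9, 9, 9],
]

edges = filter2d(image, SOBEL_X)
for row in edges:
    print(row)
```

The response is zero over the flat regions and large where the window straddles the dark-to-bright boundary, which is the "white box" property the synopsis describes: every number in the kernel has a reason a person can state.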
More episodes of the podcast Seeing Machines: A Podcast on Computer Vision by AI
S2E4: Data Augmentation
02/09/2025
S2E3: Datasets
25/08/2025
S2E2: Annotation tools
19/08/2025
S2E1: Computer Vision Libraries
13/08/2025
S1Bonus: SciFi to Reality
05/08/2025
S1E8: Computer Vision Challenges
02/08/2025
S1E7: Segmentation
26/07/2025
S1E5: Object Detection
18/07/2025
Image Classification
14/07/2025
Building Computer Vision Models
05/07/2025