Listen "S2E3: Datasets"
Episode Synopsis
This episode delves into the unsung heroes of the artificial intelligence revolution: the foundational datasets that taught computers to "see". We explore the evolutionary journey of computer vision through four landmark datasets: PASCAL VOC, which standardized object detection and established common benchmarks; ImageNet, whose unprecedented scale ignited the deep learning revolution and popularized transfer learning; COCO (Common Objects in Context), which advanced the field towards complex scene understanding with rich annotations like instance segmentation and keypoint detection; and Cityscapes, a critical benchmark for achieving pixel-perfect semantic understanding in dense urban environments for autonomous driving. Discover how these meticulously curated collections of images are not just passive data, but active instruments of scientific progress, defining challenges, measuring advancement, and ultimately catalyzing the innovations that power everything from self-driving cars to augmented reality and medical diagnostics in our daily lives.
More episodes of the podcast Seeing Machines: A Podcast on Computer Vision by AI
S2E4: Data Augmentation
02/09/2025
S2E2: Annotation tools
19/08/2025
S2E1: Computer Vision Libraries
13/08/2025
S1Bonus: SciFi to Reality
05/08/2025
S1E8: Computer Vision Challenges
02/08/2025
S1E7: Segmentation
26/07/2025
S1E5: Object Detection
18/07/2025
Image Classification
14/07/2025
Building Computer Vision Models
05/07/2025
How Computers See
28/06/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.