The world's largest open library dataset

01/12/2020 43 min Episodio 114

Listen "The world's largest open library dataset"

Descargar episodio Ver en sitio original

Episode Synopsis

Unsplash has released the world’s largest open library dataset, which includes 2M+ high-quality Unsplash photos, 5M keywords, and over 250M searches. They have big ideas about how the dataset might be used by ML/AI folks, and there have already been some interesting applications. In this episode, Luke and Tim discuss why they released this data and what it take to maintain a dataset of this size.Sponsors:Linode – Get $100 in free credit to get started on Linode – our cloud of choice and the home of Changelog.com. Head to linode.com/changelog OR text CHANGELOG to 474747 to get instant access to that $100 in free credit. Changelog++ – You love our content and you want to take it to the next level by showing your support. We’ll take you closer to the metal with no ads, extended episodes, outtakes, bonus content, a deep discount in our merch store (soon), and more to come. Let’s do this! LaunchDarkly – Power experimentation at any scale. Fast and reliable feature management for the modern enterprise. Featuring:Luke Chesser – Website, XTimothy Carbone – GitHub, XChris Benson – Website, GitHub, LinkedIn, XDaniel Whitenack – Website, GitHub, XShow Notes:UnsplashThe world’s largest open library dataset from UnsplashThe Unsplash dataset on GitHubUpcoming Events: Register for upcoming webinars here!

More episodes of the podcast Practical AI

2025 was the year of agents, what's coming in 2026? 09/01/2026

Beyond chatbots: Agents that tackle your SOPs 17/12/2025

The AI engineer skills gap 10/12/2025

Technical advances in document understanding 02/12/2025

Chris on AI, autonomous swarming, home automation and Rust! 26/11/2025

Beyond note-taking with Fireflies 19/11/2025

Autonomous Vehicle Research at Waymo 13/11/2025

Are we in an AI bubble? 10/11/2025

While loops with tool calls 30/10/2025

Tiny Recursive Networks 24/10/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

The world's largest open library dataset

Listen "The world's largest open library dataset"

Episode Synopsis

More episodes of the podcast Practical AI

Educational Technology: From traditional to digital

Gray Hat Hacking, those with ambiguous ethics…

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD