I-Con: A Unifying Framework for Representation Learning

24/04/2025 21 min Episodio 710
I-Con: A Unifying Framework for Representation Learning

Listen "I-Con: A Unifying Framework for Representation Learning"

Episode Synopsis



🤗 Upvotes: 24 | cs.LG, cs.AI, cs.CV, cs.IT, math.IT

Authors:
Shaden Alshammari, John Hershey, Axel Feldmann, William T. Freeman, Mark Hamilton

Title:
I-Con: A Unifying Framework for Representation Learning

Arxiv:
http://arxiv.org/abs/2504.16929v1

Abstract:
As the field of representation learning grows, there has been a proliferation of different loss functions to solve different classes of problems. We introduce a single information-theoretic equation that generalizes a large collection of modern loss functions in machine learning. In particular, we introduce a framework that shows that several broad classes of machine learning methods are precisely minimizing an integrated KL divergence between two conditional distributions: the supervisory and learned representations. This viewpoint exposes a hidden information geometry underlying clustering, spectral methods, dimensionality reduction, contrastive learning, and supervised learning. This framework enables the development of new loss functions by combining successful techniques from across the literature. We not only present a wide array of proofs, connecting over 23 different approaches, but we also leverage these theoretical results to create state-of-the-art unsupervised image classifiers that achieve a +8% improvement over the prior state-of-the-art on unsupervised classification on ImageNet-1K. We also demonstrate that I-Con can be used to derive principled debiasing methods which improve contrastive representation learners.

More episodes of the podcast Daily Paper Cast