Latest episodes of the podcast Daily Paper Cast
Mostrando página 71 de 73
RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models
11/11/2024
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
08/11/2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
08/11/2024
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
08/11/2024
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
08/11/2024
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
07/11/2024
Self-Consistency Preference Optimization
07/11/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.