Latest episodes of the podcast Daily Paper Cast
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
15/04/2025
FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding
15/04/2025
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation
14/04/2025
Kimi-VL Technical Report
11/04/2025
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing
11/04/2025
DDT: Decoupled Diffusion Transformer
10/04/2025