Latest episodes of the podcast Daily Paper Cast
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
04/07/2025
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers
04/07/2025
Kwai Keye-VL Technical Report
03/07/2025
Depth Anything at Any Condition
03/07/2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
02/07/2025
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
02/07/2025
SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks
02/07/2025
MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings
02/07/2025
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation
02/07/2025
Ovis-U1 Technical Report
01/07/2025