2023-11-22 cs: "Revolutionary Tech Unleashed: Monocular Video Perception to Multi-Modal AI - Discover the Future of Visual Understanding and Language Models!"

22/11/2023

Listen "2023-11-22 cs: "Revolutionary Tech Unleashed: Monocular Video Perception to Multi-Modal AI - Discover the Future of Visual Understanding and Language Models!""

Episode Synopsis

ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12796 Title Physics guided Shape from Template Monocular Video Perception through Neural Surrogate Models
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12793 Title ShareGPT4V Improving Large Multi Modal Models with Better Captions
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12792 Title Intrinsic Image Decomposition via Ordinal Shading
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12786 Title Mechanistically analyzing the effects of fine tuning on procedurally defined tasks
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12785 Title Prompting Frameworks for Large Language Models A Survey

More episodes of the podcast Arxiv Podcast GPT Computer Science