Cognition, Contracts, and Compression

06/08/2025 26 min

Listen "Cognition, Contracts, and Compression"

Descargar episodio Ver en sitio original

Episode Synopsis

Generated Google NotebookLM.Episode Description:In this episode, we explore 10 new papers advancing our understanding of how LLMs think, how agents can be trusted, and how systems can scale more efficiently:What LLMs really "know" – UCCT proposes a formal theory of cognition in LLMs, arguing intelligence is emergent and context-triggered—not intrinsic.Rethinking RAG – CoCoA and CoCoA-zero show how multi-agent collaboration improves synergy between internal model memory and retrieved context.Efficiency, by design – Efficient Agents sheds light on cost/performance trade-offs in agent systems, while Blueprint First separates logic from generation to enable deterministic workflows.Contrastive learning, upgraded – Context-Adaptive Multi-Prompt Embedding improves vision-language alignment with adaptive token prompts and diversity constraints.Inference-time teaming – CTTS scales up LLM performance via collective test-time scaling, using reward model ensembles and agent collaboration.At the edge – A new adaptive agent placement and migration framework uses LLMs and ant colony optimization to meet real-time edge constraints.Smarter chains of thought – A step entropy metric allows LLMs to prune redundant reasoning during inference, improving cost-efficiency without sacrificing accuracy.Quantization, vision-style – VLMQ brings post-training quantization to Vision-Language Models, optimizing for both modality balance and efficiency.Reliable by contract – A Design-by-Contract–inspired layer enables neurosymbolic agents to enforce input-output constraints, offering a formal basis for agent safety.From the nature of LLM cognition to practical methods for verifiable, scalable deployment, this episode highlights where theory meets engineering—and where structure enhances trust.Sources:The Unified Cognitive Consciousness Theory for Language Models (UCCT) | HTMLCoCoA: Collaborative Chain-of-Agents for Parametric-Retrieved Knowledge Synergy | HTMLEfficient Agents: Building Effective Agents While Reducing Cost | HTMLBlueprint First, Model Second: A Framework for Deterministic LLM Workflow | HTMLContext-Adaptive Multi-Prompt LLM Embedding for Vision-Language Alignment | HTMLCTTS: Collective Test-Time Scaling | HTMLAdaptive AI Agent Placement and Migration in Edge Intelligence Systems | HTMLCompressing Chain-of-Thought in LLMs via Step Entropy | HTMLVLMQ: Efficient Post-Training Quantization for Vision-Language Models | HTMLA DbC Inspired Neurosymbolic Layer for Trustworthy Agent Design | HTML

More episodes of the podcast Today in arXiv AI

Architectures, Attacks, and Autonomy 06/08/2025

Jailbreaks, Collaboration, and Cognitive Shifts 31/07/2025

Planning Agents, Emotional Bias, and Trustworthy Responses 30/07/2025

Factuality, Alignment, and Edge Efficiency 30/07/2025

Safety, Evaluation, and Reasoning 25/07/2025

The AI Frontier: Confronting Hallucinations, Deepening Reasoning, and Building Trust 24/07/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Cognition, Contracts, and Compression

Listen "Cognition, Contracts, and Compression"

Episode Synopsis

More episodes of the podcast Today in arXiv AI

Orthographic errors in Web pages

Dot COM: The Internet’s dominant TLD

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD