Inference, Guardrails, and Observability for LLMs with Jonathan Cohen

08/11/2024 53 min Temporada 1 Episodio 11
Inference, Guardrails, and Observability for LLMs with Jonathan Cohen

Listen "Inference, Guardrails, and Observability for LLMs with Jonathan Cohen"

Episode Synopsis

In this episode of AI Explained, we are joined by Jonathan Cohen, VP of Applied Research at NVIDIA. 
We will explore the intricacies of NVIDIA's NeMo platform and its components like NeMo Guardrails and NIMS. Jonathan explains how these tools help in deploying and managing AI models with a focus on observability, security, and efficiency. They also explore topics such as the evolving role of AI agents, the importance of guardrails in maintaining responsible AI, and real-world examples of successful AI deployments in enterprises like Amdocs. Listeners will gain insights into NVIDIA's AI strategy and the practical aspects of deploying large language models in various industries.