Listen "CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation"
Episode Synopsis
CORAL, a novel benchmark dataset for evaluating Retrieval-Augmented Generation (RAG) systems in a multi-turn conversational setting. The authors highlight the limitations of existing datasets in assessing conversational RAG and detail CORAL's unique features, including open-domain coverage, knowledge intensity, free-form responses, topic shifts, and citation labeling. They explain how CORAL is derived from Wikipedia, automatically converting its content into conversational formats, and outline the three core tasks it supports: conversational passage retrieval, response generation, and citation labeling. The authors present a unified framework for evaluating conversational RAG methods and report on experiments conducted on CORAL, showcasing the performance of different conversational search and generation models.
More episodes of the podcast Artificial Discourse
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices
19/11/2024
A Survey of Small Language Models
12/11/2024
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
11/11/2024
The Llama 3 Herd of Models
10/11/2024
Kolmogorov-Arnold Network (KAN)
09/11/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.