GINGER: Grounded Information Nugget-Based Generation of Responses

  • 2025-03-23 19:10:23
  • Weronika Łajewska, Krisztian Balog

Abstract

Retrieval-augmented generation (RAG) faces challenges related to factual correctness, source attribution, and response completeness. To address them, we propose a modular pipeline for grounded response generation that operates on information nuggets: minimal, atomic units of relevant information extracted from retrieved documents. The multistage pipeline encompasses nugget detection, clustering, ranking, top cluster summarization, and fluency enhancement. It guarantees grounding in specific facts, facilitates source attribution, and ensures maximum information inclusion within length constraints. Extensive experiments on the TREC RAG'24 dataset evaluated with the AutoNuggetizer framework demonstrate that GINGER achieves state-of-the-art performance on this benchmark.
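
To make the stage order concrete, the following is a minimal toy sketch of such a pipeline, not the authors' implementation. Every name in it (Nugget, detect_nuggets, cluster_nuggets, rank_clusters, summarize_cluster, generate_response) is hypothetical, and each stage is a simple lexical stand-in: keyword matching for nugget detection, greedy Jaccard grouping for clustering, query-term overlap for ranking, and picking the shortest nugget for summarization. The top_k parameter is an assumed proxy for the length constraint, and the fluency enhancement stage is reduced to joining sentences.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Nugget:
    text: str    # a minimal unit of relevant information
    doc_id: str  # the source document, kept for attribution


def _words(text):
    return set(text.lower().split())


def detect_nuggets(docs, query_terms):
    """Toy detection: keep sentences that share a word with the query."""
    nuggets = []
    for doc_id, text in docs.items():
        for sentence in text.split("."):
            sentence = sentence.strip()
            if sentence and _words(sentence) & query_terms:
                nuggets.append(Nugget(sentence, doc_id))
    return nuggets


def cluster_nuggets(nuggets, threshold=0.5):
    """Toy clustering: greedy grouping by Jaccard word overlap
    with the first nugget of each existing cluster."""
    clusters = []
    for nugget in nuggets:
        for cluster in clusters:
            a, b = _words(nugget.text), _words(cluster[0].text)
            if len(a & b) / len(a | b) >= threshold:
                cluster.append(nugget)
                break
        else:
            clusters.append([nugget])
    return clusters


def rank_clusters(clusters, query_terms):
    """Toy ranking: query-term overlap first, cluster size as tie-breaker."""
    def score(cluster):
        return (len(_words(cluster[0].text) & query_terms), len(cluster))
    return sorted(clusters, key=score, reverse=True)


def summarize_cluster(cluster):
    """Toy summarization: the shortest nugget stands in for the cluster;
    all contributing documents are returned as sources."""
    representative = min(cluster, key=lambda n: len(n.text))
    sources = sorted({n.doc_id for n in cluster})
    return representative.text, sources


def generate_response(docs, query, top_k=2):
    """Run the stages in order; top_k is an assumed length budget."""
    query_terms = _words(query)
    nuggets = detect_nuggets(docs, query_terms)
    top = rank_clusters(cluster_nuggets(nuggets), query_terms)[:top_k]
    parts = []
    for cluster in top:
        text, sources = summarize_cluster(cluster)
        parts.append(f"{text} [{', '.join(sources)}].")
    # Fluency enhancement is omitted here; sentences are simply joined.
    return " ".join(parts)


docs = {
    "d1": "Ginger root reduces nausea. The weather is mild.",
    "d2": "Ginger root reduces nausea in adults. Ginger tea contains antioxidants.",
}
response = generate_response(docs, "ginger nausea antioxidants")
print(response)
```

Because each sentence in the output comes from one cluster and carries the identifiers of the documents that contributed nuggets to it, every statement can be traced back to its sources, which is the property the pipeline design is meant to provide.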