Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search

Abstract

Generative retrieval (GR) has revolutionized document retrieval with the advent of large language models (LLMs), and LLM-based GR is gradually being adopted by industry. Despite its remarkable advantages and potential, LLM-based GR suffers from hallucination: in some instances it generates documents that are irrelevant to the query, which severely undermines its credibility in practical applications. We therefore propose an optimized GR framework designed to alleviate retrieval hallucination, which integrates knowledge distillation reasoning into model training and incorporates a decision agent to further improve retrieval precision. Specifically, we employ LLMs to assess and reason about GR-retrieved query-document (q-d) pairs, and then distill the reasoning data as transferred knowledge to the GR model. Moreover, we utilize a decision agent as a post-processing step that extends the GR-retrieved documents through a retrieval model and selects the most relevant ones from multiple perspectives as the final generative retrieval result. Extensive offline experiments on real-world datasets and online A/B tests on Fund Search and Insurance Search in Alipay demonstrate our framework's superiority and effectiveness in improving search quality and conversion gains.