CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation

Abstract

Large language models (LLMs) have revolutionized natural language processing(NLP), particularly through Retrieval-Augmented Generation (RAG), whichenhances LLM capabilities by integrating external knowledge. However,traditional RAG systems face critical limitations, including disruptedcontextual integrity due to text chunking, and over-reliance on semanticsimilarity for retrieval. To address these issues, we propose CausalRAG, anovel framework that incorporates causal graphs into the retrieval process. Byconstructing and tracing causal relationships, CausalRAG preserves contextualcontinuity and improves retrieval precision, leading to more accurate andinterpretable responses. We evaluate CausalRAG against regular RAG andgraph-based RAG approaches, demonstrating its superiority across severalmetrics. Our findings suggest that grounding retrieval in causal reasoningprovides a promising approach to knowledge-intensive tasks.