Abstract
Large Language Models (LLMs) excel at generating creative narratives butstruggle with long-term coherence and emotional consistency in complex stories.To address this, we propose SCORE (Story Coherence and Retrieval Enhancement),a framework integrating three components: 1) Dynamic State Tracking (monitoringobjects/characters via symbolic logic), 2) Context-Aware Summarization(hierarchical episode summaries for temporal progression), and 3) HybridRetrieval (combining TF-IDF keyword relevance with cosine similarity-basedsemantic embeddings). The system employs a temporally-alignedRetrieval-Augmented Generation (RAG) pipeline to validate contextualconsistency. Evaluations show SCORE achieves 23.6% higher coherence (NCI-2.0benchmark), 89.7% emotional consistency (EASM metric), and 41.8% fewerhallucinations versus baseline GPT models. Its modular design supportsincremental knowledge graph construction for persistent story memory andmulti-LLM backend compatibility, offering an explainable solution forindustrial-scale narrative systems requiring long-term consistency.