NAACL 2025 Tutorial: Adaptation of Large Language Models

Abstract

This tutorial on adaptation of LLMs is designed to address the growing demand for models that go beyond the static capabilities of generic LLMs by providing an overview of dynamic, domain-specific, and task-adaptive LLM adaptation techniques. While general LLMs have demonstrated strong generalization across a variety of tasks, they often struggle to perform well in specialized domains such as finance, healthcare, and code generation for underrepresented languages. Additionally, their static nature limits their ability to evolve with the changing world, and they are often extremely large, making them impractical and costly to deploy at scale. As a result, the adaptation of LLMs has drawn much attention since the birth of LLMs and is of core importance both for industry, which focuses on serving its targeted users, and for academia, which can greatly benefit from small but powerful LLMs. To address this gap, this tutorial aims to provide an overview of LLM adaptation techniques. We start with an introduction to LLM adaptation from both the data perspective and the model perspective. We then emphasize how the evaluation metrics and benchmarks for adaptation differ from those used for other techniques. After establishing the problems, we explore various adaptation techniques, which we categorize into two main families. The first is parametric knowledge adaptation, which focuses on updating the parametric knowledge within LLMs. We will also discuss real-time adaptation techniques, including model editing, which allows LLMs to be updated dynamically in production environments. The second is semi-parametric knowledge adaptation, where the goal is to update LLM parameters to better leverage external knowledge or tools through techniques like retrieval-augmented generation (RAG) and agent-based systems.