Abstract
Offsite-tuning is a privacy-preserving method for tuning large language models (LLMs) by sharing a lossy compressed emulator from the LLM owners with data owners for downstream task tuning. This approach protects the privacy of both the model and data owners. However, current offsite tuning methods often suffer from adaptation degradation, high computational costs, and limited protection strength due to uniformly dropping LLM layers or relying on expensive knowledge distillation. To address these issues, we propose ScaleOT, a novel privacy-utility-scalable offsite-tuning framework that effectively balances privacy and utility. ScaleOT introduces a novel layerwise lossy compression algorithm that uses reinforcement learning to obtain the importance of each layer. It employs lightweight networks, termed harmonizers, to replace the raw LLM layers. By combining important original LLM layers and harmonizers in different ratios, ScaleOT generates emulators tailored for optimal performance with various model scales for enhanced privacy protection. Additionally, we present a rank reduction method to further compress the original LLM layers, significantly enhancing privacy with negligible impact on utility. Comprehensive experiments show that ScaleOT can achieve nearly lossless offsite tuning performance compared with full fine-tuning while obtaining better model privacy.
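
As a rough illustration of how original layers and harmonizers might be combined in different ratios, the sketch below keeps the highest-scoring layers and marks the rest for replacement. It is a minimal sketch under stated assumptions, not the ScaleOT algorithm: the function name `build_emulator_plan`, the `keep_ratio` parameter, and the score values are hypothetical, and the scores are simply given here rather than learned with reinforcement learning.

```python
from typing import List, Sequence


def build_emulator_plan(importance: Sequence[float], keep_ratio: float) -> List[str]:
    """Label each layer 'original' or 'harmonizer' (hypothetical helper).

    importance: one assumed score per layer (higher means more important).
    keep_ratio: fraction of layers kept as original LLM layers, in [0, 1].
    """
    if not 0.0 <= keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in [0, 1]")
    n_layers = len(importance)
    # Number of original layers to keep (Python's round, nearest integer).
    n_keep = round(keep_ratio * n_layers)
    # Rank layer indices by score, highest first; ties go to the lower index.
    ranked = sorted(range(n_layers), key=lambda i: (-importance[i], i))
    keep = set(ranked[:n_keep])
    return ["original" if i in keep else "harmonizer" for i in range(n_layers)]


# Made-up importance scores for a six-layer model, for illustration only.
scores = [0.9, 0.2, 0.7, 0.1, 0.4, 0.8]
plan = build_emulator_plan(scores, keep_ratio=0.5)
print(plan)
```

Lowering `keep_ratio` in this sketch exposes fewer original layers in the plan, which corresponds to the privacy side of the privacy-utility trade-off described above; raising it keeps more of the original model.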