Abstract
State-of-the-art cross-encoders can be fine-tuned to be highly effective inpassage re-ranking. The typical fine-tuning process of cross-encoders asre-rankers requires large amounts of manually labelled data, a contrastivelearning objective, and a set of heuristically sampled negatives. Analternative recent approach for fine-tuning instead involves teaching the modelto mimic the rankings of a highly effective large language model using adistillation objective. These fine-tuning strategies can be applied eitherindividually, or in sequence. In this work, we systematically investigate theeffectiveness of point-wise cross-encoders when fine-tuned independently in asingle stage, or sequentially in two stages. Our experiments show that theeffectiveness of point-wise cross-encoders fine-tuned using contrastivelearning is indeed on par with that of models fine-tuned with multi-stageapproaches. Code is available for reproduction athttps://github.com/fpezzuti/multistage-finetuning.