Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval

15 October 2025

Yingchen Zhang

Main:8 Pages

5 Figures

Bibliography:1 Pages

9 Tables

Appendix:1 Pages

Abstract

Generative retrieval (GR) is an emerging paradigm that leverages large language models (LLMs) to autoregressively generate document identifiers (docids) relevant to a given query. Prior works have focused on leveraging the generative capabilities of LLMs to improve GR, while overlooking that their reasoning capabilities could likewise help. This raises a key question: Can explicit reasoning benefit GR? To investigate, we first conduct a preliminary study where an LLM is prompted to generate free-form chain-of-thought (CoT) reasoning before performing constrained docid decoding. Although this method outperforms standard GR, the generated reasoning tends to be verbose and poorly aligned with the docid space. These limitations motivate the development of a reasoning mechanism better tailored to GR.

View on arXiv

Comments on this paper