76

Multi-Source Retrieval and Reasoning for Legal Sentencing Prediction

Junjie Chen
Haitao Li
Qilei Zhang
Zhenghua Li
Ya Zhang
Quan Zhou
Cheng Luo
Yiqun Liu
Dongsheng Guo
Qingyao Ai
Main:9 Pages
4 Figures
Bibliography:2 Pages
4 Tables
Abstract

Legal judgment prediction (LJP) aims to predict judicial outcomes from case facts and typically includes law article, charge, and sentencing prediction. While recent methods perform well on the first two subtasks, legal sentencing prediction (LSP) remains difficult due to its need for fine-grained objective knowledge and flexible subjective reasoning. To address these limitations, we propose MSR2MSR^2, a framework that integrates multi-source retrieval and reasoning in LLMs with reinforcement learning. MSR2MSR^2 enables LLMs to perform multi-source retrieval based on reasoning needs and applies a process-level reward to guide intermediate subjective reasoning steps. Experiments on two real-world datasets show that MSR2MSR^2 improves both accuracy and interpretability in LSP, providing a promising step toward practical legal AI. Our code is available atthis https URL.

View on arXiv
Comments on this paper