Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

18 February 2025

Papers citing "Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards"

2 / 2 papers shown

Title
Model-Agnostic Policy Explanations with Large Language Models Zhang Xi-Jia Yue (Sophie) Guo Shufei Chen Simon Stepputtis Matthew C. Gombolay Katia P. Sycara Joseph Campbell LM&Ro LRM 52 0 0 08 Apr 2025
BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization Tonghan Wang Yanchen Jiang David C. Parkes 81 0 0 24 Feb 2025