ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.12530
  4. Cited By
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

18 February 2025
Xinyi Yang
Liang Zeng
Heng Dong
C. Yu
X. Wu
H. Yang
Yu Wang
Milind Tambe
Tonghan Wang
ArXivPDFHTML

Papers citing "Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards"

2 / 2 papers shown
Title
Model-Agnostic Policy Explanations with Large Language Models
Model-Agnostic Policy Explanations with Large Language Models
Zhang Xi-Jia
Yue (Sophie) Guo
Shufei Chen
Simon Stepputtis
Matthew C. Gombolay
Katia P. Sycara
Joseph Campbell
LM&Ro
LRM
52
0
0
08 Apr 2025
BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization
BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization
Tonghan Wang
Yanchen Jiang
David C. Parkes
81
0
0
24 Feb 2025
1