ResearchTrend.AI


arXiv:2302.13505
Multi-Action Dialog Policy Learning from Logged User Feedback


27 February 2023
Shuo Zhang
Junzhou Zhao
Pinghui Wang
Tianxiang Wang
Zi Liang
Jing Tao
Y. Huang
Junlan Feng
    OffRL

Papers citing "Multi-Action Dialog Policy Learning from Logged User Feedback"

4 / 4 papers shown
Title
FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo Labeling
Bowen Zhang
Yidong Wang
Wenxin Hou
Hao Wu
Jindong Wang
Manabu Okumura
T. Shinozaki
AAML
218
861
0
15 Oct 2021
In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning
Mamshad Nayeem Rizve
Kevin Duarte
Y. S. Rawat
M. Shah
206
506
0
15 Jan 2021
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
61
18
0
09 May 2020
Learning Representations for Counterfactual Inference
Fredrik D. Johansson
Uri Shalit
David Sontag
CML
OOD
BDL
210
718
0
12 May 2016