Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13505
Cited By
Multi-Action Dialog Policy Learning from Logged User Feedback
27 February 2023
Shuo Zhang
Junzhou Zhao
Pinghui Wang
Tianxiang Wang
Zi Liang
Jing Tao
Y. Huang
Junlan Feng
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multi-Action Dialog Policy Learning from Logged User Feedback"
4 / 4 papers shown
Title
FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo Labeling
Bowen Zhang
Yidong Wang
Wenxin Hou
Hao Wu
Jindong Wang
Manabu Okumura
T. Shinozaki
AAML
218
861
0
15 Oct 2021
In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning
Mamshad Nayeem Rizve
Kevin Duarte
Y. S. Rawat
M. Shah
206
506
0
15 Jan 2021
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
61
18
0
09 May 2020
Learning Representations for Counterfactual Inference
Fredrik D. Johansson
Uri Shalit
David Sontag
CML
OOD
BDL
210
718
0
12 May 2016
1