Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.04687
Cited By
Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation
25 November 2020
Thibault Cordier
Tanguy Urvoy
L. Rojas-Barahona
F. Lefèvre
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation"
2 / 2 papers shown
Title
What Does The User Want? Information Gain for Hierarchical Dialogue Policy Optimisation
Christian Geishauser
Songbo Hu
Hsien-Chin Lin
Nurul Lubis
Michael Heck
Shutong Feng
Carel van Niekerk
Milica Gavsić
OffRL
15
3
0
15 Sep 2021
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
1