ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.00737
  4. Cited By
Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient
v1v2v3v4v5 (latest)

Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient

Spoken Language Technology Workshop (SLT), 2018
2 July 2018
Rui Zhao
Volker Tresp
    LLMAG
ArXiv (abs)PDFHTML

Papers citing "Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient"

11 / 11 papers shown
Title
Spot the Difference: A Cooperative Object-Referring Game in
  Non-Perfectly Co-Observable Scene
Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Duo Zheng
Fandong Meng
Q. Si
Hairun Fan
Zipeng Xu
Jie Zhou
Fangxiang Feng
Xiaojie Wang
154
0
0
16 Mar 2022
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning
  and Augmented Guesser
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented GuesserConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Duo Zheng
Zipeng Xu
Fandong Meng
Caixia Yuan
Jiaan Wang
Jie Zhou
94
13
0
06 Sep 2021
Modeling Explicit Concerning States for Reinforcement Learning in Visual
  Dialogue
Modeling Explicit Concerning States for Reinforcement Learning in Visual Dialogue
Zipeng Xu
Fandong Meng
Caixia Yuan
Duo Zheng
Chenxu Lv
Jie Zhou
OffRL
149
6
0
12 Jul 2021
Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue
Answer-Driven Visual State Estimator for Goal-Oriented Visual DialogueACM Multimedia (ACM MM), 2020
Zipeng Xu
Fangxiang Feng
Xiaojie Wang
Yushu Yang
Huixing Jiang
Zhongyuan Ouyang
141
7
0
01 Oct 2020
Learning Individualized Treatment Rules with Estimated Translated
  Inverse Propensity Score
Learning Individualized Treatment Rules with Estimated Translated Inverse Propensity Score
Zhiliang Wu
Yinchong Yang
Yunpu Ma
Yushan Liu
Rui Zhao
Michael Moor
Volker Tresp
97
1
0
02 Jul 2020
Mutual Information-based State-Control for Intrinsically Motivated
  Reinforcement Learning
Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning
Rui Zhao
Yang Gao
Pieter Abbeel
Volker Tresp
Wenyuan Xu
SSL
146
4
0
05 Feb 2020
What Should I Ask? Using Conversationally Informative Rewards for
  Goal-Oriented Visual Dialog
What Should I Ask? Using Conversationally Informative Rewards for Goal-Oriented Visual DialogAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Pushkar Shukla
Carlos E. L. Elmadjian
Richika Sharan
Vivek Kulkarni
Matthew Turk
William Yang Wang
164
34
0
28 Jul 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Maximum Entropy-Regularized Multi-Goal Reinforcement LearningInternational Conference on Machine Learning (ICML), 2019
Rui Zhao
Xudong Sun
Volker Tresp
215
90
0
21 May 2019
Curiosity-Driven Experience Prioritization via Density Estimation
Curiosity-Driven Experience Prioritization via Density Estimation
Rui Zhao
Volker Tresp
344
59
0
20 Feb 2019
Efficient Dialog Policy Learning via Positive Memory Retention
Efficient Dialog Policy Learning via Positive Memory Retention
Rui Zhao
Volker Tresp
168
10
0
02 Oct 2018
Energy-Based Hindsight Experience Prioritization
Energy-Based Hindsight Experience Prioritization
Rui Zhao
Volker Tresp
376
76
0
02 Oct 2018
1