Learning from Dialogue after Deployment: Feed Yourself, Chatbot!

16 January 2019
Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
arXiv: 1901.05415

Papers citing "Learning from Dialogue after Deployment: Feed Yourself, Chatbot!"

50 / 109 papers shown
A Mixture-of-Expert Approach to RL-based Dialogue Management
Yinlam Chow
Azamat Tulepbergenov
Ofir Nachum
Moonkyung Ryu
Mohammad Ghavamzadeh
Craig Boutilier
MoE
25
14
0
31 May 2022
Training Language Models with Language Feedback
Jérémy Scheurer
Jon Ander Campos
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
53
48
0
29 Apr 2022
Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment
Zichao Li
Prakhar Sharma
Xing Han Lù
Jackie C.K. Cheung
Siva Reddy
HAI
25
26
0
06 Apr 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
411
12,150
0
04 Mar 2022
Survey of Hallucination in Natural Language Generation
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
...
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILM
LRM
82
2,254
0
08 Feb 2022
Toward Self-learning End-to-End Task-Oriented Dialog Systems
Xiaoying Zhang
Baolin Peng
Jianfeng Gao
Helen M. Meng
27
7
0
18 Jan 2022
Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users
Matthieu Riou
Bassam Jabaian
Stéphane Huet
F. Lefèvre
OffRL
24
0
0
25 Oct 2021
Improved Goal Oriented Dialogue via Utterance Generation and Look Ahead
Hong Huang
Boaz Carmeli
Ateret Anaby-Tavor
35
2
0
24 Oct 2021
SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures
Megan Ung
Jing Xu
Y-Lan Boureau
19
47
0
14 Oct 2021
Topic-time Heatmaps for Human-in-the-loop Topic Detection and Tracking
Doug Beeferman
Hang Jiang
15
1
0
12 Oct 2021
Deciding Whether to Ask Clarifying Questions in Large-Scale Spoken Language Understanding
Joo-Kyung Kim
Guoyin Wang
Sungjin Lee
Young-Bum Kim
14
9
0
25 Sep 2021
Recursively Summarizing Books with Human Feedback
Jeff Wu
Long Ouyang
Daniel M. Ziegler
Nisan Stiennon
Ryan J. Lowe
Jan Leike
Paul Christiano
ALM
40
296
0
22 Sep 2021
Building and Evaluating Open-Domain Dialogue Corpora with Clarifying Questions
Mohammad Aliannejadi
Julia Kiseleva
A. Chuklin
Jeffrey Stephen Dalton
Andrey Kravchenko
79
97
0
13 Sep 2021
A Survey of Human-in-the-loop for Machine Learning
Xingjiao Wu
Luwei Xiao
Yixuan Sun
Junhang Zhang
Tianlong Ma
Liangbo He
SyDa
46
507
0
02 Aug 2021
Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling
Emily Dinan
Gavin Abercrombie
A. S. Bergman
Shannon L. Spruit
Dirk Hovy
Y-Lan Boureau
Verena Rieser
43
105
0
07 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations
AI Redefined
S. Gottipati
Sagar Kurandwad
Clodéric Mars
Gregory Szriftgiser
François Chabot
29
8
0
21 Jun 2021
Software-Based Dialogue Systems: Survey, Taxonomy and Challenges
Quim Motger
Xavier Franch
Jordi Marco
29
40
0
21 Jun 2021
Grounding 'Grounding' in NLP
Khyathi Raghavi Chandu
Yonatan Bisk
A. Black
30
51
0
04 Jun 2021
HERALD: An Annotation Efficient Method to Detect User Disengagement in Social Conversations
Weixin Liang
Kai-Hui Liang
Zhou Yu
45
15
0
01 Jun 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Min Zhang
63
270
0
10 May 2021
Dynabench: Rethinking Benchmarking in NLP
Douwe Kiela
Max Bartolo
Yixin Nie
Divyansh Kaushik
Atticus Geiger
...
Pontus Stenetorp
Robin Jia
Joey Tianyi Zhou
Christopher Potts
Adina Williams
24
392
0
07 Apr 2021
Putting Humans in the Natural Language Processing Loop: A Survey
Zijie J. Wang
Dongjin Choi
Shenyu Xu
Diyi Yang
LM&MA
20
72
0
06 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
206
27,929
0
26 Feb 2021
Evaluate On-the-job Learning Dialogue Systems and a Case Study for Natural Language Understanding
Mathilde Veron
S. Rosset
Olivier Galibert
Guillaume Bernard
21
3
0
26 Feb 2021
Studying Catastrophic Forgetting in Neural Ranking Models
Jesús Lovón-Melgarejo
Laure Soulier
K. Pinel-Sauvagnat
L. Tamine
CLL
47
13
0
18 Jan 2021
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
Julia Kreutzer
Stefan Riezler
Carolin (Haas) Lawrence
RALM
OffRL
13
15
0
04 Nov 2020
Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted Learning
Jon Ander Campos
Kyunghyun Cho
Arantxa Otegi
Aitor Soroa Etxabe
Gorka Azkune
Eneko Agirre
20
6
0
01 Nov 2020
Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback
Makesh Narsimhan Sreedhar
Kun Ni
Siva Reddy
AAML
24
2
0
14 Oct 2020
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
40
93
0
12 Oct 2020
Lifelong Learning Dialogue Systems: Chatbots that Self-Learn On the Job
Bing-Quan Liu
Sahisnu Mazumder
14
4
0
22 Sep 2020
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
80
2,002
0
02 Sep 2020
Deploying Lifelong Open-Domain Dialogue Learning
Kurt Shuster
Jack Urbanek
Emily Dinan
Arthur Szlam
Jason Weston
24
22
0
18 Aug 2020
Simulating the Effects of Social Presence on Trust, Privacy Concerns & Usage Intentions in Automated Bots for Finance
Magdalene Ng
Kovila P. L. Coopamootoo
Ehsan Toreini
Mhairi Aitken
Karen Elliott
Aad van Moorsel
6
40
0
27 Jun 2020
Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions
Stephen Roller
Y-Lan Boureau
Jason Weston
Antoine Bordes
Emily Dinan
...
Kurt Shuster
Eric Michael Smith
Arthur Szlam
Jack Urbanek
Mary Williamson
LLMAG
AI4CE
30
51
0
22 Jun 2020
Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges
M. Eskénazi
Tiancheng Zhao
LLMAG
AI4TS
AI4CE
36
9
0
10 Jun 2020
Offline and Online Satisfaction Prediction in Open-Domain Conversational Systems
J. Choi
Ali Ahmadvand
Eugene Agichtein
OffRL
19
28
0
02 Jun 2020
Quantifying the Effects of Prosody Modulation on User Engagement and Satisfaction in Conversational Systems
J. Choi
Eugene Agichtein
21
3
0
02 Jun 2020
Predict-then-Decide: A Predictive Approach for Wait or Answer Task in Dialogue Systems
Zehao Lin
Shaobo Cui
Guodun Li
Xiaoming Kang
Feng Ji
Feng-Lin Li
Zhongzhou Zhao
Haiqing Chen
Yin Zhang
34
1
0
27 May 2020
Speak to your Parser: Interactive Text-to-SQL with Natural Language Feedback
Ahmed Elgohary
Saghar Hosseini
Ahmed Hassan Awadallah
23
67
0
05 May 2020
An Imitation Game for Learning Semantic Parsers from User Interaction
Ziyu Yao
Yiqi Tang
Wen-tau Yih
Huan Sun
Yu-Chuan Su
30
34
0
02 May 2020
Recipes for building an open-domain chatbot
Stephen Roller
Emily Dinan
Naman Goyal
Da Ju
Mary Williamson
...
Myle Ott
Kurt Shuster
Eric Michael Smith
Y-Lan Boureau
Jason Weston
ALM
29
997
0
28 Apr 2020
The Gutenberg Dialogue Dataset
Richard Csaky
Gábor Recski
22
14
0
27 Apr 2020
XPersona: Evaluating Multilingual Personalized Chatbot
Zhaojiang Lin
Zihan Liu
Genta Indra Winata
Samuel Cahyawijaya
Andrea Madotto
Yejin Bang
Etsuko Ishii
Pascale Fung
50
57
0
17 Mar 2020
Pseudo Labeling and Negative Feedback Learning for Large-scale Multi-label Domain Classification
Joo-Kyung Kim
Young-Bum Kim
19
11
0
08 Mar 2020
Attention over Parameters for Dialogue Systems
Andrea Madotto
Zhaojiang Lin
Chien-Sheng Wu
Jamin Shin
Pascale Fung
30
20
0
07 Jan 2020
The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents
Kurt Shuster
Da Ju
Stephen Roller
Emily Dinan
Y-Lan Boureau
Jason Weston
32
81
0
09 Nov 2019
ALOHA: Artificial Learning of Human Attributes for Dialogue Agents
Aaron W. Li
Veronica Jiang
Steven Y. Feng
Julia Sprague
Wei Zhou
Jesse Hoey
17
27
0
18 Oct 2019
Towards a Metric for Automated Conversational Dialogue System Evaluation and Improvement
Jan Deriu
Mark Cieliebak
ELM
8
7
0
26 Sep 2019
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
301
1,620
0
18 Sep 2019
Hierarchical Reinforcement Learning for Open-Domain Dialog
Abdelrhman Saleh
Natasha Jaques
Asma Ghandeharioun
J. Shen
Rosalind W. Picard
OffRL
19
59
0
17 Sep 2019