1711.09050
Cited By
Ethical Challenges in Data-Driven Dialogue Systems
24 November 2017
Peter Henderson
Koustuv Sinha
Nicolas Angelard-Gontier
Nan Rosemary Ke
G. Fried
Ryan J. Lowe
Joelle Pineau
Papers citing
"Ethical Challenges in Data-Driven Dialogue Systems"
37 / 37 papers shown
Building Trustworthy Multimodal AI: A Review of Fairness, Transparency, and Ethics in Vision-Language Tasks
Mohammad Saleh
Azadeh Tabatabaei
52
0
0
14 Apr 2025
From Pixels to Personas: Investigating and Modeling Self-Anthropomorphism in Human-Robot Dialogues
Yu Li
Devamanyu Hazarika
Di Jin
Julia Hirschberg
Yang Liu
28
0
0
04 Oct 2024
Undesirable Memorization in Large Language Models: A Survey
Ali Satvaty
Suzan Verberne
Fatih Turkmen
ELM
PILM
74
7
0
03 Oct 2024
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Tomer Ashuach
Martin Tutek
Yonatan Belinkov
KELM
MU
71
4
0
13 Jun 2024
The Mosaic Memory of Large Language Models
Igor Shilov
Matthieu Meeus
Yves-Alexandre de Montjoye
44
3
0
24 May 2024
Navigating LLM Ethics: Advancements, Challenges, and Future Directions
Junfeng Jiao
S. Afroogh
Yiming Xu
Connor Phillips
AILaw
62
19
0
14 May 2024
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models
Mingfeng Xue
Dayiheng Liu
Kexin Yang
Guanting Dong
Wenqiang Lei
Zheng Yuan
Chang Zhou
Jingren Zhou
LLMAG
19
2
0
25 Oct 2023
Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems
Biplav Srivastava
Kausik Lakkaraju
T. Koppel
Vignesh Narayanan
Ashish Kundu
Sachindra Joshi
29
2
0
09 Sep 2023
Reducing Sensitivity on Speaker Names for Text Generation from Dialogues
Qi Jia
Haifeng Tang
Kenny Q. Zhu
18
2
0
23 May 2023
Conversational AI-Powered Design: ChatGPT as Designer, User, and Product
A. Kocaballi
24
38
0
15 Feb 2023
Advances in Automatically Rating the Trustworthiness of Text Processing Services
Biplav Srivastava
Kausik Lakkaraju
Mariana Bernagozzi
Marco Valtorta
30
6
0
04 Feb 2023
Extracting Training Data from Diffusion Models
Nicholas Carlini
Jamie Hayes
Milad Nasr
Matthew Jagielski
Vikash Sehwag
Florian Tramèr
Borja Balle
Daphne Ippolito
Eric Wallace
DiffM
63
569
0
30 Jan 2023
On Safe and Usable Chatbots for Promoting Voter Participation
Bharath Muppasani
Vishal Pallagani
Kausik Lakkaraju
Shuge Lei
Biplav Srivastava
Brett W. Robertson
Andrea A. Hickerson
Vignesh Narayanan
16
2
0
16 Dec 2022
An Empathetic AI Coach for Self-Attachment Therapy
Lisa Alazraki
Ali Ghachem
Neophytos Polydorou
Foaad Khosmood
A. Edalat
22
9
0
17 Sep 2022
In conversation with Artificial Intelligence: aligning language models with human values
Atoosa Kasirzadeh
Iason Gabriel
21
98
0
01 Sep 2022
Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation
Prakhar Gupta
Harsh Jhamtani
Jeffrey P. Bigham
49
12
0
19 May 2022
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin P. Adewumi
F. Liwicki
Marcus Liwicki
29
15
0
02 May 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
72
2,330
0
12 Apr 2022
PanGu-Bot: Efficient Generative Dialogue Pre-training from Pre-trained Language Model
Fei Mi
Yitong Li
Yulong Zeng
Jingyan Zhou
Yasheng Wang
Chuanfei Xu
Lifeng Shang
Xin Jiang
Shiqi Zhao
Qun Liu
ALM
39
18
0
31 Mar 2022
Do Language Models Plagiarize?
Jooyoung Lee
Thai Le
Jinghui Chen
Dongwon Lee
33
74
0
15 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,953
0
04 Mar 2022
Does Entity Abstraction Help Generative Transformers Reason?
Nicolas Angelard-Gontier
Siva Reddy
C. Pal
31
5
0
05 Jan 2022
A Survey on Gender Bias in Natural Language Processing
Karolina Stańczak
Isabelle Augenstein
30
109
0
28 Dec 2021
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models
Soumya Barikeri
Anne Lauscher
Ivan Vulić
Goran Glavaš
21
178
0
07 Jun 2021
Evaluating Gender Bias in Natural Language Inference
Shanya Sharma
Manan Dey
Koustuv Sinha
20
41
0
12 May 2021
Detoxifying Language Models Risks Marginalizing Minority Voices
Albert Xu
Eshaan Pathak
Eric Wallace
Suchin Gururangan
Maarten Sap
Dan Klein
13
121
0
13 Apr 2021
Detecting and Classifying Malevolent Dialogue Responses: Taxonomy, Data and Methodology
Yangjun Zhang
Pengjie Ren
Maarten de Rijke
26
11
0
21 Aug 2020
Chat as Expected: Learning to Manipulate Black-box Neural Dialogue Models
Haochen Liu
Zhiwei Wang
Tyler Derr
Jiliang Tang
AAML
19
15
0
27 May 2020
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation
Emily Dinan
Angela Fan
Adina Williams
Jack Urbanek
Douwe Kiela
Jason Weston
24
205
0
10 Nov 2019
A Crowd-based Evaluation of Abuse Response Strategies in Conversational Agents
A. C. Curry
Verena Rieser
22
31
0
10 Sep 2019
Build it Break it Fix it for Dialogue Safety: Robustness from Adversarial Human Attack
Emily Dinan
Samuel Humeau
Bharath Chintagunta
Jason Weston
13
243
0
17 Aug 2019
A Virtual Conversational Agent for Teens with Autism: Experimental Results and Design Lessons
M. R. Ali
Zahra Razavi
A. Mamun
Raina Langevin
Benjamin Kane
Reza Rawassizadeh
Lenhart Schubert
M Ehsan Hoque
14
25
0
07 Nov 2018
The RLLChatbot: a solution to the ConvAI challenge
Nicolas Angelard-Gontier
Koustuv Sinha
Peter Henderson
Iulian Serban
Michael Noseworthy
Prasanna Parthasarathi
Joelle Pineau
OffRL
30
0
0
07 Nov 2018
Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models
Tong Niu
Joey Tianyi Zhou
AAML
21
85
0
06 Sep 2018
Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer
Cicero Nogueira dos Santos
Igor Melnyk
Inkit Padhi
22
153
0
20 May 2018
A Review of Evaluation Techniques for Social Dialogue Systems
A. C. Curry
H. Hastie
Verena Rieser
118
13
0
13 Sep 2017
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
214
1,326
0
05 Jun 2016