2306.00374
Cited By
CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation
1 June 2023
Rahul Madhavan
Rishabh Garg
Kahini Wadhawan
S. Mehta
Papers citing
"CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation"
10 / 10 papers shown
Title
Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models
Ruta Binkyte
Ivaxi Sheth
Zhijing Jin
Mohammad Havaei
Bernhard Schölkopf
Mario Fritz
134
0
0
28 Feb 2025
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue
Shixuan Fan
Wei Wei
Wendi Li
Xian-Ling Mao
Wenfeng Xie
Dangyang Chen
98
1
0
04 Jun 2024
Large Language Models and Causal Inference in Collaboration: A Survey
Xiaoyu Liu
Paiheng Xu
Junda Wu
Jiaxin Yuan
Yifan Yang
...
Haoliang Wang
Tong Yu
Julian McAuley
Wei Ai
Furong Huang
ELM
LRM
77
34
0
14 Mar 2024
Causal ATE Mitigates Unintended Bias in Controlled Text Generation
Rahul Madhavan
Kahini Wadhawan
43
0
0
19 Nov 2023
Challenges in Detoxifying Language Models
Johannes Welbl
Amelia Glaese
J. Uesato
Sumanth Dathathri
John F. J. Mellor
Lisa Anne Hendricks
Kirsty Anderson
Pushmeet Kohli
Ben Coppin
Po-Sen Huang
LM&MA
250
193
0
15 Sep 2021
Towards generalisable hate speech detection: a review on obstacles and solutions
Wenjie Yin
A. Zubiaga
117
164
0
17 Feb 2021
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
290
1,815
0
14 Dec 2020
Exploring Controllable Text Generation Techniques
Shrimai Prabhumoye
A. Black
Ruslan Salakhutdinov
AI4CE
158
84
0
04 May 2020
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
280
1,595
0
18 Sep 2019
The Woman Worked as a Babysitter: On Biases in Language Generation
Emily Sheng
Kai-Wei Chang
Premkumar Natarajan
Nanyun Peng
217
616
0
03 Sep 2019