Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.09010
Cited By
v1
v2
v3
v4
v5
v6
v7
v8 (latest)
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Datasheets for Datasets"
50 / 1,072 papers shown
Towards a Responsible AI Development Lifecycle: Lessons From Information Security
Erick Galinkin
SILM
149
6
0
06 Mar 2022
System Cards for AI-Based Decision-Making for Public Policy
Furkan Gursoy
I. Kakadiaris
MLAU
236
21
0
01 Mar 2022
Healthsheet: Development of a Transparency Artifact for Health Datasets
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Negar Rostamzadeh
Diana Mincu
Subhrajit Roy
A. Smart
Lauren Wilcox
Mahima Pushkarna
Jessica Schrouff
Razvan Amironesei
Nyalleng Moorosi
Katherine A. Heller
214
75
0
26 Feb 2022
The four-fifths rule is not disparate impact: a woeful tale of epistemic trespassing in algorithmic fairness
Conference on Fairness, Accountability and Transparency (FAccT), 2022
E. A. Watkins
Michael McKenna
Jiahao Chen
141
45
0
19 Feb 2022
Personalization Trade-offs in Designing a Dialogue-based Information System for Support-Seeking of Sexual Violence Survivors
International Conference on Human Factors in Computing Systems (CHI), 2022
Hyeok Kim
Youjin Hwang
Jieun Lee
Youngjin Kwon
Yujin Park
Joonhwan Lee
112
6
0
18 Feb 2022
Symphony: Composing Interactive Interfaces for Machine Learning
International Conference on Human Factors in Computing Systems (CHI), 2022
Alex Bäuerle
Ángel Alexander Cabrera
Fred Hohman
Megan Maher Welsh
David Koski
Xavier Suau
Titus Barik
Dominik Moritz
179
59
0
18 Feb 2022
Seeing Like a Toolkit: How Toolkits Envision the Work of AI Ethics
Richmond Y. Wong
Michael A. Madaio
Nick Merrill
286
109
0
17 Feb 2022
Impact of Pretraining Term Frequencies on Few-Shot Reasoning
Yasaman Razeghi
Robert L Logan IV
Matt Gardner
Sameer Singh
ReLM
LRM
311
173
0
15 Feb 2022
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text
Journal of Artificial Intelligence Research (JAIR), 2022
Sebastian Gehrmann
Elizabeth Clark
Thibault Sellam
ELM
AI4CE
705
221
0
14 Feb 2022
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?
Conference on Fairness, Accountability and Transparency (FAccT), 2022
P. Schramowski
Christopher Tauchmann
Kristian Kersting
FaML
362
147
0
14 Feb 2022
Accountability in an Algorithmic Society: Relationality, Responsibility, and Robustness in Machine Learning
Conference on Fairness, Accountability and Transparency (FAccT), 2022
A. Feder Cooper
Emanuel Moss
Benjamin Laufer
Helen Nissenbaum
MLAU
289
112
0
10 Feb 2022
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
European Conference on Computer Vision (ECCV), 2022
Jack Hessel
Jena D. Hwang
Jinho Park
Rowan Zellers
Chandra Bhagavatula
Anna Rohrbach
Kate Saenko
Yejin Choi
ReLM
497
61
0
10 Feb 2022
The craft and coordination of data curation: complicating "workflow" views of data science
A. Thomer
Dharma Akmon
J. York
Allison R. B. Tyler
Faye O. Polasek
Sara Lafia
Libby Hemphill
E. Yakel
134
25
0
09 Feb 2022
Towards a consistent interpretation of AIOps models
ACM Transactions on Software Engineering and Methodology (TOSEM), 2022
Yingzhe Lyu
Gopi Krishnan Rajbahadur
Dayi Lin
Boyuan Chen
Zhen Ming
Z. Jiang
AI4CE
233
26
0
04 Feb 2022
Towards Training Reproducible Deep Learning Models
International Conference on Software Engineering (ICSE), 2022
Boyuan Chen
Mingzhi Wen
Yong Shi
Dayi Lin
Gopi Krishnan Rajbahadur
Zhen Ming
Z. Jiang
SyDa
150
49
0
04 Feb 2022
Net benefit, calibration, threshold selection, and training objectives for algorithmic fairness in healthcare
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Stephen Pfohl
Yizhe Xu
Agata Foryciarz
Nikolaos Ignatiadis
Julian Z. Genkins
N. Shah
209
34
0
03 Feb 2022
Adaptive Sampling Strategies to Construct Equitable Training Datasets
Conference on Fairness, Accountability and Transparency (FAccT), 2022
William Cai
R. Encarnación
Bobbie Chern
S. Corbett-Davies
Miranda Bogen
Stevie Bergman
Sharad Goel
249
33
0
31 Jan 2022
Fair ranking: a critical review, challenges, and future directions
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Gourab K. Patro
Lorenzo Porcaro
Laura Mitchell
Qiuyue Zhang
Meike Zehlike
Nikhil Garg
227
66
0
29 Jan 2022
IMACS: Image Model Attribution Comparison Summaries
E. Schoop
Benjamin D. Wedin
A. Kapishnikov
Tolga Bolukbasi
Michael Terry
FAtt
203
1
0
26 Jan 2022
Natural Language Descriptions of Deep Visual Features
International Conference on Learning Representations (ICLR), 2022
Evan Hernandez
Sarah Schwettmann
David Bau
Teona Bagashvili
Antonio Torralba
Jacob Andreas
MILM
986
150
0
26 Jan 2022
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Suchin Gururangan
Dallas Card
Sarah K. Drier
E. K. Gade
Leroy Z. Wang
Zeyu Wang
Luke Zettlemoyer
Noah A. Smith
467
94
0
25 Jan 2022
An Algorithmic Framework for Bias Bounties
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Ira Globus-Harris
Michael Kearns
Aaron Roth
FedML
450
31
0
25 Jan 2022
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
Angelina McMillan-Major
Zaid Alyafeai
Stella Biderman
Kimbo Chen
F. Toni
...
Aitor Soroa Etxabe
Pedro Ortiz Suarez
Zeerak Talat
Daniel Alexander van Strien
Yacine Jernite
210
14
0
25 Jan 2022
Evaluating a Methodology for Increasing AI Transparency: A Case Study
David Piorkowski
John T. Richards
Michael Hind
203
6
0
24 Jan 2022
Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals
Journal of Biomedical Informatics (JBI), 2022
Kathrin Blagec
J. Kraiger
Wolfgang Frühwirt
Matthias Samwald
AI4MH
215
37
0
18 Jan 2022
OmniPrint: A Configurable Printed Character Synthesizer
Haozhe Sun
Wei-Wei Tu
Isabelle M Guyon
SyDa
203
7
0
17 Jan 2022
The Dataset Nutrition Label (2nd Gen): Leveraging Context to Mitigate Harms in Artificial Intelligence
Kasia Chmielinski
S. Newman
Matt Taylor
Joshua Joseph
Kemi Thomas
Jessica Yurkofsky
Yue Qiu
214
68
0
10 Jan 2022
MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Computer Vision and Pattern Recognition (CVPR), 2022
Rowan Zellers
Jiasen Lu
Ximing Lu
Youngjae Yu
Yanpeng Zhao
Mohammadreza Salehi
Aditya Kusupati
Jack Hessel
Ali Farhadi
Yejin Choi
500
238
0
07 Jan 2022
Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation
Transactions of the Association for Computational Linguistics (TACL), 2022
Zoey Liu
Emily Tucker Prudhommeaux
299
9
0
05 Jan 2022
STEREO: Scientific Text Reuse in Open Access Publications
Scientific Data (Sci Data), 2021
Lukas Gienapp
Wolfgang Kircheis
Bjarne Sievers
Benno Stein
Martin Potthast
215
10
0
22 Dec 2021
Validation and Transparency in AI systems for pharmacovigilance: a case study applied to the medical literature monitoring of adverse events
Bruno Ohana
Jack D. Sullivan
Nicole L. Baker
94
1
0
21 Dec 2021
AI Ethics Principles in Practice: Perspectives of Designers and Developers
Conrad Sanderson
David M. Douglas
Qinghua Lu
Emma Schleiger
Jon Whittle
J. Lacey
G. Newnham
S. Hajkowicz
Cathy J. Robinson
David Hansen
FaML
422
69
0
14 Dec 2021
A Framework for Fairness: A Systematic Review of Existing Fair AI Solutions
Brianna Richardson
J. Gilbert
FaML
183
48
0
10 Dec 2021
Whose Ground Truth? Accounting for Individual and Collective Identities Underlying Dataset Annotation
Emily L. Denton
Mark Díaz
Ian D Kivlichan
Vinodkumar Prabhakaran
Rachel Rosen
155
79
0
08 Dec 2021
Dataset Geography: Mapping Language Data to Language Users
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Fahim Faisal
Yinkai Wang
Antonios Anastasopoulos
221
27
0
07 Dec 2021
Text2Mesh: Text-Driven Neural Stylization for Meshes
Computer Vision and Pattern Recognition (CVPR), 2021
O. Michel
Roi Bar-On
Richard Liu
Sagie Benaim
Rana Hanocka
CLIP
AI4CE
1.3K
416
0
06 Dec 2021
Thinking Beyond Distributions in Testing Machine Learned Models
Negar Rostamzadeh
B. Hutchinson
Christina Greer
Vinodkumar Prabhakaran
TTA
220
6
0
06 Dec 2021
Toward a Taxonomy of Trust for Probabilistic Machine Learning
Science Advances (Sci Adv), 2021
Tamara Broderick
Andrew Gelman
Rachael Meager
Anna L. Smith
Tian Zheng
196
15
0
05 Dec 2021
Could AI Democratise Education? Socio-Technical Imaginaries of an EdTech Revolution
Sahan Bulathwela
Maria Perez-Ortiz
C. Holloway
John Shawe-Taylor
183
26
0
03 Dec 2021
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research
Bernard Koch
Emily L. Denton
A. Hanna
J. Foster
221
165
0
03 Dec 2021
CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer
Moein Sorkhei
Yue Liu
Hossein Azizpour
E. Azavedo
Karin Dembrower
Dimitra Ntoula
Athanasios Zouzos
Fredrik Strand
Kevin Smith
167
17
0
02 Dec 2021
A Causal Approach for Unfair Edge Prioritization and Discrimination Removal
Asian Conference on Machine Learning (ACML), 2021
Pavan Ravishankar
Pranshu Malviya
Balaraman Ravindran
209
1
0
29 Nov 2021
AI and the Everything in the Whole Wide World Benchmark
Inioluwa Deborah Raji
Emily M. Bender
Amandalynne Paullada
Emily L. Denton
A. Hanna
245
397
0
26 Nov 2021
RedCaps: web-curated image-text data created by the people, for the people
Karan Desai
Gaurav Kaul
Zubin Aysola
Justin Johnson
283
191
0
22 Nov 2021
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
Computer Vision and Pattern Recognition (CVPR), 2021
Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
B. Guo
AI4TS
VLM
253
251
0
19 Nov 2021
ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation
Laurynas Karazija
Iro Laina
Christian Rupprecht
3DV
VOS
311
103
0
19 Nov 2021
A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling
Eustache Diemert
Artem Betlei
Christophe Renaudin
Massih-Reza Amini
T. Gregoir
Thibaud Rahier
CML
177
10
0
19 Nov 2021
Software Engineering for Responsible AI: An Empirical Study and Operationalised Patterns
Qinghua Lu
Liming Zhu
Xiwei Xu
Jon Whittle
David M. Douglas
Conrad Sanderson
154
44
0
18 Nov 2021
Who Decides if AI is Fair? The Labels Problem in Algorithmic Auditing
Abhilash Mishra
Yash Gorana
72
4
0
16 Nov 2021
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Maarten Sap
Swabha Swayamdipta
Laura Vianna
Xuhui Zhou
Yejin Choi
Noah A. Smith
238
335
0
15 Nov 2021
Previous
1
2
3
...
16
17
18
...
20
21
22
Next