Datasheets for Datasets

23 March 2018

Timnit Gebru

Jamie Morgenstern

Briana Vecchione

Jennifer Wortman Vaughan

Papers citing "Datasheets for Datasets"

50 / 1,072 papers shown

Towards a Responsible AI Development Lifecycle: Lessons From Information Security

Erick Galinkin

SILM

149

06 Mar 2022

System Cards for AI-Based Decision-Making for Public Policy

Furkan Gursoy

I. Kakadiaris

MLAU

236

01 Mar 2022

Healthsheet: Development of a Transparency Artifact for Health Datasets

Conference on Fairness, Accountability and Transparency (FAccT), 2022

214

26 Feb 2022

The four-fifths rule is not disparate impact: a woeful tale of epistemic trespassing in algorithmic fairness

Conference on Fairness, Accountability and Transparency (FAccT), 2022

E. A. Watkins

Michael McKenna

Jiahao Chen

141

19 Feb 2022

Personalization Trade-offs in Designing a Dialogue-based Information System for Support-Seeking of Sexual Violence Survivors

International Conference on Human Factors in Computing Systems (CHI), 2022

112

18 Feb 2022

Symphony: Composing Interactive Interfaces for Machine Learning

International Conference on Human Factors in Computing Systems (CHI), 2022

Alex Bäuerle

Ángel Alexander Cabrera

Dominik Moritz

179

18 Feb 2022

Seeing Like a Toolkit: How Toolkits Envision the Work of AI Ethics

Richmond Y. Wong

Michael A. Madaio

Nick Merrill

286

109

17 Feb 2022

Impact of Pretraining Term Frequencies on Few-Shot Reasoning

311

173

15 Feb 2022

Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text

Journal of Artificial Intelligence Research (JAIR), 2022

705

221

14 Feb 2022

Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?

Conference on Fairness, Accountability and Transparency (FAccT), 2022

P. Schramowski

Christopher Tauchmann

Kristian Kersting

FaML

362

147

14 Feb 2022

Accountability in an Algorithmic Society: Relationality, Responsibility, and Robustness in Machine Learning

Conference on Fairness, Accountability and Transparency (FAccT), 2022

289

112

10 Feb 2022

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning

European Conference on Computer Vision (ECCV), 2022

Yejin Choi

497

10 Feb 2022

The craft and coordination of data curation: complicating "workflow" views of data science

134

09 Feb 2022

Towards a consistent interpretation of AIOps models

ACM Transactions on Software Engineering and Methodology (TOSEM), 2022

Yingzhe Lyu

Gopi Krishnan Rajbahadur

233

04 Feb 2022

Towards Training Reproducible Deep Learning Models

International Conference on Software Engineering (ICSE), 2022

Gopi Krishnan Rajbahadur

Zhen Ming

Z. Jiang

SyDa

150

04 Feb 2022

Net benefit, calibration, threshold selection, and training objectives for algorithmic fairness in healthcare

Conference on Fairness, Accountability and Transparency (FAccT), 2022

209

03 Feb 2022

Adaptive Sampling Strategies to Construct Equitable Training Datasets

Conference on Fairness, Accountability and Transparency (FAccT), 2022

249

31 Jan 2022

Fair ranking: a critical review, challenges, and future directions

Conference on Fairness, Accountability and Transparency (FAccT), 2022

227

29 Jan 2022

IMACS: Image Model Attribution Comparison Summaries

203

26 Jan 2022

Natural Language Descriptions of Deep Visual Features

International Conference on Learning Representations (ICLR), 2022

Antonio Torralba

986

150

26 Jan 2022

Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Luke Zettlemoyer

467

25 Jan 2022

An Algorithmic Framework for Bias Bounties

Conference on Fairness, Accountability and Transparency (FAccT), 2022

450

25 Jan 2022

Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources

Angelina McMillan-Major

...

Daniel Alexander van Strien

Yacine Jernite

210

25 Jan 2022

Evaluating a Methodology for Increasing AI Transparency: A Case Study

David Piorkowski

John T. Richards

Michael Hind

203

24 Jan 2022

Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals

Journal of Biomedical Informatics (JBI), 2022

215

18 Jan 2022

OmniPrint: A Configurable Printed Character Synthesizer

203

17 Jan 2022

The Dataset Nutrition Label (2nd Gen): Leveraging Context to Mitigate Harms in Artificial Intelligence

214

10 Jan 2022

MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound

Computer Vision and Pattern Recognition (CVPR), 2022

Yejin Choi

500

238

07 Jan 2022

Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation

Transactions of the Association for Computational Linguistics (TACL), 2022

Zoey Liu

Emily Tucker Prudhommeaux

299

05 Jan 2022

STEREO: Scientific Text Reuse in Open Access Publications

Scientific Data (Sci Data), 2021

215

22 Dec 2021

Validation and Transparency in AI systems for pharmacovigilance: a case study applied to the medical literature monitoring of adverse events

Bruno Ohana

Jack D. Sullivan

Nicole L. Baker

21 Dec 2021

AI Ethics Principles in Practice: Perspectives of Designers and Developers

422

14 Dec 2021

A Framework for Fairness: A Systematic Review of Existing Fair AI Solutions

Brianna Richardson

J. Gilbert

FaML

183

10 Dec 2021

Whose Ground Truth? Accounting for Individual and Collective Identities Underlying Dataset Annotation

Emily L. Denton

Mark Díaz

Ian D Kivlichan

Vinodkumar Prabhakaran

Rachel Rosen

155

08 Dec 2021

Dataset Geography: Mapping Language Data to Language Users

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

Fahim Faisal

Yinkai Wang

Antonios Anastasopoulos

221

07 Dec 2021

Text2Mesh: Text-Driven Neural Stylization for Meshes

Computer Vision and Pattern Recognition (CVPR), 2021

Sagie Benaim

1.3K

416

06 Dec 2021

Thinking Beyond Distributions in Testing Machine Learned Models

Negar Rostamzadeh

B. Hutchinson

Christina Greer

Vinodkumar Prabhakaran

TTA

220

06 Dec 2021

Toward a Taxonomy of Trust for Probabilistic Machine Learning

Science Advances (Sci Adv), 2021

196

05 Dec 2021

Could AI Democratise Education? Socio-Technical Imaginaries of an EdTech Revolution

183

03 Dec 2021

Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research

221

165

03 Dec 2021

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

167

02 Dec 2021

A Causal Approach for Unfair Edge Prioritization and Discrimination Removal

Asian Conference on Machine Learning (ACML), 2021

Pavan Ravishankar

Pranshu Malviya

Balaraman Ravindran

209

29 Nov 2021

AI and the Everything in the Whole Wide World Benchmark

Inioluwa Deborah Raji

245

397

26 Nov 2021

RedCaps: web-curated image-text data created by the people, for the people

283

191

22 Nov 2021

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions

Computer Vision and Pattern Recognition (CVPR), 2021

253

251

19 Nov 2021

ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation

Laurynas Karazija

Iro Laina

Christian Rupprecht

3DV VOS

311

103

19 Nov 2021

A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling

177

19 Nov 2021

Software Engineering for Responsible AI: An Empirical Study and Operationalised Patterns

Liming Zhu

154

18 Nov 2021

Who Decides if AI is Fair? The Labels Problem in Algorithmic Auditing

Abhilash Mishra

Yash Gorana

16 Nov 2021

Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Maarten Sap

Swabha Swayamdipta

Laura Vianna

Xuhui Zhou

Yejin Choi

Noah A. Smith

238

335

15 Nov 2021