Datasheets for Datasets

23 March 2018

Timnit Gebru

Jamie Morgenstern

Briana Vecchione

Jennifer Wortman Vaughan

Papers citing "Datasheets for Datasets"

50 / 1,069 papers shown

Five policy uses of algorithmic transparency and explainability

Matthew R. O’Shaughnessy

351

06 Feb 2023

The Gradient of Generative AI Release: Methods and Considerations

Conference on Fairness, Accountability and Transparency (FAccT), 2023

Irene Solaiman

197

125

05 Feb 2023

TempEL: Linking Dynamically Evolving and Newly Emerging Entities

Neural Information Processing Systems (NeurIPS), 2023

311

05 Feb 2023

Advances in Automatically Rating the Trustworthiness of Text Processing Services

AI and Ethics (AE), 2023

146

04 Feb 2023

Lived Experience Matters: Automatic Detection of Stigma on Social Media Toward People Who Use Substances

Salvatore Giorgi

Douglas Bellew

Daniel Roy Sadek Habib

G. Sherman

Joao Sedoc

Chase Smitterberg

Amanda Devoto

McKenzie Himelein-Wachowiak

Brenda L. Curtis

162

04 Feb 2023

Out of Context: Investigating the Bias and Fairness Concerns of "Artificial Intelligence as a Service"

International Conference on Human Factors in Computing Systems (CHI), 2023

258

02 Feb 2023

TAPS Responsibility Matrix: A tool for responsible data science by design

Journal of Responsible Innovation (JRI), 2023

02 Feb 2023

Charting the Sociotechnical Gap in Explainable AI: A Framework to Address the Gap in XAI

268

01 Feb 2023

Mathematical Capabilities of ChatGPT

Neural Information Processing Systems (NeurIPS), 2023

496

526

31 Jan 2023

Designing Data: Proactive Data Collection and Iteration for Machine Learning

Aspen K. Hopkins

Fred Hohman

Luca Zappella

Xavier Suau Cuadros

Dominik Moritz

188

24 Jan 2023

Unveiling the Risks of NFT Promotion Scams

International Conference on Web and Social Media (ICWSM), 2023

197

24 Jan 2023

Simplistic Collection and Labeling Practices Limit the Utility of Benchmark Datasets for Twitter Bot Detection

The Web Conference (WWW), 2023

236

17 Jan 2023

PlasmoFAB: A Benchmark to Foster Machine Learning for Plasmodium falciparum Protein Antigen Candidate Prediction

Jonas C. Ditz

Jacqueline Wistuba-Hamprecht

132

16 Jan 2023

Computational Assessment of Hyperpartisanship in News Titles

International Conference on Web and Social Media (ICWSM), 2023

211

16 Jan 2023

PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets

375

14 Jan 2023

How Data Scientists Review the Scholarly Literature

Conference on Human Information Interaction and Retrieval (CHIIR), 2023

Andre Kenneth Chase Randall

Narges Mahyar

AI4CE

174

10 Jan 2023

EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset

Neural Information Processing Systems (NeurIPS), 2023

420

09 Jan 2023

AI Maintenance: A Robustness Perspective

Computer (IEEE Computer), 2023

Pin-Yu Chen

Payel Das

310

08 Jan 2023

GeoDE: a Geographically Diverse Evaluation Dataset for Object Recognition

Neural Information Processing Systems (NeurIPS), 2023

Laurens van der Maaten

Deepti Ghadiyaram

Olga Russakovsky

329

05 Jan 2023

FATE in AI: Towards Algorithmic Inclusivity and Accessibility

Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023

Isa Inuwa-Dutse

184

03 Jan 2023

Causal Deep Learning

International Conference on Pattern Recognition (ICPR), 2023

M. Alex O. Vasilescu

CML

604

01 Jan 2023

Large Language Models Encode Clinical Knowledge

Nature (Nature), 2022

...

Alan Karthikesalingam

Vivek Natarajan

LM&MA ELM AI4MH

602

3,407

26 Dec 2022

Introduction to Machine Learning for Physicians: A Survival Guide for Data Deluge

138

23 Dec 2022

Contrastive Language-Vision AI Models Pretrained on Web-Scraped Multimodal Data Exhibit Sexual Objectification Bias

Conference on Fairness, Accountability and Transparency (FAccT), 2022

335

21 Dec 2022

Trustworthy Social Bias Measurement

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022

Rishi Bommasani

Abigail Z. Jacobs

243

20 Dec 2022

Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Simon Mille

...

190

20 Dec 2022

Efficient aggregation of face embeddings for decentralized face recognition deployments (extended version)

International Conference on Information Systems Security and Privacy (ICISSP), 2022

147

20 Dec 2022

Beyond Digital "Echo Chambers": The Role of Viewpoint Diversity in Political Discussion

Web Search and Data Mining (WSDM), 2022

Patrícia G. C. Rossini

Dirk Hovy

Rebekah Tromble

N. Tintarev

128

18 Dec 2022

The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding Systems

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

232

15 Dec 2022

Tensions Between the Proxies of Human Values in AI

212

14 Dec 2022

Trust, but Verify: Cross-Modality Fusion for HD Map Change Detection

John Lambert

James Hays

197

14 Dec 2022

Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining

International Conference on Machine Learning (ICML), 2022

400

13 Dec 2022

Angelina McMillan-Major

Douwe Kiela

234

09 Dec 2022

Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks

LOG IN (LOG IN), 2022

Jiaqi Ma

285

08 Dec 2022

Human-in-the-Loop Hate Speech Classification in a Multilingual Context

220

05 Dec 2022

The Grind for Good Data: Understanding ML Practitioners' Struggles and Aspirations in Making Good Data

177

28 Nov 2022

The Principles of Data-Centric AI (DCAI)

Communications of the ACM (CACM), 2022

M. H. Jarrahi

Ali Memariani

Shion Guha

158

26 Nov 2022

Elements of effective machine learning datasets in astronomy

239

25 Nov 2022

Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation

Neural Information Processing Systems (NeurIPS), 2022

234

24 Nov 2022

Video compression dataset and benchmark of learning-based video-quality metrics

Neural Information Processing Systems (NeurIPS), 2022

Anastasia Antsiferova

181

22 Nov 2022

The Stack: 3 TB of permissively licensed source code

...

245

406

20 Nov 2022

SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation

Yi Wang

Nassim Ait Ali Braham

Zhitong Xiong

Chenying Liu

C. Albrecht

Xiao Xiang Zhu

232

13 Nov 2022

Seamful XAI: Operationalizing Seamful Design in Explainable AI

239

12 Nov 2022

An Inclusive Notion of Text

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Ilia Kuznetsov

Iryna Gurevych

162

10 Nov 2022

Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2022

507

448

09 Nov 2022

DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems

IEEE Transactions on Artificial Intelligence (IEEE TAI), 2022

Nabeel Seedat

F. Imrie

M. Schaar

235

09 Nov 2022

The Legal Argument Reasoning Task in Civil Procedure

151

05 Nov 2022

SMAuC -- The Scientific Multi-Authorship Corpus

ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2022

167

04 Nov 2022

ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations

International Conference on Learning Representations (ICLR), 2022

Pascal Vincent

211

03 Nov 2022

My Face My Choice: Privacy Enhancing Deepfakes for Social Media Anonymization

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

217

02 Nov 2022