1803.09010
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Papers citing "Datasheets for Datasets"
50 / 1,069 papers shown
Five policy uses of algorithmic transparency and explainability
Matthew R. O’Shaughnessy
351
1
0
06 Feb 2023
The Gradient of Generative AI Release: Methods and Considerations
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Irene Solaiman
197
125
0
05 Feb 2023
TempEL: Linking Dynamically Evolving and Newly Emerging Entities
Neural Information Processing Systems (NeurIPS), 2023
Klim Zaporojets
Lucie-Aimée Kaffee
Johannes Deleu
Thomas Demeester
Chris Develder
Isabelle Augenstein
KELM
311
19
0
05 Feb 2023
Advances in Automatically Rating the Trustworthiness of Text Processing Services
AI and Ethics (AE), 2023
Biplav Srivastava
Kausik Lakkaraju
Mariana Bernagozzi
Marco Valtorta
146
8
0
04 Feb 2023
Lived Experience Matters: Automatic Detection of Stigma on Social Media Toward People Who Use Substances
Salvatore Giorgi
Douglas Bellew
Daniel Roy Sadek Habib
G. Sherman
Joao Sedoc
Chase Smitterberg
Amanda Devoto
McKenzie Himelein-Wachowiak
Brenda L. Curtis
162
4
0
04 Feb 2023
Out of Context: Investigating the Bias and Fairness Concerns of "Artificial Intelligence as a Service"
International Conference on Human Factors in Computing Systems (CHI), 2023
Kornel Lewicki
M. S. Lee
Jennifer Cobbe
Jatinder Singh
258
31
0
02 Feb 2023
TAPS Responsibility Matrix: A tool for responsible data science by design
Journal of Responsible Innovation (JRI), 2023
V. Urovi
R. Çelebi
Chang Sun
Linda Rieswijk
M. Erard
Arif Yilmaz
Kodylan Moodley
Parveen Kumar
Michel Dumontier
92
3
0
02 Feb 2023
Charting the Sociotechnical Gap in Explainable AI: A Framework to Address the Gap in XAI
Upol Ehsan
Koustuv Saha
M. D. Choudhury
Mark O. Riedl
268
72
0
01 Feb 2023
Mathematical Capabilities of ChatGPT
Neural Information Processing Systems (NeurIPS), 2023
Simon Frieder
Luca Pinchetti
Alexis Chevalier
Ryan-Rhys Griffiths
Tommaso Salvatori
Thomas Lukasiewicz
P. Petersen
Julius Berner
ELM
AI4MH
496
526
0
31 Jan 2023
Designing Data: Proactive Data Collection and Iteration for Machine Learning
Aspen K. Hopkins
Fred Hohman
Luca Zappella
Xavier Suau Cuadros
Dominik Moritz
188
6
0
24 Jan 2023
Unveiling the Risks of NFT Promotion Scams
International Conference on Web and Social Media (ICWSM), 2023
Sayak Saha Roy
Dipanjan Das
Priyanka Bose
Christopher Kruegel
Giovanni Vigna
Shirin Nilizadeh
197
19
0
24 Jan 2023
Simplistic Collection and Labeling Practices Limit the Utility of Benchmark Datasets for Twitter Bot Detection
The Web Conference (WWW), 2023
Chris Hays
Zachary Schutzman
Manish Raghavan
Erin Walk
Philipp Zimmer
236
37
0
17 Jan 2023
PlasmoFAB: A Benchmark to Foster Machine Learning for Plasmodium falciparum Protein Antigen Candidate Prediction
Jonas C. Ditz
Jacqueline Wistuba-Hamprecht
Timo Maier
Rolf Fendel
N. Pfeifer
Bernhard Reuter
132
3
0
16 Jan 2023
Computational Assessment of Hyperpartisanship in News Titles
International Conference on Web and Social Media (ICWSM), 2023
Hanjia Lyu
Jinsheng Pan
Zichen Wang
Jiebo Luo
211
7
0
16 Jan 2023
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets
Shuo Sun
Molei Qin
Xinrun Wang
Bo An
FaML
OffRL
AIFin
375
10
0
14 Jan 2023
How Data Scientists Review the Scholarly Literature
Conference on Human Information Interaction and Retrieval (CHIIR), 2023
Sheshera Mysore
Mahmood Jasim
Haoru Song
Sarah Akbar
Andre Kenneth Chase Randall
Narges Mahyar
AI4CE
174
9
0
10 Jan 2023
EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset
Neural Information Processing Systems (NeurIPS), 2023
Hao Tang
Kevin J. Liang
Matt Feiszli
Weiyao Wang
EgoV
420
26
0
09 Jan 2023
AI Maintenance: A Robustness Perspective
Computer (IEEE Computer), 2023
Pin-Yu Chen
Payel Das
310
18
0
08 Jan 2023
GeoDE: a Geographically Diverse Evaluation Dataset for Object Recognition
Neural Information Processing Systems (NeurIPS), 2023
V. V. Ramaswamy
S. Lin
Dora Zhao
Aaron B. Adcock
Laurens van der Maaten
Deepti Ghadiyaram
Olga Russakovsky
329
52
0
05 Jan 2023
FATE in AI: Towards Algorithmic Inclusivity and Accessibility
Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Isa Inuwa-Dutse
184
16
0
03 Jan 2023
Causal Deep Learning
International Conference on Pattern Recognition (ICPR), 2023
M. Alex O. Vasilescu
CML
604
3
1
01 Jan 2023
Large Language Models Encode Clinical Knowledge
Nature (Nature), 2022
K. Singhal
Shekoofeh Azizi
T. Tu
S. S. Mahdavi
Jason W. Wei
...
A. Rajkomar
Joelle Barral
Christopher Semturs
Alan Karthikesalingam
Vivek Natarajan
LM&MA
ELM
AI4MH
602
3,407
0
26 Dec 2022
Introduction to Machine Learning for Physicians: A Survival Guide for Data Deluge
Ricards Marcinkevics
Ece Ozkan
Julia E. Vogt
OOD
LM&MA
FedML
138
2
0
23 Dec 2022
Contrastive Language-Vision AI Models Pretrained on Web-Scraped Multimodal Data Exhibit Sexual Objectification Bias
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Robert Wolfe
Yiwei Yang
Billy Howe
Aylin Caliskan
DiffM
335
72
0
21 Dec 2022
Trustworthy Social Bias Measurement
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022
Rishi Bommasani
Abigail Z. Jacobs
243
13
0
20 Dec 2022
Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Lining Zhang
Simon Mille
Yufang Hou
Daniel Deutsch
Elizabeth Clark
...
Saad Mahamood
Sebastian Gehrmann
Miruna Clinciu
Khyathi Chandu
João Sedoc
190
14
0
20 Dec 2022
Efficient aggregation of face embeddings for decentralized face recognition deployments (extended version)
International Conference on Information Systems Security and Privacy (ICISSP), 2022
Philipp Hofer
Michael Roland
Philipp Schwarz
René Mayrhofer
CVBM
FedML
147
4
0
20 Dec 2022
Beyond Digital "Echo Chambers": The Role of Viewpoint Diversity in Political Discussion
Web Search and Data Mining (WSDM), 2022
Rishav Hada
A. E. Fard
Sarah Shugars
Federico Bianchi
Patrícia G. C. Rossini
Dirk Hovy
Rebekah Tromble
N. Tintarev
128
10
0
18 Dec 2022
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Akshatha Arodi
Martin Pömsl
Kaheer Suleman
Adam Trischler
Alexandra Olteanu
Jackie C.K. Cheung
ELM
232
5
0
15 Dec 2022
Tensions Between the Proxies of Human Values in AI
Teresa Datta
D. Nissani
Max Cembalest
Akash Khanna
Haley Massa
John P. Dickerson
212
4
0
14 Dec 2022
Trust, but Verify: Cross-Modality Fusion for HD Map Change Detection
John Lambert
James Hays
197
38
0
14 Dec 2022
Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining
International Conference on Machine Learning (ICML), 2022
Florian Tramèr
Gautam Kamath
Nicholas Carlini
SILM
400
96
0
13 Dec 2022
Measuring Data
Margaret Mitchell
A. Luccioni
Nathan Lambert
Marissa Gerchick
Angelina McMillan-Major
Ezinwanne Ozoani
Nazneen Rajani
Tristan Thrush
Yacine Jernite
Douwe Kiela
234
19
0
09 Dec 2022
Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks
Learning on Graphs Conference (LoG), 2022
Jiaqi Ma
Xingjian Zhang
Hezheng Fan
Jin Huang
Tianyue Li
Tinghong Li
Yiwen Tu
Chen Zhu
Qiaozhu Mei
285
5
0
08 Dec 2022
Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Ana Kotarcic
Dominik Hangartner
Fabrizio Gilardi
Selina Kurer
K. Donnay
220
4
0
05 Dec 2022
The Grind for Good Data: Understanding ML Practitioners' Struggles and Aspirations in Making Good Data
Inha Cha
Juhyun Oh
Cheul Young Park
Jiyoon Han
Hwalsuk Lee
177
2
0
28 Nov 2022
The Principles of Data-Centric AI (DCAI)
Communications of the ACM (CACM), 2022
M. H. Jarrahi
Ali Memariani
Shion Guha
158
80
0
26 Nov 2022
Elements of effective machine learning datasets in astronomy
Bernie Boscoe
Tuan Do
E. Jones
Yunqiang Li
Kevin Alfaro
Christy Ma
239
3
0
25 Nov 2022
Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation
Neural Information Processing Systems (NeurIPS), 2022
Sérgio Jesus
José P. Pombal
Duarte M. Alves
André F. Cruz
Pedro Saleiro
Rita P. Ribeiro
João Gama
P. Bizarro
234
44
0
24 Nov 2022
Video compression dataset and benchmark of learning-based video-quality metrics
Neural Information Processing Systems (NeurIPS), 2022
Anastasia Antsiferova
Sergey Lavrushkin
Maksim Smirnov
Alexander Gushchin
D. Vatolin
D. Kulikov
181
42
0
22 Nov 2022
The Stack: 3 TB of permissively licensed source code
Denis Kocetkov
Raymond Li
Loubna Ben Allal
Jia Li
Chenghao Mou
...
Sean M. Hughes
Thomas Wolf
Dzmitry Bahdanau
Leandro von Werra
H. D. Vries
245
406
0
20 Nov 2022
SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation
Yi Wang
Nassim Ait Ali Braham
Zhitong Xiong
Chenying Liu
C. Albrecht
Xiao Xiang Zhu
232
95
0
13 Nov 2022
Seamful XAI: Operationalizing Seamful Design in Explainable AI
Upol Ehsan
Q. V. Liao
Samir Passi
Mark O. Riedl
Hal Daumé
239
34
0
12 Nov 2022
An Inclusive Notion of Text
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Ilia Kuznetsov
Iryna Gurevych
162
0
0
10 Nov 2022
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
507
448
0
09 Nov 2022
DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2022
Nabeel Seedat
F. Imrie
M. Schaar
235
18
0
09 Nov 2022
The Legal Argument Reasoning Task in Civil Procedure
Leonard Bongard
Lena Held
Ivan Habernal
AILaw
ELM
151
22
0
05 Nov 2022
SMAuC -- The Scientific Multi-Authorship Corpus
ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2022
Janek Bevendorff
Philipp Sauer
Lukas Gienapp
Wolfgang Kircheis
Erik Korner
Benno Stein
Martin Potthast
167
2
0
04 Nov 2022
ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations
International Conference on Learning Representations (ICLR), 2022
Badr Youbi Idrissi
Diane Bouchacourt
Randall Balestriero
Ivan Evtimov
C. Hazirbas
Nicolas Ballas
Pascal Vincent
M. Drozdzal
David Lopez-Paz
Mark Ibrahim
VLM
ViT
211
49
0
03 Nov 2022
My Face My Choice: Privacy Enhancing Deepfakes for Social Media Anonymization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
U. Ciftci
Gokturk Yuksek
Ilke Demir
PICV
CVBM
217
24
0
02 Nov 2022