Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.09010
Cited By
v1
v2
v3
v4
v5
v6
v7
v8 (latest)
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Datasheets for Datasets"
50 / 1,069 papers shown
Attribute Diversity Determines the Systematicity Gap in VQA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ian Berlot-Attwell
Kumar Krishna Agrawal
A. M. Carrell
Yash Sharma
Naomi Saphra
254
2
0
15 Nov 2023
Fairness Hacking: The Malicious Practice of Shrouding Unfairness in Algorithms
Kristof Meding
Thilo Hagendorff
150
8
0
12 Nov 2023
MultiIoT: Benchmarking Machine Learning for the Internet of Things
Shentong Mo
Louis-Philippe Morency
Russ Salakhutdinov
Paul Pu Liang
199
2
0
10 Nov 2023
Is a Seat at the Table Enough? Engaging Teachers and Students in Dataset Specification for ML in Education
Mei Tan
Hansol Lee
Dakuo Wang
Hariharan Subramonyam
175
12
0
09 Nov 2023
Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Joan Nwatu
Oana Ignat
Amélie Reymond
206
13
0
09 Nov 2023
On Leakage in Machine Learning Pipelines
Leonard Sasse
Eliana Nicolaisen-Sobesky
Juergen Dukart
Simon B. Eickhoff
Michael Götz
...
Abhijit Kulkarni
Juha Lahnakoski
Bradley C. Love
F. Raimondo
K. Patil
AI4CE
293
16
0
07 Nov 2023
Benefits and Harms of Large Language Models in Digital Mental Health
Munmun De Choudhury
Sachin R. Pendse
Neha Kumar
LM&MA
AI4MH
201
63
0
07 Nov 2023
Contextual Confidence and Generative AI
Shrey Jain
Zoe Hitzig
Pamela Mishkin
307
4
0
02 Nov 2023
FAIRLABEL: Correcting Bias in Labels
Srinivasan H. Sengamedu
Hien Pham
132
0
0
01 Nov 2023
ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
Neural Information Processing Systems (NeurIPS), 2023
Jieming Cui
Ziren Gong
Baoxiong Jia
Siyuan Huang
Zilong Zheng
Jianzhu Ma
Yixin Zhu
209
4
0
01 Nov 2023
Sentiment Analysis in Digital Spaces: An Overview of Reviews
L. Ayravainen
Joanne Hinds
Brittany I. Davidson
240
1
0
30 Oct 2023
A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture
Qianqian Shen
Yunhan Zhao
Nahyun Kwon
Jeeeun Kim
Yanan Li
Shu Kong
171
3
0
30 Oct 2023
There Are No Data Like More Data- Datasets for Deep Learning in Earth Observation
IEEE Geoscience and Remote Sensing Magazine (GRSM), 2023
Michael Schmitt
S. A. Ahmadi
Yonghao Xu
G. Taşkın
Ujjwal Verma
F. Sica
Ronny Hansch
187
36
0
30 Oct 2023
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging
Neural Information Processing Systems (NeurIPS), 2023
Zitong S. Chen
Chau Pham
Siqi Wang
Michael Doron
Nikita Moshkov
Bryan A. Plummer
Juan C. Caicedo
184
14
0
30 Oct 2023
AI for Open Science: A Multi-Agent Perspective for Ethically Translating Data to Knowledge
Chase Yakaboski
Gregory Hyde
Clement Nyanhongo
Eugene Santos
219
1
0
28 Oct 2023
CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data
Neural Information Processing Systems (NeurIPS), 2023
Taiki Miyanishi
Fumiya Kitamori
Shuhei Kurita
Jungdae Lee
M. Kawanabe
Nakamasa Inoue
AI4TS
3DPC
222
15
0
28 Oct 2023
WCLD: Curated Large Dataset of Criminal Cases from Wisconsin Circuit Courts
Neural Information Processing Systems (NeurIPS), 2023
Elliott Ash
Naman Goel
Nianyun Li
Claudia Marangon
Peiyao Sun
265
2
0
28 Oct 2023
Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (JSTARS), 2023
Yi Wang
Hugo Hernández Hernández
C. Albrecht
Xiao Xiang Zhu
252
54
0
28 Oct 2023
Socially Cognizant Robotics for a Technology Enhanced Society
Kristin J. Dana
Clinton Andrews
Kostas Bekris
Jacob Feldman
Matthew Stone
Pernille Hemmer
Aaron Mazzeo
Hal Salzman
Jingang Yi
207
3
0
27 Oct 2023
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
...
K. Bollacker
Tongshuang Wu
Luis Villa
Sandy Pentland
Sara Hooker
302
84
0
25 Oct 2023
AI Hazard Management: A framework for the systematic management of root causes for AI risks
Ronald Schnitzer
Andreas Hapfelmeier
Sven Gaube
Sonja Zillner
276
6
0
25 Oct 2023
Can You Rely on Your Model Evaluation? Improving Model Evaluation with Synthetic Test Data
Neural Information Processing Systems (NeurIPS), 2023
B. V. Breugel
Nabeel Seedat
F. Imrie
M. Schaar
SyDa
211
36
0
25 Oct 2023
ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors
Neural Information Processing Systems (NeurIPS), 2023
Xiaoxuan Ma
Stephan P. Kaufhold
Jiajun Su
Wentao Zhu
Jack Terwilliger
Andres Meza
Yixin Zhu
Federico Rossano
Yizhou Wang
230
26
0
25 Oct 2023
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Sander Schulhoff
Jeremy Pinto
Anaum Khan
Louis-Franccois Bouchard
Chenglei Si
Svetlina Anati
Valen Tagliabue
Anson Liu Kost
Christopher Carnahan
Jordan L. Boyd-Graber
SILM
355
63
0
24 Oct 2023
On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms
S. Mittal
K. Thakral
Richa Singh
Mayank Vatsa
Tamar Glaser
Cristian Canton Ferrer
Tal Hassner
FaML
292
3
0
24 Oct 2023
RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions
Neural Information Processing Systems (NeurIPS), 2023
Lingdong Kong
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
OODD
281
57
0
23 Oct 2023
The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Pranav Narayanan Venkit
Mukund Srinath
Sanjana Gautam
Saranya Venkatraman
Vipul Gupta
R. Passonneau
Shomir Wilson
204
18
0
18 Oct 2023
CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Myra Cheng
Tiziano Piccardi
Diyi Yang
LLMAG
323
106
0
17 Oct 2023
A State-Vector Framework for Dataset Effects
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
E. Sahak
Zining Zhu
Frank Rudzicz
217
1
0
17 Oct 2023
The AI Incident Database as an Educational Tool to Raise Awareness of AI Harms: A Classroom Exploration of Efficacy, Limitations, & Future Improvements
Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Michael Feffer
Nikolas Martelaro
Hoda Heidari
180
27
0
10 Oct 2023
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lucie-Aimée Kaffee
Arnav Arora
Isabelle Augenstein
204
6
0
09 Oct 2023
InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Nils Feldhus
Qianli Wang
Tatiana Anikina
Sahil Chopra
Cennet Oguz
Sebastian Möller
310
19
0
09 Oct 2023
DORIS-MAE: Scientific Document Retrieval using Multi-level Aspect-based Queries
Neural Information Processing Systems (NeurIPS), 2023
Jianyou Wang
Kaicheng Wang
Xiaoyue Wang
Prudhviraj Naidu
Leon Bergen
R. Paturi
473
14
0
07 Oct 2023
The Empty Signifier Problem: Towards Clearer Paradigms for Operationalising "Alignment" in Large Language Models
Hannah Rose Kirk
Bertie Vidgen
Paul Röttger
Scott A. Hale
381
9
0
03 Oct 2023
Grasping AI: experiential exercises for designers
Ai & Society (AI & Society), 2023
Dave Murray-Rust
M. Lupetti
Iohanna Nicenboim
W. V. D. Hoog
148
16
0
02 Oct 2023
HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count
Neural Information Processing Systems (NeurIPS), 2023
N. Wiederhold
Ava Megyeri
DiMaggio Paris
Sean Banerjee
N. Banerjee
454
21
0
01 Oct 2023
Berkeley Open Extended Reality Recordings 2023 (BOXRR-23): 4.7 Million Motion Capture Recordings from 105,852 Extended Reality Device Users
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
V. Nair
Wenbo Guo
Rui Wang
J. F. O'Brien
Louis B. Rosenberg
Dawn Song
263
9
0
30 Sep 2023
LagrangeBench: A Lagrangian Fluid Mechanics Benchmarking Suite
Neural Information Processing Systems (NeurIPS), 2023
Stefania Costantini
Gianluca Galletti
Fabian Fritz
Stefan Adami
Nikolaus A. Adams
272
22
0
28 Sep 2023
More than Model Documentation: Uncovering Teachers' Bespoke Information Needs for Informed Classroom Integration of ChatGPT
International Conference on Human Factors in Computing Systems (CHI), 2023
Mei Tan
Hariharan Subramonyam
250
31
0
25 Sep 2023
SINCERE: Supervised Information Noise-Contrastive Estimation REvisited
Patrick Feeney
M. C. Hughes
239
3
0
25 Sep 2023
Affective Game Computing: A Survey
Proceedings of the IEEE (Proc. IEEE), 2023
Georgios N. Yannakakis
Dávid Melhárt
238
19
0
25 Sep 2023
VidChapters-7M: Video Chapters at Scale
Neural Information Processing Systems (NeurIPS), 2023
Antoine Yang
Arsha Nagrani
Ivan Laptev
Josef Sivic
Cordelia Schmid
VGen
242
38
0
25 Sep 2023
Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data
Neural Information Processing Systems (NeurIPS), 2023
Wai Tong Chung
Bassem Akoush
Pushan Sharma
Alex Tamkin
Kihoon Jung
...
D. Brouzet
M. Talei
B. Savard
A. Poludnenko
M. Ihme
AI4CE
345
28
0
23 Sep 2023
Unlocking Model Insights: A Dataset for Automated Model Card Generation
Shruti Singh
Hitesh Lodwal
Husain Malwat
Rakesh Thakur
Mayank Singh
SyDa
160
4
0
22 Sep 2023
The Cambridge Law Corpus: A Dataset for Legal AI Research
Social Science Research Network (SSRN), 2023
Andreas Ostling
Holli Sargeant
Huiyuan Xie
Ludwig Bull
Alexander Terenin
Leif Jonsson
Maans Magnusson
Felix Steffek
ELM
AILaw
271
12
0
21 Sep 2023
Learning and DiSentangling Patient Static Information from Time-series Electronic HEalth Record (STEER)
PLOS Digital Health (PDH), 2023
Wei-Duen Liao
J. Voldman
OOD
CML
181
0
0
20 Sep 2023
How to Data in Datathons
Neural Information Processing Systems (NeurIPS), 2023
Carlos Mougan
Richard Plant
Clare Teng
Marya Bazzi
Alvaro Cabregas-Ejea
Ryan Sze-Yin Chan
David Salvador Jasin
Martin Stoffel
K. Whitaker
Jules Manser
161
1
0
18 Sep 2023
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving
IEEE Robotics and Automation Letters (RA-L), 2023
Ahmed Rida Sekkat
Rohit Mohan
Oliver Sawade
Elmar Matthes
Abhinav Valada
363
14
0
12 Sep 2023
Beyond Skin Tone: A Multidimensional Measure of Apparent Skin Color
IEEE International Conference on Computer Vision (ICCV), 2023
William Thong
Przemyslaw K. Joniak
Alice Xiang
281
26
0
10 Sep 2023
Augmenting Chest X-ray Datasets with Non-Expert Annotations
Annual Conference on Medical Image Understanding and Analysis (MIUA), 2023
Veronika Cheplygina
Cathrine Damgaard
Dovile Juodelyte
Veronika Cheplygina
Amelia Jiménez-Sánchez
280
5
0
05 Sep 2023
Previous
1
2
3
...
8
9
10
...
20
21
22
Next