1803.09010
v8 (latest)
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Papers citing "Datasheets for Datasets"
50 / 1,068 papers shown
Title
Evaluating representation learning on the protein structure universe
Arian R. Jamasb
Alex Morehead
Chaitanya K. Joshi
Zuobai Zhang
Kieran Didi
...
Charles Harris
Jian Tang
Jianlin Cheng
Pietro Lio
Tom L. Blundell
SSL
221
22
0
19 Jun 2024
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Yufang Hou
Alessandra Pascale
Javier Carnerero-Cano
T. Tchrakian
Radu Marinescu
Elizabeth M. Daly
Inkit Padhi
P. Sattigeri
156
21
0
19 Jun 2024
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
Rushikesh Zawar
Shaurya Dewan
Andrew F. Luo
Margaret M. Henderson
Michael J. Tarr
Leila Wehbe
VGen
CoGe
138
1
0
19 Jun 2024
Is Your HD Map Constructor Reliable under Sensor Corruptions?
Xiaoshuai Hao
Mengchuan Wei
Yifan Yang
Haimei Zhao
Hui Zhang
Yi Zhou
Qiang Wang
Weiming Li
Lingdong Kong
Jing Zhang
3DV
215
30
0
18 Jun 2024
IDs for AI Systems
Alan Chan
Noam Kolt
Peter Wills
Usman Anwar
Christian Schroeder de Witt
Nitarshan Rajkumar
Lewis Hammond
David M. Krueger
Lennart Heim
Markus Anderljung
305
12
0
17 Jun 2024
Centering Policy and Practice: Research Gaps around Usable Differential Privacy
Rachel Cummings
Jayshree Sarathy
204
11
0
17 Jun 2024
Extrinsic Evaluation of Cultural Competence in Large Language Models
Shaily Bhatt
Fernando Diaz
ELM
EGVM
304
16
0
17 Jun 2024
They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias
Salma Abdel Magid
Jui-Hsien Wang
Kushal Kafle
Hanspeter Pfister
252
2
0
17 Jun 2024
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling
Fengxiang Wang
H. Wang
Haiyan Zhao
Zonghao Guo
Zhenyu Zhong
Long Lan
Wenjing Yang
Jing Zhang
373
0
0
17 Jun 2024
RUPBench: Benchmarking Reasoning Under Perturbations for Robustness Evaluation in Large Language Models
Yuqing Wang
Yun Zhao
LRM
AAML
ELM
215
5
0
16 Jun 2024
Rideshare Transparency: Translating Gig Worker Insights on AI Platform Design to Policy
Varun Nagaraj Rao
Samantha Dalal
Eesha Agarwal
D. Calacci
Andrés Monroy-Hernández
205
11
0
16 Jun 2024
DocNet: Semantic Structure in Inductive Bias Detection Models
Jessica Zhu
Iain Cruickshank
Michel Cukier
218
0
0
16 Jun 2024
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment
Darshana Saravanan
Darshan Singh
Varun Gupta
Zeeshan Khan
Vineet Gandhi
Makarand Tapaswi
CoGe
121
6
0
16 Jun 2024
From Pixels to Prose: A Large Dataset of Dense Image Captions
Vasu Singla
Kaiyu Yue
Sukriti Paul
Reza Shirkavand
Mayuka Jayawardhana
Alireza Ganjdanesh
Heng Huang
A. Bhatele
Gowthami Somepalli
Tom Goldstein
3DV
VLM
254
38
0
14 Jun 2024
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Neural Information Processing Systems (NeurIPS), 2024
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Ivan Rodkin
Dmitry Sorokin
Artyom Sorokin
Andrey Kravchenko
RALM
ALM
LRM
ReLM
ELM
218
130
0
14 Jun 2024
TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs
Neural Information Processing Systems (NeurIPS), 2024
J. Gastinger
Shenyang Huang
Mikhail Galkin
Erfan Loghmani
Ali Parviz
...
Emanuele Rossi
Ioannis Koutis
Heiner Stuckenschmidt
Reihaneh Rabbany
Guillaume Rabusseau
160
21
0
14 Jun 2024
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024
Grigor Bezirganyan
Sana Sellami
Laure Berti-Équille
Sébastien Fournier
245
6
0
14 Jun 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Holy Lovenia
Rahmad Mahendra
Salsabil Maulana Akbar
Lester James V. Miranda
Jennifer Santoso
...
Genta Indra Winata
Ruochen Zhang
Fajri Koto
Zheng-Xin Yong
Samuel Cahyawijaya
439
32
0
14 Jun 2024
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming
Neural Information Processing Systems (NeurIPS), 2024
Victor-Alexandru Pădurean
Adish Singla
ELM
265
5
0
14 Jun 2024
ReMI: A Dataset for Reasoning with Multiple Images
Mehran Kazemi
Nishanth Dikkala
Ankit Anand
Petar Dević
Ishita Dasgupta
...
Bahare Fatemi
Pranjal Awasthi
Dee Guo
Sreenivas Gollapudi
Ahmed Qureshi
LRM
VLM
276
25
0
13 Jun 2024
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Bahare Fatemi
Mehran Kazemi
Anton Tsitsulin
Karishma Malkan
Jinyeong Yim
John Palowitch
Sungyong Seo
Jonathan J. Halcrow
Bryan Perozzi
LRM
298
68
0
13 Jun 2024
SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution
Soufiane Belharbi
Mara K M Whitford
Phuong Hoang
Shakeeb Murtaza
Luke McCaffrey
Eric Granger
166
1
0
13 Jun 2024
ECBD: Evidence-Centered Benchmark Design for NLP
Yu Lu Liu
Su Lin Blodgett
Jackie Chi Kit Cheung
Q. Vera Liao
Alexandra Olteanu
Ziang Xiao
239
18
0
13 Jun 2024
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition
Mehreen Saeed
Adrian Chan
Anupam Mijar
Joseph Moukarzel
Georges Habchi
Carlos Younes
Amin Elias
Chau-Wai Wong
Akram Khater
325
6
0
13 Jun 2024
DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
Mohamed Elrefaie
Florin Morar
Angela Dai
Faez Ahmed
PINN
AI4CE
359
47
0
13 Jun 2024
Are Large Language Models Good Statisticians?
Yizhang Zhu
Shiyin Du
Boyan Li
Yuyu Luo
Nan Tang
ELM
187
33
0
12 Jun 2024
DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses
Abdurrahim Yilmaz
Sirin Pekcan Yasar
G. Gencoglan
Burak Temelkuran
95
11
0
11 Jun 2024
A Taxonomy of Challenges to Curating Fair Datasets
Dora Zhao
M. Scheuerman
Pooja Chitre
Jerone T. A. Andrews
Georgia Panagiotidou
Shawn Walker
Kathleen H. Pine
Alice Xiang
257
3
0
10 Jun 2024
STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics
Jiawen Chen
Muqing Zhou
Wenrong Wu
Jinwei Zhang
Yun Li
Didong Li
219
23
0
10 Jun 2024
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
Andrew M. Bean
Simi Hellsten
Harry Mayne
Jabez Magomere
Ethan A. Chi
Ryan A. Chi
Scott A. Hale
Hannah Rose Kirk
ELM
LRM
310
23
0
10 Jun 2024
Situated Ground Truths: Enhancing Bias-Aware AI by Situating Data Labels with SituAnnotate
Delfina Sol Martinez Pandiani
Valentina Presutti
165
1
0
10 Jun 2024
LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models
Lukas Helff
Felix Friedrich
Manuel Brack
Kristian Kersting
P. Schramowski
VLM
344
1
0
07 Jun 2024
Reconfiguring Participatory Design to Resist AI Realism
Aakash Gautam
208
5
0
05 Jun 2024
A Standardized Machine-readable Dataset Documentation Format for Responsible AI
Nitisha Jain
Mubashara Akhtar
Joan Giner-Miguelez
Rajat Shinde
Joaquin Vanschoren
...
Costanza Conforti
Michael Kuchnik
Lora Aroyo
Omar Benjelloun
Elena Simperl
160
6
0
04 Jun 2024
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Li Lin
Santosh
Xin Eric Wang
Shu Hu
EGVM
637
27
0
02 Jun 2024
Gender Bias Detection in Court Decisions: A Brazilian Case Study
Raysa Benatti
F. Severi
Sandra Avila
Esther Luna Colombini
181
4
0
01 Jun 2024
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
Chunhui Zhang
Li Liu
Guanjie Huang
Hao Wen
Xi Zhou
Yanfeng Wang
298
21
0
30 May 2024
A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models
Hirofumi Tsuruta
Hiroyuki Yamazaki
R. Maeda
Ryotaro Tamura
Akihiro Imura
105
2
0
29 May 2024
Artificial Intelligence in Industry 4.0: A Review of Integration Challenges for Industrial Systems
Alexander Windmann
Philipp Wittenberg
Marvin Schieseck
Oliver Niggemann
AI4CE
206
17
0
28 May 2024
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in Text-to-Image Models
Hanjun Luo
Ziye Deng
Ruizhe Chen
Zuo-Qiang Liu
EGVM
467
11
0
28 May 2024
Posts of Peril: Detecting Information About Hazards in Text
Keith Burghardt
D. Fessler
Chyna Tang
Anne C. Pisor
Kristina Lerman
258
0
0
28 May 2024
Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP
Vagrant Gautam
Arjun Subramonian
Anne Lauscher
O. Keyes
216
15
0
27 May 2024
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text
Han Yu
Peikun Guo
Akane Sano
166
33
0
26 May 2024
Understanding Stakeholders' Perceptions and Needs Across the LLM Supply Chain
Agathe Balayn
Lorenzo Corti
Fanny Rancourt
Fabio Casati
U. Gadiraju
166
6
0
25 May 2024
Paths of A Million People: Extracting Life Trajectories from Wikipedia
Ying Zhang
Xiaofeng Li
Zhaoyang Liu
Haipeng Zhang
159
0
0
25 May 2024
A Multilingual Similarity Dataset for News Article Frame
Xi Chen
Mattia Samory
Scott A. Hale
David Jurgens
Przemyslaw A. Grabowicz
193
2
0
22 May 2024
Pragmatic auditing: a pilot-driven approach for auditing Machine Learning systems
Djalel Benbouzid
Christiane Plociennik
Laura Lucaj
Mihai Maftei
Iris Merget
A. Burchardt
Marc P. Hauer
Abdeldjallil Naceri
Patrick van der Smagt
MLAU
115
0
0
21 May 2024
Cascade-based Randomization for Inferring Causal Effects under Diffusion Interference
Zahra Fatemi
Jean Pouget-Abadie
Elena Zheleva
CML
193
0
0
20 May 2024
On Efficient and Statistical Quality Estimation for Data Annotation
Jan-Christoph Klie
Juan Haladjian
Marc Kirchner
Rahul Nair
180
6
0
20 May 2024
Societal Adaptation to Advanced AI
Jamie Bernardi
Gabriel Mukobi
Hilary Greaves
Lennart Heim
Markus Anderljung
382
13
0
16 May 2024