Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.09010
Cited By
v1
v2
v3
v4
v5
v6
v7
v8 (latest)
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Datasheets for Datasets"
50 / 1,069 papers shown
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
Neural Information Processing Systems (NeurIPS), 2023
Kaiyu Yang
Aidan M. Swope
Alex Gu
Rahul Chalamala
Peiyang Song
Shixing Yu
Saad Godil
R. Prenger
Anima Anandkumar
RALM
376
338
0
27 Jun 2023
Use case cards: a use case reporting framework inspired by the European AI Act
Ethics and Information Technology (EIT), 2023
Isabelle Hupont
David Fernández Llorca
S. Baldassarri
Emilia Gómez
154
32
0
23 Jun 2023
Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval
Katrin Glinka
Claudia Muller-Birn
97
16
0
22 Jun 2023
Realistic Synthetic Financial Transactions for Anti-Money Laundering Models
Neural Information Processing Systems (NeurIPS), 2023
Erik Altman
Jovan Blanuvsa
Luc von Niederhäusern
Béni Egressy
Andreea Anghel
Kubilay Atasu
346
76
0
22 Jun 2023
Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities
Xudong Shen
H. Brown
Jiashu Tao
Martin Strobel
Yao Tong
Akshay Narayan
Harold Soh
Finale Doshi-Velez
334
3
0
22 Jun 2023
VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Neural Information Processing Systems (NeurIPS), 2023
Elizaveta Semenova
F. G. Abrantes
Hanwen Zhu
Grace A. Sodunke
Aleksandar Shtedritski
Hannah Rose Kirk
CoGe
377
57
0
21 Jun 2023
An Overview of Catastrophic AI Risks
Dan Hendrycks
Mantas Mazeika
Thomas Woodside
SILM
600
247
0
21 Jun 2023
Benchmarking the Influence of Pre-training on Explanation Performance in MR Image Classification
Marta Oliveira
Rick Wilming
Benedict Clark
Céline Budding
Fabian Eitel
K. Ritter
Stefan Haufe
180
1
0
21 Jun 2023
Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events
Neural Information Processing Systems (NeurIPS), 2023
Matthew B. A. McDermott
Bret A. Nestor
Peniel Argaw
I. Kohane
AI4TS
372
41
0
20 Jun 2023
Quilt-1M: One Million Image-Text Pairs for Histopathology
Neural Information Processing Systems (NeurIPS), 2023
Wisdom O. Ikezogwo
M. S. Seyfioglu
Fatemeh Ghezloo
Dylan Stefan Chan Geva
Fatwir Sheikh Mohammed
Pavan Kumar Anand
Ranjay Krishna
Linda G. Shapiro
CLIP
VLM
736
196
0
20 Jun 2023
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification
IEEE Transactions on Big Data (IEEE Trans. Big Data), 2023
Le-le Cao
Vilhelm von Ehrenheim
Mark Granroth-Wilding
Richard Anselmo Stahl
Andrew McCornack
Armin Catovic
Dhiana Deva Cavalcanti Rocha
303
4
0
18 Jun 2023
The Importance of Human-Labeled Data in the Era of LLMs
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Yang Liu
ALM
239
11
0
18 Jun 2023
Reproducibility in NLP: What Have We Learned from the Checklist?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ian H. Magnusson
Noah A. Smith
Jesse Dodge
170
13
0
16 Jun 2023
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Neural Information Processing Systems (NeurIPS), 2023
Kazuki Shimada
Archontis Politis
Parthasaarathy Sudarsanam
D. Krause
Kengo Uchida
...
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Maria Sandsten
Yuki Mitsufuji
273
87
0
15 Jun 2023
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion
International Conference on Machine Learning (ICML), 2023
Isha Rawal
Alexander Matyasko
Shantanu Jaiswal
Basura Fernando
Cheston Tan
276
7
0
15 Jun 2023
LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting
Neural Information Processing Systems (NeurIPS), 2023
Xu Liu
Yutong Xia
Yuxuan Liang
Junfeng Hu
Yiwei Wang
Mengwei He
Chaoqin Huang
Zhenguang Liu
Bryan Hooi
Roger Zimmermann
AI4TS
180
146
0
14 Jun 2023
V-LoL: A Diagnostic Dataset for Visual Logical Learning
Lukas Helff
Wolfgang Stammer
Hikaru Shindo
Devendra Singh Dhami
Kristian Kersting
NAI
302
9
0
13 Jun 2023
Unraveling the Interconnected Axes of Heterogeneity in Machine Learning for Democratic and Inclusive Advancements
Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Maryam Molamohammadi
Afaf Taik
Nicolas Le Roux
G. Farnadi
194
2
0
11 Jun 2023
Evaluating the Social Impact of Generative AI Systems in Systems and Society
Irene Solaiman
Zeerak Talat
William Agnew
Lama Ahmad
Dylan K. Baker
...
Marie-Therese Png
Shubham Singh
A. Strait
Lukas Struppek
Arjun Subramonian
ELM
EGVM
486
150
0
09 Jun 2023
AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs
Neural Information Processing Systems (NeurIPS), 2023
Adam D. Cobb
Anirban Roy
Daniel Elenius
F. M. Heim
Brian Swenson
...
Theodore Bapty
Joseph Hite
K. Ramani
Christopher McComb
Susmit Jha
176
16
0
08 Jun 2023
Explainable Predictive Maintenance
Sepideh Pashami
Sławomir Nowaczyk
Yuantao Fan
Jakub Jakubowski
Nuno Paiva
...
Bruno Veloso
M. Sayed-Mouchaweh
L. Rajaoarisoa
Grzegorz J. Nalepa
João Gama
213
19
0
08 Jun 2023
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Jielin Qiu
Jiacheng Zhu
William Jongwon Han
Aditesh Kumar
Karthik Mittal
...
Linjie Li
Jianfeng Wang
Ding Zhao
Bo Li
Lijuan Wang
VGen
231
14
0
07 Jun 2023
Art and the science of generative AI: A deeper dive
Science (Science), 2023
Ziv Epstein
Aaron Hertzmann
L. Herman
Robert Mahari
M. Frank
...
Jessica Fjeld
Hany Farid
Neil Leach
Alex Pentland
Olga Russakovsky
271
494
0
07 Jun 2023
Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models
Jose Berengueres
Marybeth Sandell
181
0
0
06 Jun 2023
AVIDa-hIL6: A Large-Scale VHH Dataset Produced from an Immunized Alpaca for Predicting Antigen-Antibody Interactions
Neural Information Processing Systems (NeurIPS), 2023
Hirofumi Tsuruta
Hiroyuki Yamazaki
R. Maeda
Ryotaro Tamura
Jennifer Wei
...
Poomarin Phloyphisut
H. Shimokawa
J. Ledsam
Lucy J. Colwell
Akihiro Imura
115
10
0
06 Jun 2023
AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms
Zana Buçinca
Chau Minh Pham
Maurice Jakesch
Marco Tulio Ribeiro
Alexandra Olteanu
Saleema Amershi
205
45
0
05 Jun 2023
NLPositionality: Characterizing Design Biases of Datasets and Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sebastin Santy
Jenny T Liang
Ronan Le Bras
Katharina Reinecke
Maarten Sap
317
106
0
02 Jun 2023
AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap
Q. V. Liao
J. Vaughan
318
222
0
02 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Michael Stephen Saxon
William Yang Wang
EGVM
140
10
0
02 Jun 2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Guilherme Penedo
Quentin Malartic
Daniel Hesslow
Ruxandra-Aimée Cojocaru
Alessandro Cappelli
Hamza Alobeidli
B. Pannier
Ebtesam Almazrouei
Julien Launay
422
881
0
01 Jun 2023
Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness?
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
EGVM
167
18
0
28 May 2023
Optimization's Neglected Normative Commitments
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Benjamin Laufer
T. Gilbert
Helen Nissenbaum
OffRL
218
8
0
27 May 2023
On Degrees of Freedom in Defining and Testing Natural Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Saku Sugawara
S. Tsugita
ELM
326
2
0
24 May 2023
TalkUp: Paving the Way for Understanding Empowering Language
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lucille Njoo
Chan Young Park
Octavia Stappart
Marvin Thielk
Yi Chu
Yulia Tsvetkov
254
4
0
23 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
678
1,406
0
17 May 2023
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing
Hua Shen
Huang Chieh-Yang
Tongshuang Wu
Ting-Hao 'Kenneth' Huang
457
45
0
16 May 2023
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Arjun Subramonian
Xingdi Yuan
Hal Daumé
Su Lin Blodgett
205
22
0
15 May 2023
DATED: Guidelines for Creating Synthetic Datasets for Engineering Design Applications
Design Automation Conference (DAC), 2023
Cyril Picard
Jürg Schiffmann
Faez Ahmed
167
14
0
15 May 2023
PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ashok Urlana
Pinzhen Chen
Zheng Zhao
Shay B. Cohen
Manish Shrivastava
Barry Haddow
185
13
0
15 May 2023
Certification Labels for Trustworthy AI: Insights From an Empirical Mixed-Method Study
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Nicolas Scharowski
Michaela Benk
S. J. Kühne
Léane Wettstein
Florian Brühlmann
199
33
0
15 May 2023
What's the Meaning of Superhuman Performance in Today's NLU?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Simone Tedeschi
Johan Bos
T. Declerck
Jan Hajic
Daniel Hershcovich
...
Simon Krek
Steven Schockaert
Rico Sennrich
Ekaterina Shutova
Roberto Navigli
ELM
LM&MA
VLM
ReLM
LRM
309
37
0
15 May 2023
The Ethics of AI in Games
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023
Dávid Melhárt
Julian Togelius
Benedikte Mikkelsen
Christoffer Holmgård
Georgios N. Yannakakis
155
30
0
12 May 2023
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Rahul Aralikatte
Ziling Cheng
Sumanth Doddapaneni
Jackie C.K. Cheung
270
10
0
10 May 2023
When Do Neural Nets Outperform Boosted Trees on Tabular Data?
Neural Information Processing Systems (NeurIPS), 2023
Duncan C. McElfresh
Sujay Khandagale
Jonathan Valverde
C. VishakPrasad
Ben Feuer
Chinmay Hegde
Ganesh Ramakrishnan
Micah Goldblum
Colin White
LMTD
305
248
0
04 May 2023
AutoML-GPT: Automatic Machine Learning with GPT
Shujian Zhang
Chengyue Gong
Lemeng Wu
Xingchao Liu
Mi Zhou
LLMAG
317
89
0
04 May 2023
Judgment Sieve: Reducing Uncertainty in Group Judgments through Interventions Targeting Ambiguity versus Disagreement
Quan Ze Chen
Amy X. Zhang
199
12
0
02 May 2023
SoK: Log Based Transparency Enhancing Technologies
A. Hicks
200
3
0
02 May 2023
Racial Bias within Face Recognition: A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Seyma Yucer
Furkan Tektas
Noura Al Moubayed
T. Breckon
FaML
235
26
0
01 May 2023
Generating Process-Centric Explanations to Enable Contestability in Algorithmic Decision-Making: Challenges and Opportunities
Mireia Yurrita
Agathe Balayn
U. Gadiraju
183
3
0
01 May 2023
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kent K. Chang
Mackenzie Cramer
Sandeep Soni
David Bamman
RALM
591
163
0
28 Apr 2023
Previous
1
2
3
...
10
11
12
...
20
21
22
Next