ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.09010
  4. Cited By
Datasheets for Datasets
v1v2v3v4v5v6v7v8 (latest)

Datasheets for Datasets

23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
ArXiv (abs)PDFHTML

Papers citing "Datasheets for Datasets"

50 / 1,069 papers shown
Title
Exploring Data Pipelines through the Process Lens: a Reference Model
  forComputer Vision
Exploring Data Pipelines through the Process Lens: a Reference Model forComputer Vision
Agathe Balayn
B. Kulynych
S. Guerses
139
4
0
05 Jul 2021
Ethics Sheets for AI Tasks
Ethics Sheets for AI Tasks
Saif M. Mohammad
302
34
0
02 Jul 2021
An Information Retrieval Approach to Building Datasets for Hate Speech
  Detection
An Information Retrieval Approach to Building Datasets for Hate Speech Detection
Md. Mustafizur Rahman
Dinesh Balakrishnan
Dhiraj Murthy
Mucahid Kutlu
Matthew Lease
284
24
0
17 Jun 2021
Modeling Worlds in Text
Modeling Worlds in Text
Prithviraj Ammanabrolu
Mark O. Riedl
VGenLM&Ro
107
15
0
17 Jun 2021
Understanding and Evaluating Racial Biases in Image Captioning
Understanding and Evaluating Racial Biases in Image Captioning
Dora Zhao
Angelina Wang
Olga Russakovsky
267
159
0
16 Jun 2021
Physion: Evaluating Physical Prediction from Vision in Humans and
  Machines
Physion: Evaluating Physical Prediction from Vision in Humans and Machines
Daniel M. Bear
E. Wang
Damian Mrowca
Felix Binder
Hsiau-Yu Fish Tung
...
Li Fei-Fei
Nancy Kanwisher
J. Tenenbaum
Daniel L. K. Yamins
Judith E. Fan
OOD
451
116
0
15 Jun 2021
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Ningyu Zhang
Mosha Chen
Zhen Bi
Xiaozhuan Liang
Lei Li
...
Jun Yan
Hongying Zan
Kunli Zhang
Buzhou Tang
Qingcai Chen
LM&MAELM
324
218
0
15 Jun 2021
Simon Says: Evaluating and Mitigating Bias in Pruned Neural Networks
  with Knowledge Distillation
Simon Says: Evaluating and Mitigating Bias in Pruned Neural Networks with Knowledge Distillation
Cody Blakeney
Nathaniel Huish
Yan Yan
Ziliang Zong
119
19
0
15 Jun 2021
A Discussion on Building Practical NLP Leaderboards: The Case of Machine
  Translation
A Discussion on Building Practical NLP Leaderboards: The Case of Machine Translation
Sebastin Santy
Prasanta Bhattacharya
LLMAG
262
4
0
11 Jun 2021
Hard Choices in Artificial Intelligence
Hard Choices in Artificial IntelligenceArtificial Intelligence (AI), 2021
Roel Dobbe
T. Gilbert
Yonatan Dov Mintz
147
67
0
10 Jun 2021
BiToD: A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue
  Modeling
BiToD: A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling
Mohammad Kachuee
Andrea Madotto
Genta Indra Winata
Peng Xu
Feijun Jiang
Yuxiang Hu
Chen Shi
Pascale Fung
237
64
0
05 Jun 2021
MERLOT: Multimodal Neural Script Knowledge Models
MERLOT: Multimodal Neural Script Knowledge ModelsNeural Information Processing Systems (NeurIPS), 2021
Rowan Zellers
Ximing Lu
Jack Hessel
Youngjae Yu
J. S. Park
Jize Cao
Ali Farhadi
Yejin Choi
VLMLRM
300
424
0
04 Jun 2021
Annotation Curricula to Implicitly Train Non-Expert Annotators
Annotation Curricula to Implicitly Train Non-Expert AnnotatorsComputational Linguistics (CL), 2021
Ji-Ung Lee
Jan-Christoph Klie
Iryna Gurevych
207
14
0
04 Jun 2021
The Contestation of Tech Ethics: A Sociotechnical Approach to Technology
  Ethics in Practice
The Contestation of Tech Ethics: A Sociotechnical Approach to Technology Ethics in PracticeJournal of Social Computing (JSC), 2021
Benson K. Green
AILaw
104
70
0
03 Jun 2021
Know Your Model (KYM): Increasing Trust in AI and Machine Learning
Know Your Model (KYM): Increasing Trust in AI and Machine Learning
Mary Roszel
Robert Norvill
Jean Hilger
R. State
181
8
0
31 May 2021
Changing the World by Changing the Data
Changing the World by Changing the DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Anna Rogers
189
77
0
28 May 2021
Towards Knowledge Organization Ecosystems
Towards Knowledge Organization Ecosystems
Mayukh Bagchi
60
0
0
23 May 2021
Measuring Coding Challenge Competence With APPS
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
Basel Alomair
Jacob Steinhardt
ELMAIMatALM
1.1K
897
0
20 May 2021
KLUE: Korean Language Understanding Evaluation
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice Oh
Jung-Woo Ha
Kyunghyun Cho
ELMVLM
452
218
0
20 May 2021
Conversational AI Systems for Social Good: Opportunities and Challenges
Conversational AI Systems for Social Good: Opportunities and Challenges
Peng Qi
Jing Huang
Youzheng Wu
Xiaodong He
Bowen Zhou
231
5
0
13 May 2021
Feature Interactions on Steroids: On the Composition of ML Models
Feature Interactions on Steroids: On the Composition of ML Models
Jane Hsieh
Eunsuk Kang
S. Apel
130
10
0
13 May 2021
Providing Assurance and Scrutability on Shared Data and Machine Learning
  Models with Verifiable Credentials
Providing Assurance and Scrutability on Shared Data and Machine Learning Models with Verifiable CredentialsConcurrency and Computation (CCPE), 2021
I. Barclay
Alun D. Preece
Ian J. Taylor
S. Radha
J. Nabrzyski
140
17
0
13 May 2021
Addressing "Documentation Debt" in Machine Learning Research: A
  Retrospective Datasheet for BookCorpus
Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus
Jack Bandy
Nicholas Vincent
145
68
0
11 May 2021
e-ViL: A Dataset and Benchmark for Natural Language Explanations in
  Vision-Language Tasks
e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language TasksIEEE International Conference on Computer Vision (ICCV), 2021
Maxime Kayser
Oana-Maria Camburu
Leonard Salewski
Cornelius Emde
Virginie Do
Zeynep Akata
Thomas Lukasiewicz
VLM
316
108
0
08 May 2021
What's in the Box? A Preliminary Analysis of Undesirable Content in the
  Common Crawl Corpus
What's in the Box? A Preliminary Analysis of Undesirable Content in the Common Crawl CorpusAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
A. Luccioni
J. Viviano
349
135
0
06 May 2021
Reliability Testing for Natural Language Processing Systems
Reliability Testing for Natural Language Processing SystemsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Samson Tan
Shafiq Joty
K. Baxter
Araz Taeihagh
G. Bennett
Min-Yen Kan
319
42
0
06 May 2021
An Examination of Fairness of AI Models for Deepfake Detection
An Examination of Fairness of AI Models for Deepfake DetectionInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Loc Trinh
Wenshu Fan
CVBM
213
49
0
02 May 2021
SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation
SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation
Robin Shing Moon Chan
Krzysztof Lis
Svenja Uhlemeyer
Hermann Blum
S. Honari
Roland Siegwart
Pascal Fua
Mathieu Salzmann
Matthias Rottmann
UQCV
241
167
0
30 Apr 2021
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean
  Crawled Corpus
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled CorpusConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Jesse Dodge
Maarten Sap
Ana Marasović
William Agnew
Gabriel Ilharco
Dirk Groeneveld
Margaret Mitchell
Matt Gardner
AILaw
285
553
0
18 Apr 2021
Frequency-based Distortions in Contextualized Word Embeddings
Frequency-based Distortions in Contextualized Word Embeddings
Kaitlyn Zhou
Kawin Ethayarajh
Dan Jurafsky
132
23
0
17 Apr 2021
Concadia: Towards Image-Based Text Generation with a Purpose
Concadia: Towards Image-Based Text Generation with a PurposeConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Elisa Kreiss
Fei Fang
Noah D. Goodman
Christopher Potts
198
25
0
16 Apr 2021
Semantic maps and metrics for science Semantic maps and metrics for
  science using deep transformer encoders
Semantic maps and metrics for science Semantic maps and metrics for science using deep transformer encoders
Brendan Chambers
James A. Evans
MedIm
147
0
0
13 Apr 2021
XFORMAL: A Benchmark for Multilingual Formality Style Transfer
XFORMAL: A Benchmark for Multilingual Formality Style TransferNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Eleftheria Briakou
Di Lu
Ke Zhang
Joel R. Tetreault
172
62
0
08 Apr 2021
ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition
ORBIT: A Real-World Few-Shot Dataset for Teachable Object RecognitionIEEE International Conference on Computer Vision (ICCV), 2021
Daniela Massiceti
L. Zintgraf
J. Bronskill
Lida Theodorou
Matthew Tobias Harris
Edward Cutrell
C. Morrison
Katja Hofmann
Simone Stumpf
398
49
0
08 Apr 2021
Question-Driven Design Process for Explainable AI User Experiences
Question-Driven Design Process for Explainable AI User Experiences
Q. V. Liao
Milena Pribić
Jaesik Han
Sarah Miller
Daby M. Sow
302
63
0
08 Apr 2021
The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions
The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions
Jennifer J. Sun
Tomomi Karigo
Dipam Chakraborty
Sharada Mohanty
Benjamin Wild
...
Chen Chen
D. Anderson
Pietro Perona
Yisong Yue
Ann Kennedy
418
62
0
06 Apr 2021
AI4D -- African Language Program
AI4D -- African Language Program
Kathleen Siminyu
Godson Kalipe
D. Orlic
Jade Z. Abbott
Vukosi Marivate
...
T. Diop
Davis David
Chayma Fourati
Hatem Haddad
Malek Naski
130
22
0
06 Apr 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
What Will it Take to Fix Benchmarking in Natural Language Understanding?North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Samuel R. Bowman
George E. Dahl
ELMALM
253
185
0
05 Apr 2021
Visual Semantic Role Labeling for Video Understanding
Visual Semantic Role Labeling for Video UnderstandingComputer Vision and Pattern Recognition (CVPR), 2021
Arka Sadhu
Tanmay Gupta
Mark Yatskar
Ram Nevatia
Aniruddha Kembhavi
VLM
244
88
0
02 Apr 2021
Towards An Ethics-Audit Bot
Towards An Ethics-Audit Bot
Siani Pearson
Martin Lloyd
Vivek Nallur
61
0
0
29 Mar 2021
Automation: An Essential Component Of Ethical AI?
Automation: An Essential Component Of Ethical AI?
Vivek Nallur
Martin Lloyd
Siani Pearson
54
1
0
29 Mar 2021
A Multistakeholder Approach Towards Evaluating AI Transparency
  Mechanisms
A Multistakeholder Approach Towards Evaluating AI Transparency Mechanisms
Ana Lucic
Madhulika Srikumar
Umang Bhatt
Alice Xiang
Ankur Taly
Q. V. Liao
Maarten de Rijke
118
5
0
27 Mar 2021
Characterizing and Detecting Mismatch in Machine-Learning-Enabled
  Systems
Characterizing and Detecting Mismatch in Machine-Learning-Enabled SystemsWorkshop on AI Engineering - Software Engineering for AI (ESEA), 2021
Grace A. Lewis
S. Bellomo
Ipek Ozkaya
130
42
0
25 Mar 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Quality at a Glance: An Audit of Web-Crawled Multilingual DatasetsTransactions of the Association for Computational Linguistics (TACL), 2021
Julia Kreutzer
Isaac Caswell
Lisa Wang
Ahsan Wahab
D. Esch
...
Duygu Ataman
Orevaoghene Ahia
Oghenefego Ahia
Sweta Agrawal
Mofetoluwa Adeyemi
393
309
0
22 Mar 2021
#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
#PraCegoVer: A Large Dataset for Image Captioning in PortugueseInternational Conference on Data Technologies and Applications (DATA), 2021
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
198
12
0
21 Mar 2021
The Human Evaluation Datasheet 1.0: A Template for Recording Details of
  Human Evaluation Experiments in NLP
The Human Evaluation Datasheet 1.0: A Template for Recording Details of Human Evaluation Experiments in NLP
Anastasia Shimorina
Anya Belz
137
37
0
17 Mar 2021
Preregistering NLP Research
Preregistering NLP ResearchNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Emiel van Miltenburg
Chris van der Lee
E. Krahmer
AI4CE
195
24
0
11 Mar 2021
Designing Disaggregated Evaluations of AI Systems: Choices,
  Considerations, and Tradeoffs
Designing Disaggregated Evaluations of AI Systems: Choices, Considerations, and TradeoffsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2021
Solon Barocas
Anhong Guo
Ece Kamar
J. Krones
Meredith Ringel Morris
Jennifer Wortman Vaughan
Duncan Wadsworth
Hanna M. Wallach
159
88
0
10 Mar 2021
Rissanen Data Analysis: Examining Dataset Characteristics via
  Description Length
Rissanen Data Analysis: Examining Dataset Characteristics via Description LengthInternational Conference on Machine Learning (ICML), 2021
Ethan Perez
Douwe Kiela
Dong Wang
186
25
0
05 Mar 2021
A framework for fostering transparency in shared artificial intelligence
  models by increasing visibility of contributions
A framework for fostering transparency in shared artificial intelligence models by increasing visibility of contributionsConcurrency and Computation (CCPE), 2020
I. Barclay
Harrison Taylor
Alun D. Preece
Ian J. Taylor
D. Verma
Geeth de Mel
105
16
0
05 Mar 2021
Previous
123...1819202122
Next