Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.09010
Cited By
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Datasheets for Datasets"
50 / 966 papers shown
Title
Understanding and Evaluating Racial Biases in Image Captioning
Dora Zhao
Angelina Wang
Olga Russakovsky
16
134
0
16 Jun 2021
Physion: Evaluating Physical Prediction from Vision in Humans and Machines
Daniel M. Bear
E. Wang
Damian Mrowca
Felix Binder
Hsiau-Yu Fish Tung
...
Li Fei-Fei
Nancy Kanwisher
J. Tenenbaum
Daniel L. K. Yamins
Judith E. Fan
OOD
45
86
0
15 Jun 2021
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Ningyu Zhang
Mosha Chen
Zhen Bi
Xiaozhuan Liang
Lei Li
...
Jun Yan
Hongying Zan
Kunli Zhang
Buzhou Tang
Qingcai Chen
LM&MA
ELM
20
178
0
15 Jun 2021
Simon Says: Evaluating and Mitigating Bias in Pruned Neural Networks with Knowledge Distillation
Cody Blakeney
Nathaniel Huish
Yan Yan
Ziliang Zong
13
18
0
15 Jun 2021
A Discussion on Building Practical NLP Leaderboards: The Case of Machine Translation
Sebastin Santy
Prasanta Bhattacharya
LLMAG
30
2
0
11 Jun 2021
Hard Choices in Artificial Intelligence
Roel Dobbe
T. Gilbert
Yonatan Dov Mintz
19
52
0
10 Jun 2021
BiToD: A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling
Zhaojiang Lin
Andrea Madotto
Genta Indra Winata
Peng-Tao Xu
Feijun Jiang
Yuxiang Hu
Chen Shi
Pascale Fung
19
61
0
05 Jun 2021
MERLOT: Multimodal Neural Script Knowledge Models
Rowan Zellers
Ximing Lu
Jack Hessel
Youngjae Yu
J. S. Park
Jize Cao
Ali Farhadi
Yejin Choi
VLM
LRM
22
372
0
04 Jun 2021
Annotation Curricula to Implicitly Train Non-Expert Annotators
Ji-Ung Lee
Jan-Christoph Klie
Iryna Gurevych
8
11
0
04 Jun 2021
The Contestation of Tech Ethics: A Sociotechnical Approach to Technology Ethics in Practice
Benson K. Green
AILaw
17
53
0
03 Jun 2021
Know Your Model (KYM): Increasing Trust in AI and Machine Learning
Mary Roszel
Robert Norvill
Jean Hilger
R. State
17
4
0
31 May 2021
Changing the World by Changing the Data
Anna Rogers
14
71
0
28 May 2021
Towards Knowledge Organization Ecosystems
Mayukh Bagchi
20
0
0
23 May 2021
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
194
623
0
20 May 2021
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice H. Oh
Jung-Woo Ha
Kyunghyun Cho
ELM
VLM
14
191
0
20 May 2021
Conversational AI Systems for Social Good: Opportunities and Challenges
Peng Qi
Jing Huang
Youzheng Wu
Xiaodong He
Bowen Zhou
11
5
0
13 May 2021
Feature Interactions on Steroids: On the Composition of ML Models
Christian Kastner
Eunsuk Kang
S. Apel
16
7
0
13 May 2021
Providing Assurance and Scrutability on Shared Data and Machine Learning Models with Verifiable Credentials
I. Barclay
Alun D. Preece
Ian J. Taylor
S. Radha
J. Nabrzyski
14
14
0
13 May 2021
Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus
Jack Bandy
Nicholas Vincent
11
57
0
11 May 2021
e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks
Maxime Kayser
Oana-Maria Camburu
Leonard Salewski
Cornelius Emde
Virginie Do
Zeynep Akata
Thomas Lukasiewicz
VLM
21
100
0
08 May 2021
What's in the Box? A Preliminary Analysis of Undesirable Content in the Common Crawl Corpus
A. Luccioni
J. Viviano
10
113
0
06 May 2021
Reliability Testing for Natural Language Processing Systems
Samson Tan
Shafiq R. Joty
K. Baxter
Araz Taeihagh
G. Bennett
Min-Yen Kan
8
38
0
06 May 2021
An Examination of Fairness of AI Models for Deepfake Detection
Loc Trinh
Y. Liu
CVBM
77
35
0
02 May 2021
SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation
Robin Shing Moon Chan
Krzysztof Lis
Svenja Uhlemeyer
Hermann Blum
S. Honari
Roland Siegwart
Pascal Fua
Mathieu Salzmann
Matthias Rottmann
UQCV
13
136
0
30 Apr 2021
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus
Jesse Dodge
Maarten Sap
Ana Marasović
William Agnew
Gabriel Ilharco
Dirk Groeneveld
Margaret Mitchell
Matt Gardner
AILaw
21
424
0
18 Apr 2021
Frequency-based Distortions in Contextualized Word Embeddings
Kaitlyn Zhou
Kawin Ethayarajh
Dan Jurafsky
11
23
0
17 Apr 2021
Concadia: Towards Image-Based Text Generation with a Purpose
Elisa Kreiss
Fei Fang
Noah D. Goodman
Christopher Potts
14
23
0
16 Apr 2021
Semantic maps and metrics for science Semantic maps and metrics for science using deep transformer encoders
Brendan Chambers
James A. Evans
MedIm
8
0
0
13 Apr 2021
XFORMAL: A Benchmark for Multilingual Formality Style Transfer
Eleftheria Briakou
Di Lu
Ke Zhang
Joel R. Tetreault
16
55
0
08 Apr 2021
ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition
Daniela Massiceti
L. Zintgraf
J. Bronskill
Lida Theodorou
Matthew Tobias Harris
Edward Cutrell
C. Morrison
Katja Hofmann
Simone Stumpf
8
44
0
08 Apr 2021
Question-Driven Design Process for Explainable AI User Experiences
Q. V. Liao
Milena Pribić
Jaesik Han
Sarah Miller
Daby M. Sow
15
52
0
08 Apr 2021
The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions
Jennifer J. Sun
Tomomi Karigo
Dipam Chakraborty
Sharada Mohanty
Benjamin Wild
...
Chen Chen
D. Anderson
Pietro Perona
Yisong Yue
Ann Kennedy
22
47
0
06 Apr 2021
AI4D -- African Language Program
Kathleen Siminyu
Godson Kalipe
D. Orlic
Jade Z. Abbott
Vukosi Marivate
...
T. Diop
Davis David
Chayma Fourati
Hatem Haddad
Malek Naski
11
21
0
06 Apr 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
Samuel R. Bowman
George E. Dahl
ELM
ALM
23
156
0
05 Apr 2021
Visual Semantic Role Labeling for Video Understanding
Arka Sadhu
Tanmay Gupta
Mark Yatskar
Ram Nevatia
Aniruddha Kembhavi
VLM
14
68
0
02 Apr 2021
Towards An Ethics-Audit Bot
Siani Pearson
Martin Lloyd
Vivek Nallur
20
0
0
29 Mar 2021
Automation: An Essential Component Of Ethical AI?
Vivek Nallur
Martin Lloyd
Siani Pearson
14
1
0
29 Mar 2021
A Multistakeholder Approach Towards Evaluating AI Transparency Mechanisms
Ana Lucic
Madhulika Srikumar
Umang Bhatt
Alice Xiang
Ankur Taly
Q. V. Liao
Maarten de Rijke
20
5
0
27 Mar 2021
Characterizing and Detecting Mismatch in Machine-Learning-Enabled Systems
Grace A. Lewis
S. Bellomo
Ipek Ozkaya
6
35
0
25 Mar 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Julia Kreutzer
Isaac Caswell
Lisa Wang
Ahsan Wahab
D. Esch
...
Duygu Ataman
Orevaoghene Ahia
Oghenefego Ahia
Sweta Agrawal
Mofetoluwa Adeyemi
20
265
0
22 Mar 2021
#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
23
10
0
21 Mar 2021
The Human Evaluation Datasheet 1.0: A Template for Recording Details of Human Evaluation Experiments in NLP
Anastasia Shimorina
Anya Belz
14
34
0
17 Mar 2021
Preregistering NLP Research
Emiel van Miltenburg
Chris van der Lee
E. Krahmer
AI4CE
13
22
0
11 Mar 2021
Designing Disaggregated Evaluations of AI Systems: Choices, Considerations, and Tradeoffs
Solon Barocas
Anhong Guo
Ece Kamar
J. Krones
Meredith Ringel Morris
Jennifer Wortman Vaughan
Duncan Wadsworth
Hanna M. Wallach
8
74
0
10 Mar 2021
Rissanen Data Analysis: Examining Dataset Characteristics via Description Length
Ethan Perez
Douwe Kiela
Kyunghyun Cho
14
24
0
05 Mar 2021
A framework for fostering transparency in shared artificial intelligence models by increasing visibility of contributions
I. Barclay
Harrison Taylor
Alun D. Preece
Ian J. Taylor
D. Verma
Geeth de Mel
14
13
0
05 Mar 2021
Representation Matters: Assessing the Importance of Subgroup Allocations in Training Data
Esther Rolf
Theodora Worledge
Benjamin Recht
Michael I. Jordan
17
31
0
05 Mar 2021
Hypothesis Testing for Class-Conditional Label Noise
Rafael Poyiadzi
Weisong Yang
Niall Twomey
Raúl Santos-Rodríguez
NoLa
16
0
0
03 Mar 2021
Documentation Matters: Human-Centered AI System to Assist Data Science Code Documentation in Computational Notebooks
A. Wang
Dakuo Wang
Jaimie Drozdal
Michael J. Muller
Soya Park
Justin D. Weisz
Xuye Liu
Lingfei Wu
Casey Dugan
44
63
0
24 Feb 2021
Teach Me to Explain: A Review of Datasets for Explainable Natural Language Processing
Sarah Wiegreffe
Ana Marasović
XAI
11
141
0
24 Feb 2021
Previous
1
2
3
...
16
17
18
19
20
Next