arXiv:1803.09010
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Papers citing "Datasheets for Datasets"
50 / 966 papers shown
Advances in Automatically Rating the Trustworthiness of Text Processing Services
Biplav Srivastava
Kausik Lakkaraju
Mariana Bernagozzi
Marco Valtorta
30
6
0
04 Feb 2023
Lived Experience Matters: Automatic Detection of Stigma on Social Media Toward People Who Use Substances
Salvatore Giorgi
Douglas Bellew
Daniel Roy Sadek Habib
G. Sherman
Joao Sedoc
Chase Smitterberg
Amanda Devoto
McKenzie Himelein-Wachowiak
Brenda L. Curtis
19
3
0
04 Feb 2023
Out of Context: Investigating the Bias and Fairness Concerns of "Artificial Intelligence as a Service"
Kornel Lewicki
M. S. Lee
Jennifer Cobbe
Jatinder Singh
29
21
0
02 Feb 2023
TAPS Responsibility Matrix: A tool for responsible data science by design
V. Urovi
R. Çelebi
Chang Sun
Linda Rieswijk
M. Erard
Arif Yilmaz
Kodylan Moodley
Parveen Kumar
Michel Dumontier
13
1
0
02 Feb 2023
Charting the Sociotechnical Gap in Explainable AI: A Framework to Address the Gap in XAI
Upol Ehsan
Koustuv Saha
M. D. Choudhury
Mark O. Riedl
18
57
0
01 Feb 2023
Mathematical Capabilities of ChatGPT
Simon Frieder
Luca Pinchetti
Alexis Chevalier
Ryan-Rhys Griffiths
Tommaso Salvatori
Thomas Lukasiewicz
P. Petersen
Julius Berner
ELM
AI4MH
27
402
0
31 Jan 2023
Designing Data: Proactive Data Collection and Iteration for Machine Learning
Aspen K. Hopkins
Fred Hohman
Luca Zappella
Xavier Suau Cuadros
Dominik Moritz
17
6
0
24 Jan 2023
Unveiling the Risks of NFT Promotion Scams
S. Roy
Dipanjan Das
Priyanka Bose
Christopher Kruegel
Giovanni Vigna
Shirin Nilizadeh
20
12
0
24 Jan 2023
Simplistic Collection and Labeling Practices Limit the Utility of Benchmark Datasets for Twitter Bot Detection
Chris Hays
Zachary Schutzman
Manish Raghavan
Erin Walk
Philipp Zimmer
30
30
0
17 Jan 2023
PlasmoFAB: A Benchmark to Foster Machine Learning for Plasmodium falciparum Protein Antigen Candidate Prediction
Jonas C. Ditz
Jacqueline Wistuba-Hamprecht
Timo Maier
Rolf Fendel
N. Pfeifer
Bernhard Reuter
16
1
0
16 Jan 2023
Computational Assessment of Hyperpartisanship in News Titles
Hanjia Lyu
Jinsheng Pan
Zichen Wang
Jiebo Luo
11
6
0
16 Jan 2023
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets
Shuo Sun
Molei Qin
Xinrun Wang
Bo An
FaML
OffRL
AIFin
22
4
0
14 Jan 2023
How Data Scientists Review the Scholarly Literature
Sheshera Mysore
Mahmood Jasim
Haoru Song
Sarah Akbar
Andre Kenneth Chase Randall
Narges Mahyar
AI4CE
21
8
0
10 Jan 2023
EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset
Hao Tang
Kevin J Liang
Matt Feiszli
Weiyao Wang
EgoV
27
15
0
09 Jan 2023
AI Maintenance: A Robustness Perspective
Pin-Yu Chen
Payel Das
9
12
0
08 Jan 2023
FATE in AI: Towards Algorithmic Inclusivity and Accessibility
Isa Inuwa-Dutse
29
7
0
03 Jan 2023
Large Language Models Encode Clinical Knowledge
K. Singhal
Shekoofeh Azizi
T. Tu
S. S. Mahdavi
Jason W. Wei
...
A. Rajkomar
Joelle Barral
Christopher Semturs
Alan Karthikesalingam
Vivek Natarajan
LM&MA
ELM
AI4MH
19
2,171
0
26 Dec 2022
Introduction to Machine Learning for Physicians: A Survival Guide for Data Deluge
Ricards Marcinkevics
Ece Ozkan
Julia E. Vogt
OOD
LM&MA
FedML
26
2
0
23 Dec 2022
Contrastive Language-Vision AI Models Pretrained on Web-Scraped Multimodal Data Exhibit Sexual Objectification Bias
Robert Wolfe
Yiwei Yang
Billy Howe
Aylin Caliskan
DiffM
13
51
0
21 Dec 2022
Trustworthy Social Bias Measurement
Rishi Bommasani
Percy Liang
27
10
0
20 Dec 2022
Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization
Lining Zhang
Simon Mille
Yufang Hou
Daniel Deutsch
Elizabeth Clark
...
Saad Mahamood
Sebastian Gehrmann
Miruna Clinciu
Khyathi Raghavi Chandu
João Sedoc
6
10
0
20 Dec 2022
Efficient aggregation of face embeddings for decentralized face recognition deployments (extended version)
Philipp Hofer
Michael Roland
Philipp Schwarz
René Mayrhofer
CVBM
FedML
19
3
0
20 Dec 2022
Beyond Digital "Echo Chambers": The Role of Viewpoint Diversity in Political Discussion
Rishav Hada
A. E. Fard
Sarah Shugars
Federico Bianchi
Patrícia G. C. Rossini
Dirk Hovy
Rebekah Tromble
N. Tintarev
16
7
0
18 Dec 2022
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding Systems
Akshatha Arodi
Martin Pömsl
Kaheer Suleman
Adam Trischler
Alexandra Olteanu
Jackie C.K. Cheung
ELM
25
5
0
15 Dec 2022
Tensions Between the Proxies of Human Values in AI
Teresa Datta
D. Nissani
Max Cembalest
Akash Khanna
Haley Massa
John P. Dickerson
34
2
0
14 Dec 2022
Trust, but Verify: Cross-Modality Fusion for HD Map Change Detection
John Lambert
James Hays
26
28
0
14 Dec 2022
Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining
Florian Tramèr
Gautam Kamath
Nicholas Carlini
SILM
46
67
0
13 Dec 2022
Measuring Data
Margaret Mitchell
A. Luccioni
Nathan Lambert
Marissa Gerchick
Angelina McMillan-Major
Ezinwanne Ozoani
Nazneen Rajani
Tristan Thrush
Yacine Jernite
Douwe Kiela
27
16
0
09 Dec 2022
Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks
Jiaqi Ma
Xingjian Zhang
Hezheng Fan
Jin Huang
Tianyue Li
Tinghong Li
Yiwen Tu
Chen Zhu
Qiaozhu Mei
35
5
0
08 Dec 2022
Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Ana Kotarcic
Dominik Hangartner
Fabrizio Gilardi
Selina Kurer
K. Donnay
24
2
0
05 Dec 2022
The Grind for Good Data: Understanding ML Practitioners' Struggles and Aspirations in Making Good Data
Inha Cha
Juhyun Oh
Cheul Young Park
Jiyoon Han
Hwalsuk Lee
26
2
0
28 Nov 2022
The Principles of Data-Centric AI (DCAI)
M. H. Jarrahi
Ali Memariani
Shion Guha
24
55
0
26 Nov 2022
Elements of effective machine learning datasets in astronomy
Bernie Boscoe
Tuan Do
E. Jones
Yunqiang Li
Kevin Alfaro
Christy Ma
27
2
0
25 Nov 2022
Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation
Sérgio Jesus
José P. Pombal
Duarte M. Alves
André F. Cruz
Pedro Saleiro
Rita P. Ribeiro
João Gama
P. Bizarro
40
32
0
24 Nov 2022
Video compression dataset and benchmark of learning-based video-quality metrics
Anastasia Antsiferova
Sergey Lavrushkin
Maksim Smirnov
Alexander Gushchin
D. Vatolin
D. Kulikov
23
30
0
22 Nov 2022
The Stack: 3 TB of permissively licensed source code
Denis Kocetkov
Raymond Li
Loubna Ben Allal
Jia Li
Chenghao Mou
...
Sean M. Hughes
Thomas Wolf
Dzmitry Bahdanau
Leandro von Werra
H. D. Vries
58
307
0
20 Nov 2022
SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation
Yi Wang
Nassim Ait Ali Braham
Zhitong Xiong
Chenying Liu
C. Albrecht
Xiao Xiang Zhu
26
71
0
13 Nov 2022
Seamful XAI: Operationalizing Seamful Design in Explainable AI
Upol Ehsan
Q. V. Liao
Samir Passi
Mark O. Riedl
Hal Daumé
22
20
0
12 Nov 2022
Debiasing Methods for Fairer Neural Models in Vision and Language Research: A Survey
Otávio Parraga
Martin D. Móre
C. M. Oliveira
Nathan Gavenski
L. S. Kupssinskü
Adilson Medronha
L. V. Moura
Gabriel S. Simões
Rodrigo C. Barros
42
11
0
10 Nov 2022
An Inclusive Notion of Text
Ilia Kuznetsov
Iryna Gurevych
22
0
0
10 Nov 2022
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
37
270
0
09 Nov 2022
DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
Nabeel Seedat
F. Imrie
M. Schaar
27
12
0
09 Nov 2022
The Legal Argument Reasoning Task in Civil Procedure
Leonard Bongard
Lena Held
Ivan Habernal
AILaw
ELM
11
19
0
05 Nov 2022
SMAuC -- The Scientific Multi-Authorship Corpus
Janek Bevendorff
Philipp Sauer
Lukas Gienapp
Wolfgang Kircheis
Erik Korner
Benno Stein
Martin Potthast
19
0
0
04 Nov 2022
ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations
Badr Youbi Idrissi
Diane Bouchacourt
Randall Balestriero
Ivan Evtimov
C. Hazirbas
Nicolas Ballas
Pascal Vincent
M. Drozdzal
David Lopez-Paz
Mark Ibrahim
VLM
ViT
39
45
0
03 Nov 2022
My Face My Choice: Privacy Enhancing Deepfakes for Social Media Anonymization
U. Ciftci
Gokturk Yuksek
Ilke Demir
PICV
CVBM
16
17
0
02 Nov 2022
Developing Modular Autonomous Capabilities for sUAS Operations
Keegan Quigley
V. Goodwin
Luis E. Alvarez
Justin Yao
Yousef Salaman Maclara
26
0
0
01 Nov 2022
Where to start? Analyzing the potential value of intermediate models
Leshem Choshen
Elad Venezian
Shachar Don-Yehiya
Noam Slonim
Yoav Katz
MoMe
17
27
0
31 Oct 2022
Artificial intelligence in government: Concepts, standards, and a unified framework
Vince J. Straub
Deborah Morgan
Jonathan Bright
Helen Z. Margetts
AI4TS
30
31
0
31 Oct 2022
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Zijie J. Wang
Evan Montoya
David Munechika
Haoyang Yang
Benjamin Hoover
Duen Horng Chau
21
288
0
26 Oct 2022