Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.09010
Cited By
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Datasheets for Datasets"
50 / 966 papers shown
Title
Is a Seat at the Table Enough? Engaging Teachers and Students in Dataset Specification for ML in Education
Mei Tan
Hansol Lee
Dakuo Wang
Hariharan Subramonyam
21
7
0
09 Nov 2023
Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language Models
Joan Nwatu
Oana Ignat
Rada Mihalcea
18
9
0
09 Nov 2023
On Leakage in Machine Learning Pipelines
Leonard Sasse
Eliana Nicolaisen-Sobesky
Juergen Dukart
Simon B. Eickhoff
Michael Götz
...
Abhijit Kulkarni
Juha Lahnakoski
Bradley C. Love
F. Raimondo
K. Patil
AI4CE
21
3
0
07 Nov 2023
Benefits and Harms of Large Language Models in Digital Mental Health
Munmun De Choudhury
Sachin R. Pendse
Neha Kumar
LM&MA
AI4MH
22
41
0
07 Nov 2023
Contextual Confidence and Generative AI
Shrey Jain
Zoe Hitzig
Pamela Mishkin
33
5
0
02 Nov 2023
FAIRLABEL: Correcting Bias in Labels
Srinivasan H. Sengamedu
Hien Pham
11
0
0
01 Nov 2023
ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
Jieming Cui
Ziren Gong
Baoxiong Jia
Siyuan Huang
Zilong Zheng
Jianzhu Ma
Yixin Zhu
25
3
0
01 Nov 2023
Sentiment Analysis in Digital Spaces: An Overview of Reviews
L. Ayravainen
Joanne Hinds
Brittany I. Davidson
30
0
0
30 Oct 2023
A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture
Qianqian Shen
Yunhan Zhao
Nahyun Kwon
Jeeeun Kim
Yanan Li
Shu Kong
14
2
0
30 Oct 2023
There Are No Data Like More Data- Datasets for Deep Learning in Earth Observation
Michael Schmitt
S. A. Ahmadi
Yonghao Xu
G. Taşkın
Ujjwal Verma
F. Sica
Ronny Hansch
21
24
0
30 Oct 2023
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging
Zitong S. Chen
Chau Pham
Siqi Wang
Michael Doron
Nikita Moshkov
Bryan A. Plummer
Juan C. Caicedo
20
9
0
30 Oct 2023
AI for Open Science: A Multi-Agent Perspective for Ethically Translating Data to Knowledge
Chase Yakaboski
Gregory Hyde
Clement Nyanhongo
Eugene Santos
8
1
0
28 Oct 2023
CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data
Taiki Miyanishi
Fumiya Kitamori
Shuhei Kurita
Jungdae Lee
M. Kawanabe
Nakamasa Inoue
AI4TS
3DPC
17
4
0
28 Oct 2023
WCLD: Curated Large Dataset of Criminal Cases from Wisconsin Circuit Courts
Elliott Ash
Naman Goel
Nianyun Li
Claudia Marangon
Peiyao Sun
27
2
0
28 Oct 2023
Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing
Yi Wang
Hugo Hernández Hernández
C. Albrecht
Xiao Xiang Zhu
37
30
0
28 Oct 2023
Socially Cognizant Robotics for a Technology Enhanced Society
Kristin J. Dana
Clinton Andrews
Kostas Bekris
Jacob Feldman
Matthew Stone
Pernille Hemmer
Aaron Mazzeo
Hal Salzman
Jingang Yi
13
0
0
27 Oct 2023
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
...
K. Bollacker
Tongshuang Wu
Luis Villa
Sandy Pentland
Sara Hooker
15
55
0
25 Oct 2023
AI Hazard Management: A framework for the systematic management of root causes for AI risks
Ronald Schnitzer
Andreas Hapfelmeier
Sven Gaube
Sonja Zillner
13
3
0
25 Oct 2023
Can You Rely on Your Model Evaluation? Improving Model Evaluation with Synthetic Test Data
B. V. Breugel
Nabeel Seedat
F. Imrie
M. Schaar
SyDa
24
19
0
25 Oct 2023
ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors
Xiaoxuan Ma
Stephan P. Kaufhold
Jiajun Su
Wentao Zhu
Jack Terwilliger
Andres Meza
Yixin Zhu
Federico Rossano
Yizhou Wang
21
13
0
25 Oct 2023
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Sander Schulhoff
Jeremy Pinto
Anaum Khan
Louis-Franccois Bouchard
Chenglei Si
Svetlina Anati
Valen Tagliabue
Anson Liu Kost
Christopher Carnahan
Jordan L. Boyd-Graber
SILM
29
41
0
24 Oct 2023
On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms
S. Mittal
K. Thakral
Richa Singh
Mayank Vatsa
Tamar Glaser
Cristian Canton Ferrer
Tal Hassner
FaML
21
3
0
24 Oct 2023
RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions
Lingdong Kong
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
OODD
25
30
0
23 Oct 2023
The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis
Pranav Narayanan Venkit
Mukund Srinath
Sanjana Gautam
Saranya Venkatraman
Vipul Gupta
R. Passonneau
Shomir Wilson
40
13
0
18 Oct 2023
CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
Myra Cheng
Tiziano Piccardi
Diyi Yang
LLMAG
16
67
0
17 Oct 2023
A State-Vector Framework for Dataset Effects
E. Sahak
Zining Zhu
Frank Rudzicz
20
1
0
17 Oct 2023
The AI Incident Database as an Educational Tool to Raise Awareness of AI Harms: A Classroom Exploration of Efficacy, Limitations, & Future Improvements
Michael Feffer
Nikolas Martelaro
Hoda Heidari
29
14
0
10 Oct 2023
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions
Lucie-Aimée Kaffee
Arnav Arora
Isabelle Augenstein
21
5
0
09 Oct 2023
InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
Nils Feldhus
Qianli Wang
Tatiana Anikina
Sahil Chopra
Cennet Oguz
Sebastian Möller
29
9
0
09 Oct 2023
DORIS-MAE: Scientific Document Retrieval using Multi-level Aspect-based Queries
Jianyou Wang
Kaicheng Wang
Xiaoyue Wang
Prudhviraj Naidu
Leon Bergen
R. Paturi
37
11
0
07 Oct 2023
The Empty Signifier Problem: Towards Clearer Paradigms for Operationalising "Alignment" in Large Language Models
Hannah Rose Kirk
Bertie Vidgen
Paul Röttger
Scott A. Hale
39
2
0
03 Oct 2023
Grasping AI: experiential exercises for designers
Dave Murray-Rust
M. Lupetti
Iohanna Nicenboim
W. V. D. Hoog
24
12
0
02 Oct 2023
HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count
N. Wiederhold
Ava Megyeri
DiMaggio Paris
Sean Banerjee
N. Banerjee
13
9
0
01 Oct 2023
Berkeley Open Extended Reality Recordings 2023 (BOXRR-23): 4.7 Million Motion Capture Recordings from 105,852 Extended Reality Device Users
V. Nair
Wenbo Guo
Rui Wang
J. F. O'Brien
Louis B. Rosenberg
Dawn Song
13
7
0
30 Sep 2023
LagrangeBench: A Lagrangian Fluid Mechanics Benchmarking Suite
Stefania Costantini
Gianluca Galletti
Fabian Fritz
Stefan Adami
Nikolaus A. Adams
38
13
0
28 Sep 2023
More than Model Documentation: Uncovering Teachers' Bespoke Information Needs for Informed Classroom Integration of ChatGPT
Mei Tan
Hariharan Subramonyam
27
19
0
25 Sep 2023
SINCERE: Supervised Information Noise-Contrastive Estimation REvisited
Patrick Feeney
M. C. Hughes
23
3
0
25 Sep 2023
Affective Game Computing: A Survey
Georgios N. Yannakakis
Dávid Melhárt
19
14
0
25 Sep 2023
VidChapters-7M: Video Chapters at Scale
Antoine Yang
Arsha Nagrani
Ivan Laptev
Josef Sivic
Cordelia Schmid
VGen
13
26
0
25 Sep 2023
Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data
Wai Tong Chung
Bassem Akoush
Pushan Sharma
Alex Tamkin
Kihoon Jung
...
D. Brouzet
M. Talei
B. Savard
A. Poludnenko
M. Ihme
AI4CE
25
13
0
23 Sep 2023
Unlocking Model Insights: A Dataset for Automated Model Card Generation
Shruti Singh
Hitesh Lodwal
Husain Malwat
Rakesh Thakur
Mayank Singh
SyDa
19
3
0
22 Sep 2023
The Cambridge Law Corpus: A Dataset for Legal AI Research
Andreas Ostling
Holli Sargeant
Huiyuan Xie
Ludwig Bull
Alexander Terenin
Leif Jonsson
Maans Magnusson
Felix Steffek
ELM
AILaw
14
7
0
21 Sep 2023
Learning and DiSentangling Patient Static Information from Time-series Electronic HEalth Record (STEER)
Wei-Duen Liao
J. Voldman
OOD
CML
9
0
0
20 Sep 2023
How to Data in Datathons
Carlos Mougan
Richard Plant
Clare Teng
Marya Bazzi
Alvaro Cabregas-Ejea
Ryan Sze-Yin Chan
David Salvador Jasin
Martin Stoffel
K. Whitaker
Jules Manser
20
1
0
18 Sep 2023
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving
Ahmed Rida Sekkat
Rohit Mohan
Oliver Sawade
Elmar Matthes
Abhinav Valada
28
9
0
12 Sep 2023
Beyond Skin Tone: A Multidimensional Measure of Apparent Skin Color
William Thong
Przemyslaw K. Joniak
Alice Xiang
20
19
0
10 Sep 2023
Augmenting Chest X-ray Datasets with Non-Expert Annotations
Cathrine Damgaard
Trine Eriksen
Dovile Juodelyte
V. Cheplygina
Amelia Jiménez-Sánchez
37
3
0
05 Sep 2023
Hateful Messages: A Conversational Data Set of Hate Speech produced by Adolescents on Discord
Jan Fillies
Silvio Peikert
Adrian Paschke
9
2
0
04 Sep 2023
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
Taylor Sorensen
Liwei Jiang
Jena D. Hwang
Sydney Levine
Valentina Pyatkin
...
Kavel Rao
Chandra Bhagavatula
Maarten Sap
J. Tasioulas
Yejin Choi
SLR
16
50
0
02 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Isabel O. Gallegos
Ryan A. Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
19
485
0
02 Sep 2023
Previous
1
2
3
...
6
7
8
...
18
19
20
Next