v1v2v3v4v5v6v7v8 (latest)

Datasheets for Datasets

23 March 2018

Timnit Gebru

Jamie Morgenstern

Briana Vecchione

Jennifer Wortman Vaughan

Papers citing "Datasheets for Datasets"

50 / 1,069 papers shown

Attribute Diversity Determines the Systematicity Gap in VQAConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Ian Berlot-Attwell

Kumar Krishna Agrawal

A. M. Carrell

Yash Sharma

Naomi Saphra

254

15 Nov 2023

Fairness Hacking: The Malicious Practice of Shrouding Unfairness in Algorithms

Kristof Meding

Thilo Hagendorff

150

12 Nov 2023

MultiIoT: Benchmarking Machine Learning for the Internet of Things

Shentong Mo

Louis-Philippe Morency

Russ Salakhutdinov

Paul Pu Liang

199

10 Nov 2023

Is a Seat at the Table Enough? Engaging Teachers and Students in Dataset Specification for ML in Education

Mei Tan

Hansol Lee

Dakuo Wang

Hariharan Subramonyam

175

09 Nov 2023

Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Joan Nwatu

Oana Ignat

Amélie Reymond

206

09 Nov 2023

On Leakage in Machine Learning Pipelines

Leonard Sasse

Eliana Nicolaisen-Sobesky

...

293

07 Nov 2023

Benefits and Harms of Large Language Models in Digital Mental Health

201

07 Nov 2023

Contextual Confidence and Generative AI

Shrey Jain

Zoe Hitzig

Pamela Mishkin

307

02 Nov 2023

FAIRLABEL: Correcting Bias in Labels

Srinivasan H. Sengamedu

Hien Pham

132

01 Nov 2023

ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology LabNeural Information Processing Systems (NeurIPS), 2023

Baoxiong Jia

209

01 Nov 2023

Sentiment Analysis in Digital Spaces: An Overview of Reviews

L. Ayravainen

Joanne Hinds

Brittany I. Davidson

240

30 Oct 2023

A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture

171

30 Oct 2023

There Are No Data Like More Data- Datasets for Deep Learning in Earth ObservationIEEE Geoscience and Remote Sensing Magazine (GRSM), 2023

187

30 Oct 2023

CHAMMI: A benchmark for channel-adaptive models in microscopy imagingNeural Information Processing Systems (NeurIPS), 2023

184

30 Oct 2023

AI for Open Science: A Multi-Agent Perspective for Ethically Translating Data to Knowledge

219

28 Oct 2023

CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud DataNeural Information Processing Systems (NeurIPS), 2023

222

28 Oct 2023

WCLD: Curated Large Dataset of Criminal Cases from Wisconsin Circuit CourtsNeural Information Processing Systems (NeurIPS), 2023

265

28 Oct 2023

Feature Guided Masked Autoencoder for Self-supervised Learning in Remote SensingIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (JSTARS), 2023

Yi Wang

Hugo Hernández Hernández

C. Albrecht

Xiao Xiang Zhu

252

28 Oct 2023

Socially Cognizant Robotics for a Technology Enhanced Society

Kostas Bekris

207

27 Oct 2023

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

Damien Sileo

...

Tongshuang Wu

302

25 Oct 2023

AI Hazard Management: A framework for the systematic management of root causes for AI risks

276

25 Oct 2023

Can You Rely on Your Model Evaluation? Improving Model Evaluation with Synthetic Test DataNeural Information Processing Systems (NeurIPS), 2023

211

25 Oct 2023

ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee BehaviorsNeural Information Processing Systems (NeurIPS), 2023

230

25 Oct 2023

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition

Sander Schulhoff

Jeremy Pinto

Anaum Khan

Louis-Franccois Bouchard

Jordan L. Boyd-Graber

SILM

355

24 Oct 2023

On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms

Cristian Canton Ferrer

Tal Hassner

FaML

292

24 Oct 2023

RoboDepth: Robust Out-of-Distribution Depth Estimation under CorruptionsNeural Information Processing Systems (NeurIPS), 2023

281

23 Oct 2023

The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment AnalysisConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Pranav Narayanan Venkit

204

18 Oct 2023

CoMPosT: Characterizing and Evaluating Caricature in LLM SimulationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

323

106

17 Oct 2023

A State-Vector Framework for Dataset EffectsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

E. Sahak

Zining Zhu

Frank Rudzicz

217

17 Oct 2023

The AI Incident Database as an Educational Tool to Raise Awareness of AI Harms: A Classroom Exploration of Efficacy, Limitations, & Future ImprovementsConference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023

Michael Feffer

Nikolas Martelaro

Hoda Heidari

180

10 Oct 2023

Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor DiscussionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Lucie-Aimée Kaffee

Arnav Arora

Isabelle Augenstein

204

09 Oct 2023

InterroLang: Exploring NLP Models and Datasets through Dialogue-based ExplanationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Nils Feldhus

Qianli Wang

Tatiana Anikina

Sahil Chopra

Cennet Oguz

Sebastian Möller

310

09 Oct 2023

DORIS-MAE: Scientific Document Retrieval using Multi-level Aspect-based QueriesNeural Information Processing Systems (NeurIPS), 2023

473

07 Oct 2023

The Empty Signifier Problem: Towards Clearer Paradigms for Operationalising "Alignment" in Large Language Models

Hannah Rose Kirk

Bertie Vidgen

Paul Röttger

Scott A. Hale

381

03 Oct 2023

Grasping AI: experiential exercises for designersAi & Society (AI & Society), 2023

148

02 Oct 2023

HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object CountNeural Information Processing Systems (NeurIPS), 2023

454

01 Oct 2023

Berkeley Open Extended Reality Recordings 2023 (BOXRR-23): 4.7 Million Motion Capture Recordings from 105,852 Extended Reality Device UsersIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023

263

30 Sep 2023

LagrangeBench: A Lagrangian Fluid Mechanics Benchmarking SuiteNeural Information Processing Systems (NeurIPS), 2023

Stefania Costantini

Gianluca Galletti

Fabian Fritz

Stefan Adami

Nikolaus A. Adams

272

28 Sep 2023

More than Model Documentation: Uncovering Teachers' Bespoke Information Needs for Informed Classroom Integration of ChatGPTInternational Conference on Human Factors in Computing Systems (CHI), 2023

Mei Tan

Hariharan Subramonyam

250

25 Sep 2023

SINCERE: Supervised Information Noise-Contrastive Estimation REvisited

Patrick Feeney

M. C. Hughes

239

25 Sep 2023

Affective Game Computing: A SurveyProceedings of the IEEE (Proc. IEEE), 2023

Georgios N. Yannakakis

Dávid Melhárt

238

25 Sep 2023

VidChapters-7M: Video Chapters at ScaleNeural Information Processing Systems (NeurIPS), 2023

242

25 Sep 2023

Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 DataNeural Information Processing Systems (NeurIPS), 2023

...

345

23 Sep 2023

Unlocking Model Insights: A Dataset for Automated Model Card Generation

160

22 Sep 2023

The Cambridge Law Corpus: A Dataset for Legal AI ResearchSocial Science Research Network (SSRN), 2023

Huiyuan Xie

271

21 Sep 2023

Learning and DiSentangling Patient Static Information from Time-series Electronic HEalth Record (STEER)PLOS Digital Health (PDH), 2023

Wei-Duen Liao

J. Voldman

OOD CML

181

20 Sep 2023

How to Data in DatathonsNeural Information Processing Systems (NeurIPS), 2023

161

18 Sep 2023

AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous DrivingIEEE Robotics and Automation Letters (RA-L), 2023

Abhinav Valada

363

12 Sep 2023

Beyond Skin Tone: A Multidimensional Measure of Apparent Skin ColorIEEE International Conference on Computer Vision (ICCV), 2023

William Thong

Przemyslaw K. Joniak

Alice Xiang

281

10 Sep 2023

Augmenting Chest X-ray Datasets with Non-Expert AnnotationsAnnual Conference on Medical Image Understanding and Analysis (MIUA), 2023

Amelia Jiménez-Sánchez

280

05 Sep 2023