Datasheets for Datasets

23 March 2018

Timnit Gebru

Jamie Morgenstern

Briana Vecchione

Jennifer Wortman Vaughan

Papers citing "Datasheets for Datasets"

50 / 1,069 papers shown

Hateful Messages: A Conversational Data Set of Hate Speech produced by Adolescents on Discord

Jan Fillies

Silvio Peikert

Adrian Paschke

139

04 Sep 2023

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties

AAAI Conference on Artificial Intelligence (AAAI), 2023

...

Yejin Choi

492

02 Sep 2023

Bias and Fairness in Large Language Models: A Survey

Computational Linguistics (CL), 2023

Isabel O. Gallegos

Ryan Rossi

Joe Barrow

Md Mehrab Tanjim

Sungchul Kim

394

888

02 Sep 2023

The Use of Synthetic Data to Train AI Models: Opportunities and Risks for Sustainable Development

T. Marwala

Eleonore Fournier-Tombs

Serge Stinckwich

100

31 Aug 2023

FACET: Fairness in Computer Vision Evaluation Benchmark

IEEE International Conference on Computer Vision (ICCV), 2023

351

31 Aug 2023

Party Prediction for Twitter

International Conference on Web and Social Media (ICWSM), 2023

Kellin Pelrine

Anne Imouza

Zachary Yang

Jacob-Junqi Tian

...

Jean-François Godbout

Reihaneh Rabbany

152

25 Aug 2023

American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers

Neural Information Processing Systems (NeurIPS), 2023

167

24 Aug 2023

Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection

Proceedings of the AAAI Conference on Human Computation and Crowdsourcing (HCOMP), 2023

Oana Inel

Tim Draws

Lora Aroyo

299

22 Aug 2023

Artificial Intelligence and Aesthetic Judgment

Jessica Hullman

Ari Holtzman

Andrew Gelman

126

21 Aug 2023

LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Social Science Research Network (SSRN), 2023

...

249

301

20 Aug 2023

LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark

IEEE International Conference on Computer Vision (ICCV), 2023

203

18 Aug 2023

EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding

Neural Information Processing Systems (NeurIPS), 2023

K. Mangalam

Raiymbek Akshulakov

Jitendra Malik

402

495

17 Aug 2023

Impression-Aware Recommender Systems

F. B. P. Maurera

Maurizio Ferrari Dacrema

P. Castells

Paolo Cremonesi

AI4TS

211

15 Aug 2023

REFORMS: Reporting Standards for Machine Learning Based Science

...

241

15 Aug 2023

A User-Centered Evaluation of Spanish Text Simplification

Adrian de Wynter

Anthony Hevia

Si-Qing Chen

154

15 Aug 2023

Ground Truth Or Dare: Factors Affecting The Creation Of Medical Datasets For Training AI

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023

H. D. Zając

Natalia-Rozalia Avlona

T. O. Andersen

F. Kensing

Irina Shklovski

139

12 Aug 2023

Visualising category recoding and numeric redistributions

Cynthia A. Huang

12 Aug 2023

FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare

British Medical Journal (BMJ), 2023

Alejandro F Frangi

...

Yi Zeng

Yunusa G Mohammed

Yves Saint James Aquino

Zohaib Salahuddin

M. P. Starmans

AI4TS

315

221

11 Aug 2023

OpenProteinSet: Training data for structural biology at scale

Neural Information Processing Systems (NeurIPS), 2023

Mohammed AlQuraishi

209

10 Aug 2023

Heuristics for Supporting Cooperative Dashboard Design

IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023

231

08 Aug 2023

PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning

Neural Information Processing Systems (NeurIPS), 2023

Pascal Vincent

202

08 Aug 2023

Balanced Face Dataset: Guiding StyleGAN to Generate Labeled Synthetic Face Image Dataset for Underrepresented Group

Kidist Amde Mekonnen

CVBM

156

07 Aug 2023

Auditing and Robustifying COVID-19 Misinformation Datasets via Anticontent Sampling

AAAI Conference on Artificial Intelligence (AAAI), 2023

Clay H. Yoo

Ashiqur R. KhudaBukhsh

230

05 Aug 2023

No Fair Lunch: A Causal Perspective on Dataset Bias in Machine Learning for Medical Imaging

Charles Jones

Daniel Coelho De Castro

Fabio De Sousa Ribeiro

297

31 Jul 2023

The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems

SIGDIAL Conferences (SIGDIAL), 2023

Andreas Liesenfeld

Alianda Lopez

Mark Dingemanse

304

28 Jul 2023

FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines

Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023

188

28 Jul 2023

RAI Guidelines: Method for Generating Responsible AI Guidelines Grounded in Regulations and Usable by (Non-)Technical Roles

Marios Constantinides

Edyta Bogucka

Daniele Quercia

Susanna Kallio

Mohammad Tahaei

205

27 Jul 2023

Designing Fiduciary Artificial Intelligence

Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023

Sebastian Benthall

David Shekman

172

27 Jul 2023

A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos

International Conference on Web and Social Media (ICWSM), 2023

Anand Rai

Siddharth D. Jaiswal

Animesh Mukherjee

171

20 Jul 2023

FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback

...

310

20 Jul 2023

Europepolls: A Dataset of Country-Level Opinion Polling Data for the European Union and the UK

Konstantinos Pitas

19 Jul 2023

Test-takers have a say: understanding the implications of the use of AI in language tests

164

19 Jul 2023

Beyond the ML Model: Applying Safety Engineering Frameworks to Text-to-Image Development

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023

Shalaleh Rismani

Renee Shelby

A. Smart

Renelito Delos Santos

AJung Moon

Negar Rostamzadeh

225

19 Jul 2023

Reflections from the Workshop on AI-Assisted Decision Making for Conservation

...

214

17 Jul 2023

Leveraging Recommender Systems to Reduce Content Gaps on Peer Production Platforms

International Conference on Web and Social Media (ICWSM), 2023

210

17 Jul 2023

Fairness in KI-Systemen

Barbara Hammer

135

17 Jul 2023

Where Did the President Visit Last Week? Detecting Celebrity Trips from News Articles

International Conference on Web and Social Media (ICWSM), 2023

189

17 Jul 2023

Analyzing Dataset Annotation Quality Management in the Wild

Computational Linguistics (CL), 2023

Jan-Christoph Klie

Richard Eckart de Castilho

Iryna Gurevych

361

16 Jul 2023

Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023

Organizers of QueerInAI

Nathaniel Dennler

Anaelia Ovalle

Ashwin Singh

Luca Soldaini

...

Kyra Yee

Irene Font Peradejordi

Zeerak Talat

Mayra Russo

Jessica de Jesus de Pinho Pinhal

200

15 Jul 2023

Othering and low status framing of immigrant cuisines in US restaurant reviews and large language models

International Conference on Web and Social Media (ICWSM), 2023

Yiwei Luo

Kristina Gligorić

Dan Jurafsky

175

14 Jul 2023

Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning

Marcel Hussing

Jorge Armando Mendez Mendez

339

13 Jul 2023

IntelliGraphs: Datasets for Benchmarking Knowledge Graph Generation

Thiviyan Thanapalasingam

Emile van Krieken

Peter Bloem

Paul T. Groth

223

13 Jul 2023

Machine Learning practices and infrastructures

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023

G. Berman

255

13 Jul 2023

Objaverse-XL: A Universe of 10M+ 3D Objects

Neural Information Processing Systems (NeurIPS), 2023

...

Carl Vondrick

286

644

11 Jul 2023

HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge Understanding

Neural Information Processing Systems (NeurIPS), 2023

105

09 Jul 2023

Frontier AI Regulation: Managing Emerging Risks to Public Safety

...

Divya Siddarth

430

156

06 Jul 2023

BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark for Short-Term Load Forecasting

Neural Information Processing Systems (NeurIPS), 2023

476

30 Jun 2023

A Massive Scale Semantic Similarity Dataset of Historical English

Neural Information Processing Systems (NeurIPS), 2023

Emily Silcock

Melissa Dell

183

30 Jun 2023

Next Steps for Human-Centered Generative AI: A Technical Perspective

...

Karl D. D. Willis

274

27 Jun 2023

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties

Neural Information Processing Systems (NeurIPS), 2023

Mingyu Ding

Chuang Gan

223

27 Jun 2023