arXiv:1803.09010

Datasheets for Datasets

23 March 2018

Jamie Morgenstern

Briana Vecchione

Jennifer Wortman Vaughan

Hanna M. Wallach

Papers citing "Datasheets for Datasets"

50 / 1,069 papers shown

Eval Factsheets: A Structured Framework for Documenting AI Evaluations

Evangelia Spiliopoulou

48

0

0

03 Dec 2025

Whose Personae? Synthetic Persona Experiments in LLM Research and Pathways to Transparency

Anusha Natarajan

Gjergji Kasneci

87

1

0

29 Nov 2025

Defending Large Language Models Against Jailbreak Exploits with Responsible AI Considerations

Hosea David Yu Fei Ng

Dhananjai Sharma

Glenn Jun Jie Ng

Kavishvaran Srinivasan

281

0

0

24 Nov 2025

AI Bill of Materials and Beyond: Systematizing Security Assurance through the AI Risk Scanning (AIRS) Framework

Samuel Nathanson

Catherine Chen Kieffer

Melanie Lockhart

Elisha Peterson

72

0

0

16 Nov 2025

mmJEE-Eval: A Bilingual Multimodal Benchmark for Evaluating Scientific Reasoning in Vision-Language Models

172

0

0

12 Nov 2025

InfoAffect: A Dataset for Affective Analysis of Infographics

108

0

0

09 Nov 2025

QuAnTS: Question Answering on Time Series

Kristian Kersting

Devendra Singh Dhami

92

0

0

07 Nov 2025

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations

...

Stella Biderman

Mykel J. Kochenderfer

230

0

0

06 Nov 2025

What's in Common? Multimodal Models Hallucinate When Reasoning Across Scenes

Polina Kirichenko

208

1

0

05 Nov 2025

EvalCards: A Framework for Standardized Evaluation Reporting

Danae Sanchez Villegas

Antonia Karamolegkou

Alice Schiavone

...

Stephanie Brandl

Daniel Hershcovich

Anders Søgaard

Desmond Elliott

64

0

0

05 Nov 2025

miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward

Roozbeh Yousefzadeh

121

1

0

05 Nov 2025

AyurParam: A State-of-the-Art Bilingual Language Model for Ayurveda

Kundeshwar Pundalik

Piyush Sawarkar

Maunendra Sankar Desarkar

Ganesh Ramakrishnan

252

0

0

04 Nov 2025

Measuring what Matters: Construct Validity in Large Language Model Benchmarks

Angelika Romanou

Franziska Sofia Hafner

...

Christopher Summerfield

478

5

0

03 Nov 2025

Before the Clinic: Transparent and Operable Design Principles for Healthcare AI

Alexander Bakumenko

Aaron J. Masino

Janine Hoelscher

100

1

0

31 Oct 2025

A Multimodal Benchmark for Framing of Oil & Gas Advertising and Potential Greenwashing Detection

Dominik Stammbach

Christopher D. Manning

Peter Henderson

101

0

0

24 Oct 2025

HIKMA: Human-Inspired Knowledge by Machine Agents through a Multi-Agent Framework for Semi-Autonomous Scientific Conferences

Zain Ul Abideen Tariq

Mahmood Al-Zubaidi

Mowafa J Househ

116

0

0

24 Oct 2025

Race and Gender in LLM-Generated Personas: A Large-Scale Audit of 41 Occupations

Ilona van der Linden

David C. Anastasiu

112

0

0

23 Oct 2025

VLSU: Mapping the Limits of Joint Multimodal Understanding for AI Safety

Shruti Palaskar

Mona Abdelrahman

...

Charles Maalouf

122

0

0

21 Oct 2025

BO4Mob: Bayesian Optimization Benchmarks for High-Dimensional Urban Mobility Problem

Carolina Osorio

96

0

0

21 Oct 2025

Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications

...

Ji-Eun Irene Yum

Muhammad Ali Khan

Muhammad Umar Afzal

182

1

0

20 Oct 2025

AFRICAPTION: Establishing a New Paradigm for Image Captioning in African Languages

Mardiyyah Oduwole

Fatimo Adebanjo

Oluwatosin Olajide

Mahi Aminu Aliyu

Jekaterina Novikova

105

0

0

20 Oct 2025

DroneAudioset: An Audio Dataset for Drone-based Search and Rescue

Chitralekha Gupta

Soundarya Ramesh

Suranga Nanayakkara

98

0

0

17 Oct 2025

Iterative Topic Taxonomy Induction with LLMs: A Case Study of Electoral Advertising

Alexander Brady

Tunazzina Islam

72

0

0

16 Oct 2025

JEDA: Query-Free Clinical Order Search from Ambient Dialogues

Corey D Barrett

Sumana Srivasta

Krishnaram Kenthapadi

122

0

0

16 Oct 2025

Predicting the Unpredictable: Reproducible BiLSTM Forecasting of Incident Counts in the Global Terrorism Database (GTD)

Oluwasegun Adegoke

112

0

0

16 Oct 2025

Machine Learning and Public Health: Identifying and Mitigating Algorithmic Bias through a Systematic Review

Sara Altamirano

Sennay Ghebreab

115

0

0

16 Oct 2025

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

Christopher Schröder

Stefan Schweter

Christopher Akiki

Ferdinand Schlatt

Arden Zimmermann

Phillipe Genêt

Martin Potthast

100

0

0

15 Oct 2025

Measuring What Matters: The AI Pluralism Index

Rashid Mushkani

80

0

0

09 Oct 2025

Lean Finder: Semantic Search for Mathlib That Understands User Intents

Swarat Chaudhuri

150

2

0

08 Oct 2025

COLE: a Comprehensive Benchmark for French Language Understanding Evaluation

David Beauchemin

Mohamed Amine Youssef

284

1

0

06 Oct 2025

Accountability Capture: How Record-Keeping to Support AI Transparency and Accountability (Re)shapes Algorithmic Oversight

Shreya Chappidi

105

1

0

06 Oct 2025

What is a protest anyway? Codebook conceptualization is still a first-order concern in LLM-era classification

Andrew Halterman

Katherine A. Keith

126

0

0

03 Oct 2025

Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation

François Ledoyen

Alexis Lechervy

92

0

0

01 Oct 2025

On Explaining Proxy Discrimination and Unfairness in Individual Decisions Made by AI Systems

116

0

0

30 Sep 2025

RoBiologyDataChoiceQA: A Romanian Dataset for improving Biology understanding of Large Language Models

Dragos-Dumitru Ghinea

Adela-Nicoleta Corbeanu

Adrian-Marius Dumitran

96

0

0

30 Sep 2025

VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes

Muhammad Jehanzeb Mirza

Soumya Jahagirdar

Muhammad Huzaifa

Serena Yeung-Levy

179

1

0

29 Sep 2025

Fostering Robots: A Governance-First Conceptual Framework for Domestic, Curriculum-Based Trajectory Collection

Federico Pablo-Marti

Carlos Mir Fernandez

52

0

0

28 Sep 2025

Does AI Coaching Prepare us for Workplace Negotiations?

Jash Rajesh Parekh

117

0

0

26 Sep 2025

WolBanking77: Wolof Banking Speech Intent Classification Dataset

Abdou Karim Kandji

Frédéric Precioso

Augustin Ndione

209

0

0

23 Sep 2025

QUINTA: Reflexive Sensibility For Responsible AI Research and Data-Driven Processes

47

1

0

19 Sep 2025

Assessing Historical Structural Oppression Worldwide via Rule-Guided Prompting of Large Language Models

Sreejato Chatterjee

Quoc Duy Nguyen

100

0

0

18 Sep 2025

Practitioners' Perspectives on a Differential Privacy Deployment Registry

Priyanka Nanayakkara

119

1

0

16 Sep 2025

Op-Fed: Opinion, Stance, and Monetary Policy Annotations on FOMC Transcripts Using Active Learning

Katherine A. Keith

121

0

0

16 Sep 2025

Standards in the Preparation of Biomedical Research Metadata: A Bridge2AI Perspective

Nathan Sheffield

Andrew Williams

Monica C. Munoz-Torres

115

0

0

12 Sep 2025

MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems

306

0

0

11 Sep 2025

Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning

Michele Joshua Maggini

Rabiraj Bandyopadhyay

120

0

0

09 Sep 2025

MatPROV: A Provenance Graph Dataset of Material Synthesis Extracted from Scientific Literature

Hirofumi Tsuruta

157

0

0

01 Sep 2025

Who Owns The Robot?: Four Ethical and Socio-technical Questions about Wellbeing Robots in the Real World through Community Engagement

159

2

0

01 Sep 2025

Deep opacity and AI: A threat to XAI and to privacy protection mechanisms

Vincent C. Müller

72

0

0

30 Aug 2025

Mapping Toxic Comments Across Demographics: A Dataset from German Public Broadcasting

Michael Peter Hoffmann

Rebecca Reichel

Roman Salzwedel

124

0

0

26 Aug 2025
