ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.09010
  4. Cited By
Datasheets for Datasets
v1v2v3v4v5v6v7v8 (latest)

Datasheets for Datasets

23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
ArXiv (abs)PDFHTML

Papers citing "Datasheets for Datasets"

50 / 1,069 papers shown
Hateful Messages: A Conversational Data Set of Hate Speech produced by
  Adolescents on Discord
Hateful Messages: A Conversational Data Set of Hate Speech produced by Adolescents on Discord
Jan Fillies
Silvio Peikert
Adrian Paschke
139
7
0
04 Sep 2023
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights,
  and Duties
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and DutiesAAAI Conference on Artificial Intelligence (AAAI), 2023
Taylor Sorensen
Liwei Jiang
Jena D. Hwang
Sydney Levine
Valentina Pyatkin
...
Kavel Rao
Chandra Bhagavatula
Maarten Sap
J. Tasioulas
Yejin Choi
SLR
492
90
0
02 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Bias and Fairness in Large Language Models: A SurveyComputational Linguistics (CL), 2023
Isabel O. Gallegos
Ryan Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
394
888
0
02 Sep 2023
The Use of Synthetic Data to Train AI Models: Opportunities and Risks
  for Sustainable Development
The Use of Synthetic Data to Train AI Models: Opportunities and Risks for Sustainable Development
T. Marwala
Eleonore Fournier-Tombs
Serge Stinckwich
100
14
0
31 Aug 2023
FACET: Fairness in Computer Vision Evaluation Benchmark
FACET: Fairness in Computer Vision Evaluation BenchmarkIEEE International Conference on Computer Vision (ICCV), 2023
Laura Gustafson
Chloe Rolland
Nikhila Ravi
Quentin Duval
Aaron B. Adcock
Cheng-Yang Fu
Melissa Hall
Candace Ross
VLMEGVM
351
60
0
31 Aug 2023
Party Prediction for Twitter
Party Prediction for TwitterInternational Conference on Web and Social Media (ICWSM), 2023
Kellin Pelrine
Anne Imouza
Zachary Yang
Jacob-Junqi Tian
Sacha Lévy
...
Aarash Feizi
Cécile Amadoro
A. Blais
Jean-François Godbout
Reihaneh Rabbany
152
2
0
25 Aug 2023
American Stories: A Large-Scale Structured Text Dataset of Historical
  U.S. Newspapers
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. NewspapersNeural Information Processing Systems (NeurIPS), 2023
Melissa Dell
Jacob Carlson
Tom Bryan
Emily Silcock
Abhishek Arora
Zejiang Shen
Luca DÁmico-Wong
Q. Le
Pablo Querubin
Leander Heldring
AI4TS
167
15
0
24 Aug 2023
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data
  Collection
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data CollectionProceedings of the AAAI Conference on Human Computation and Crowdsourcing (HCOMP), 2023
Oana Inel
Tim Draws
Lora Aroyo
299
11
0
22 Aug 2023
Artificial Intelligence and Aesthetic Judgment
Artificial Intelligence and Aesthetic Judgment
Jessica Hullman
Ari Holtzman
Andrew Gelman
126
4
0
21 Aug 2023
LegalBench: A Collaboratively Built Benchmark for Measuring Legal
  Reasoning in Large Language Models
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language ModelsSocial Science Research Network (SSRN), 2023
Neel Guha
Julian Nyarko
Mark A. Lemley
Christopher Ré
Adam Chilton
...
Spencer Williams
Sunny G. Gandhi
Tomer Zur
Varun J. Iyer
Zehua Li
AILawLRMELM
249
301
0
20 Aug 2023
LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and
  Benchmark
LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and BenchmarkIEEE International Conference on Computer Vision (ICCV), 2023
Lojze Žust
J. Pers
Matej Kristan
VOS
203
37
0
18 Aug 2023
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language
  Understanding
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language UnderstandingNeural Information Processing Systems (NeurIPS), 2023
K. Mangalam
Raiymbek Akshulakov
Jitendra Malik
402
495
0
17 Aug 2023
Impression-Aware Recommender Systems
Impression-Aware Recommender Systems
F. B. P. Maurera
Maurizio Ferrari Dacrema
P. Castells
Paolo Cremonesi
AI4TS
211
4
0
15 Aug 2023
REFORMS: Reporting Standards for Machine Learning Based Science
REFORMS: Reporting Standards for Machine Learning Based Science
Sayash Kapoor
Emily F. Cantrell
Kenny Peng
Thanh Hien Pham
C. Bail
...
Matthew J. Salganik
Marta Serra-Garcia
Brandon M Stewart
Gilles Vandewiele
Arvind Narayanan
241
26
0
15 Aug 2023
A User-Centered Evaluation of Spanish Text Simplification
A User-Centered Evaluation of Spanish Text Simplification
Adrian de Wynter
Anthony Hevia
Si-Qing Chen
154
1
0
15 Aug 2023
Ground Truth Or Dare: Factors Affecting The Creation Of Medical Datasets
  For Training AI
Ground Truth Or Dare: Factors Affecting The Creation Of Medical Datasets For Training AIAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
H. D. Zając
Natalia-Rozalia Avlona
T. O. Andersen
F. Kensing
Irina Shklovski
139
29
0
12 Aug 2023
Visualising category recoding and numeric redistributions
Visualising category recoding and numeric redistributions
Cynthia A. Huang
57
2
0
12 Aug 2023
FUTURE-AI: International consensus guideline for trustworthy and
  deployable artificial intelligence in healthcare
FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcareBritish medical journal (BMJ), 2023
Karim Lekadir
Aasa Feragen
Abdul Joseph Fofanah
Alejandro F Frangi
Alena Buyx
...
Yi Zeng
Yunusa G Mohammed
Yves Saint James Aquino
Zohaib Salahuddin
M. P. Starmans
AI4TS
315
221
0
11 Aug 2023
OpenProteinSet: Training data for structural biology at scale
OpenProteinSet: Training data for structural biology at scaleNeural Information Processing Systems (NeurIPS), 2023
Gustaf Ahdritz
N. Bouatta
S. Kadyan
Lukas Jarosch
Daniel Berenberg
Ian Fisk
Andrew Watkins
Stephen Ra
Richard Bonneau
Mohammed AlQuraishi
AI4CE
209
22
0
10 Aug 2023
Heuristics for Supporting Cooperative Dashboard Design
Heuristics for Supporting Cooperative Dashboard DesignIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
V. Setlur
M. Correll
Arvind Satyanarayan
Melanie Tory
231
18
0
08 Aug 2023
PUG: Photorealistic and Semantically Controllable Synthetic Data for
  Representation Learning
PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation LearningNeural Information Processing Systems (NeurIPS), 2023
Florian Bordes
Shashank Shekhar
Mark Ibrahim
Diane Bouchacourt
Pascal Vincent
Ari S. Morcos
202
36
0
08 Aug 2023
Balanced Face Dataset: Guiding StyleGAN to Generate Labeled Synthetic
  Face Image Dataset for Underrepresented Group
Balanced Face Dataset: Guiding StyleGAN to Generate Labeled Synthetic Face Image Dataset for Underrepresented Group
Kidist Amde Mekonnen
CVBM
156
3
0
07 Aug 2023
Auditing and Robustifying COVID-19 Misinformation Datasets via
  Anticontent Sampling
Auditing and Robustifying COVID-19 Misinformation Datasets via Anticontent SamplingAAAI Conference on Artificial Intelligence (AAAI), 2023
Clay H. Yoo
Ashiqur R. KhudaBukhsh
230
6
0
05 Aug 2023
No Fair Lunch: A Causal Perspective on Dataset Bias in Machine Learning
  for Medical Imaging
No Fair Lunch: A Causal Perspective on Dataset Bias in Machine Learning for Medical Imaging
Charles Jones
Daniel Coelho De Castro
Fabio De Sousa Ribeiro
Ozan Oktay
Melissa McCradden
Ben Glocker
FaMLCML
297
12
0
31 Jul 2023
The timing bottleneck: Why timing and overlap are mission-critical for
  conversational user interfaces, speech recognition and dialogue systems
The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systemsSIGDIAL Conferences (SIGDIAL), 2023
Andreas Liesenfeld
Alianda Lopez
Mark Dingemanse
304
11
0
28 Jul 2023
FeedbackLogs: Recording and Incorporating Stakeholder Feedback into
  Machine Learning Pipelines
FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning PipelinesConference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Matthew Barker
Emma Kallina
D. Ashok
Katherine M. Collins
Ashley Casovan
Adrian Weller
Ameet Talwalkar
Valerie Chen
Umang Bhatt
188
11
0
28 Jul 2023
RAI Guidelines: Method for Generating Responsible AI Guidelines Grounded
  in Regulations and Usable by (Non-)Technical Roles
RAI Guidelines: Method for Generating Responsible AI Guidelines Grounded in Regulations and Usable by (Non-)Technical Roles
Marios Constantinides
Edyta Bogucka
Daniele Quercia
Susanna Kallio
Mohammad Tahaei
205
24
0
27 Jul 2023
Designing Fiduciary Artificial Intelligence
Designing Fiduciary Artificial IntelligenceConference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Sebastian Benthall
David Shekman
172
9
0
27 Jul 2023
A Deep Dive into the Disparity of Word Error Rates Across Thousands of
  NPTEL MOOC Videos
A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC VideosInternational Conference on Web and Social Media (ICWSM), 2023
Anand Rai
Siddharth D. Jaiswal
Animesh Mukherjee
171
5
0
20 Jul 2023
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Ashish Singh
Ashutosh Singh
Prateek R. Agarwal
Zixuan Huang
Arpita Singh
...
Ryan Rossi
Puneet Mathur
Erik Learned-Miller
Franck Dernoncourt
Ryan Rossi
310
8
0
20 Jul 2023
Europepolls: A Dataset of Country-Level Opinion Polling Data for the
  European Union and the UK
Europepolls: A Dataset of Country-Level Opinion Polling Data for the European Union and the UK
Konstantinos Pitas
48
0
0
19 Jul 2023
Test-takers have a say: understanding the implications of the use of AI
  in language tests
Test-takers have a say: understanding the implications of the use of AI in language tests
Dawen Zhang
Thong Hoang
Shidong Pan
Yongquan Hu
Zhenchang Xing
Mark Staples
Xiwei Xu
Qinghua Lu
Aaron Quigley
ELM
164
5
0
19 Jul 2023
Beyond the ML Model: Applying Safety Engineering Frameworks to
  Text-to-Image Development
Beyond the ML Model: Applying Safety Engineering Frameworks to Text-to-Image DevelopmentAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
Shalaleh Rismani
Renee Shelby
A. Smart
Renelito Delos Santos
AJung Moon
Negar Rostamzadeh
225
13
0
19 Jul 2023
Reflections from the Workshop on AI-Assisted Decision Making for
  Conservation
Reflections from the Workshop on AI-Assisted Decision Making for Conservation
Lily Xu
Esther Rolf
Sara Beery
J. Bennett
T. Berger-Wolf
...
P. Moorcroft
Jonathan Palmer
Andrew Perrault
D. Thau
Milind Tambe
214
4
0
17 Jul 2023
Leveraging Recommender Systems to Reduce Content Gaps on Peer Production
  Platforms
Leveraging Recommender Systems to Reduce Content Gaps on Peer Production PlatformsInternational Conference on Web and Social Media (ICWSM), 2023
M. Houtti
Isaac Johnson
Morten Warncke-Wang
Loren G. Terveen
210
3
0
17 Jul 2023
Fairness in KI-Systemen
Fairness in KI-Systemen
Janine Strotherm
Alissa Müller
Barbara Hammer
Benjamin Paassen
FaML
135
1
0
17 Jul 2023
Where Did the President Visit Last Week? Detecting Celebrity Trips from
  News Articles
Where Did the President Visit Last Week? Detecting Celebrity Trips from News ArticlesInternational Conference on Web and Social Media (ICWSM), 2023
Kai Peng
Ying Zhang
Shuai Ling
Zhaoru Ke
Haipeng Zhang
GNN
189
1
0
17 Jul 2023
Analyzing Dataset Annotation Quality Management in the Wild
Analyzing Dataset Annotation Quality Management in the WildComputational Linguistics (CL), 2023
Jan-Christoph Klie
Richard Eckart de Castilho
Iryna Gurevych
361
55
0
16 Jul 2023
Bound by the Bounty: Collaboratively Shaping Evaluation Processes for
  Queer AI Harms
Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI HarmsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
Organizers of QueerInAI
Nathaniel Dennler
Anaelia Ovalle
Ashwin Singh
Luca Soldaini
...
Kyra Yee
Irene Font Peradejordi
Zeerak Talat
Mayra Russo
Jessica de Jesus de Pinho Pinhal
200
22
0
15 Jul 2023
Othering and low status framing of immigrant cuisines in US restaurant
  reviews and large language models
Othering and low status framing of immigrant cuisines in US restaurant reviews and large language modelsInternational Conference on Web and Social Media (ICWSM), 2023
Yiwei Luo
Kristina Gligorić
Dan Jurafsky
175
7
0
14 Jul 2023
Robotic Manipulation Datasets for Offline Compositional Reinforcement
  Learning
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
Jorge Armando Mendez Mendez
Anisha Singrodia
Cassandra Kent
Eric Eaton
OffRL
339
9
0
13 Jul 2023
IntelliGraphs: Datasets for Benchmarking Knowledge Graph Generation
IntelliGraphs: Datasets for Benchmarking Knowledge Graph Generation
Thiviyan Thanapalasingam
Emile van Krieken
Peter Bloem
Paul T. Groth
223
3
0
13 Jul 2023
Machine Learning practices and infrastructures
Machine Learning practices and infrastructuresAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
G. Berman
255
5
0
13 Jul 2023
Objaverse-XL: A Universe of 10M+ 3D Objects
Objaverse-XL: A Universe of 10M+ 3D ObjectsNeural Information Processing Systems (NeurIPS), 2023
Matt Deitke
Ruoshi Liu
Matthew Wallingford
Huong Ngo
Oscar Michel
...
Carl Vondrick
Georgia Gkioxari
Kiana Ehsani
Ludwig Schmidt
Ali Farhadi
286
644
0
11 Jul 2023
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly
  Knowledge Understanding
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge UnderstandingNeural Information Processing Systems (NeurIPS), 2023
Hao Zheng
R. Lee
Yuqian Lu
VGen
105
23
0
09 Jul 2023
Frontier AI Regulation: Managing Emerging Risks to Public Safety
Frontier AI Regulation: Managing Emerging Risks to Public Safety
Markus Anderljung
Joslyn Barnhart
Anton Korinek
Jade Leung
Cullen O'Keefe
...
Jonas Schuett
Yonadav Shavit
Divya Siddarth
Robert F. Trager
Kevin J. Wolf
SILM
430
156
0
06 Jul 2023
BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark
  for Short-Term Load Forecasting
BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark for Short-Term Load ForecastingNeural Information Processing Systems (NeurIPS), 2023
Patrick Emami
A. Sahu
Peter Graf
AI4TS
476
33
0
30 Jun 2023
A Massive Scale Semantic Similarity Dataset of Historical English
A Massive Scale Semantic Similarity Dataset of Historical EnglishNeural Information Processing Systems (NeurIPS), 2023
Emily Silcock
Melissa Dell
183
5
0
30 Jun 2023
Next Steps for Human-Centered Generative AI: A Technical Perspective
Next Steps for Human-Centered Generative AI: A Technical Perspective
Xiang Ánthony' Chen
Jeff Burke
Andrea Colaço
Matthew K. Hong
Jennifer Jacobs
...
Dingzeyu Li
Nanyun Peng
Karl D. D. Willis
Chien-Sheng Wu
Bolei Zhou
LLMAG
274
44
0
27 Jun 2023
Physion++: Evaluating Physical Scene Understanding that Requires Online
  Inference of Different Physical Properties
Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical PropertiesNeural Information Processing Systems (NeurIPS), 2023
H. Tung
Mingyu Ding
Zhenfang Chen
Daniel M. Bear
Chuang Gan
J. Tenenbaum
Daniel L. K. Yamins
Judy Fan
Kevin A. Smith
223
28
0
27 Jun 2023
Previous
123...91011...202122
Next