Datasheets for Datasets

23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford

Papers citing "Datasheets for Datasets"

50 / 1,069 papers shown
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
Neural Information Processing Systems (NeurIPS), 2023
Kaiyu Yang
Aidan M. Swope
Alex Gu
Rahul Chalamala
Peiyang Song
Shixing Yu
Saad Godil
R. Prenger
Anima Anandkumar
RALM
376
338
0
27 Jun 2023
Use case cards: a use case reporting framework inspired by the European AI Act
Ethics and Information Technology (EIT), 2023
Isabelle Hupont
David Fernández Llorca
S. Baldassarri
Emilia Gómez
154
32
0
23 Jun 2023
Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval
Katrin Glinka
Claudia Müller-Birn
97
16
0
22 Jun 2023
Realistic Synthetic Financial Transactions for Anti-Money Laundering Models
Neural Information Processing Systems (NeurIPS), 2023
Erik Altman
Jovan Blanuša
Luc von Niederhäusern
Béni Egressy
Andreea Anghel
Kubilay Atasu
346
76
0
22 Jun 2023
Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities
Xudong Shen
H. Brown
Jiashu Tao
Martin Strobel
Yao Tong
Akshay Narayan
Harold Soh
Finale Doshi-Velez
334
3
0
22 Jun 2023
VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Neural Information Processing Systems (NeurIPS), 2023
Elizaveta Semenova
F. G. Abrantes
Hanwen Zhu
Grace A. Sodunke
Aleksandar Shtedritski
Hannah Rose Kirk
CoGe
377
57
0
21 Jun 2023
An Overview of Catastrophic AI Risks
Dan Hendrycks
Mantas Mazeika
Thomas Woodside
SILM
600
247
0
21 Jun 2023
Benchmarking the Influence of Pre-training on Explanation Performance in MR Image Classification
Marta Oliveira
Rick Wilming
Benedict Clark
Céline Budding
Fabian Eitel
K. Ritter
Stefan Haufe
180
1
0
21 Jun 2023
Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events
Neural Information Processing Systems (NeurIPS), 2023
Matthew B. A. McDermott
Bret A. Nestor
Peniel Argaw
I. Kohane
AI4TS
372
41
0
20 Jun 2023
Quilt-1M: One Million Image-Text Pairs for Histopathology
Neural Information Processing Systems (NeurIPS), 2023
Wisdom O. Ikezogwo
M. S. Seyfioglu
Fatemeh Ghezloo
Dylan Stefan Chan Geva
Fatwir Sheikh Mohammed
Pavan Kumar Anand
Ranjay Krishna
Linda G. Shapiro
CLIP, VLM
736
196
0
20 Jun 2023
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification
IEEE Transactions on Big Data (IEEE Trans. Big Data), 2023
Le-le Cao
Vilhelm von Ehrenheim
Mark Granroth-Wilding
Richard Anselmo Stahl
Andrew McCornack
Armin Catovic
Dhiana Deva Cavalcanti Rocha
303
4
0
18 Jun 2023
The Importance of Human-Labeled Data in the Era of LLMs
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Yang Liu
ALM
239
11
0
18 Jun 2023
Reproducibility in NLP: What Have We Learned from the Checklist?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ian H. Magnusson
Noah A. Smith
Jesse Dodge
170
13
0
16 Jun 2023
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Neural Information Processing Systems (NeurIPS), 2023
Kazuki Shimada
Archontis Politis
Parthasaarathy Sudarsanam
D. Krause
Kengo Uchida
...
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Maria Sandsten
Yuki Mitsufuji
273
87
0
15 Jun 2023
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion
International Conference on Machine Learning (ICML), 2023
Isha Rawal
Alexander Matyasko
Shantanu Jaiswal
Basura Fernando
Cheston Tan
276
7
0
15 Jun 2023
LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting
Neural Information Processing Systems (NeurIPS), 2023
Xu Liu
Yutong Xia
Yuxuan Liang
Junfeng Hu
Yiwei Wang
Mengwei He
Chaoqin Huang
Zhenguang Liu
Bryan Hooi
Roger Zimmermann
AI4TS
180
146
0
14 Jun 2023
V-LoL: A Diagnostic Dataset for Visual Logical Learning
Lukas Helff
Wolfgang Stammer
Hikaru Shindo
Devendra Singh Dhami
Kristian Kersting
NAI
302
9
0
13 Jun 2023
Unraveling the Interconnected Axes of Heterogeneity in Machine Learning for Democratic and Inclusive Advancements
Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2023
Maryam Molamohammadi
Afaf Taik
Nicolas Le Roux
G. Farnadi
194
2
0
11 Jun 2023
Evaluating the Social Impact of Generative AI Systems in Systems and Society
Irene Solaiman
Zeerak Talat
William Agnew
Lama Ahmad
Dylan K. Baker
...
Marie-Therese Png
Shubham Singh
A. Strait
Lukas Struppek
Arjun Subramonian
ELM, EGVM
486
150
0
09 Jun 2023
AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs
Neural Information Processing Systems (NeurIPS), 2023
Adam D. Cobb
Anirban Roy
Daniel Elenius
F. M. Heim
Brian Swenson
...
Theodore Bapty
Joseph Hite
K. Ramani
Christopher McComb
Susmit Jha
176
16
0
08 Jun 2023
Explainable Predictive Maintenance
Sepideh Pashami
Sławomir Nowaczyk
Yuantao Fan
Jakub Jakubowski
Nuno Paiva
...
Bruno Veloso
M. Sayed-Mouchaweh
L. Rajaoarisoa
Grzegorz J. Nalepa
João Gama
213
19
0
08 Jun 2023
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Jielin Qiu
Jiacheng Zhu
William Jongwon Han
Aditesh Kumar
Karthik Mittal
...
Linjie Li
Jianfeng Wang
Ding Zhao
Bo Li
Lijuan Wang
VGen
231
14
0
07 Jun 2023
Art and the science of generative AI: A deeper dive
Science, 2023
Ziv Epstein
Aaron Hertzmann
L. Herman
Robert Mahari
M. Frank
...
Jessica Fjeld
Hany Farid
Neil Leach
Alex Pentland
Olga Russakovsky
271
494
0
07 Jun 2023
Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models
Jose Berengueres
Marybeth Sandell
181
0
0
06 Jun 2023
AVIDa-hIL6: A Large-Scale VHH Dataset Produced from an Immunized Alpaca for Predicting Antigen-Antibody Interactions
Neural Information Processing Systems (NeurIPS), 2023
Hirofumi Tsuruta
Hiroyuki Yamazaki
R. Maeda
Ryotaro Tamura
Jennifer Wei
...
Poomarin Phloyphisut
H. Shimokawa
J. Ledsam
Lucy J. Colwell
Akihiro Imura
115
10
0
06 Jun 2023
AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms
Zana Buçinca
Chau Minh Pham
Maurice Jakesch
Marco Tulio Ribeiro
Alexandra Olteanu
Saleema Amershi
205
45
0
05 Jun 2023
NLPositionality: Characterizing Design Biases of Datasets and Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sebastin Santy
Jenny T Liang
Ronan Le Bras
Katharina Reinecke
Maarten Sap
317
106
0
02 Jun 2023
AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap
Q. V. Liao
J. Vaughan
318
222
0
02 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Michael Stephen Saxon
William Yang Wang
EGVM
140
10
0
02 Jun 2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Guilherme Penedo
Quentin Malartic
Daniel Hesslow
Ruxandra-Aimée Cojocaru
Alessandro Cappelli
Hamza Alobeidli
B. Pannier
Ebtesam Almazrouei
Julien Launay
422
881
0
01 Jun 2023
Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness?
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
EGVM
167
18
0
28 May 2023
Optimization's Neglected Normative Commitments
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Benjamin Laufer
T. Gilbert
Helen Nissenbaum
OffRL
218
8
0
27 May 2023
On Degrees of Freedom in Defining and Testing Natural Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Saku Sugawara
S. Tsugita
ELM
326
2
0
24 May 2023
TalkUp: Paving the Way for Understanding Empowering Language
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lucille Njoo
Chan Young Park
Octavia Stappart
Marvin Thielk
Yi Chu
Yulia Tsvetkov
254
4
0
23 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM, LRM
678
1,406
0
17 May 2023
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing
Hua Shen
Huang Chieh-Yang
Tongshuang Wu
Ting-Hao 'Kenneth' Huang
457
45
0
16 May 2023
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Arjun Subramonian
Xingdi Yuan
Hal Daumé
Su Lin Blodgett
205
22
0
15 May 2023
DATED: Guidelines for Creating Synthetic Datasets for Engineering Design Applications
Design Automation Conference (DAC), 2023
Cyril Picard
Jürg Schiffmann
Faez Ahmed
167
14
0
15 May 2023
PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ashok Urlana
Pinzhen Chen
Zheng Zhao
Shay B. Cohen
Manish Shrivastava
Barry Haddow
185
13
0
15 May 2023
Certification Labels for Trustworthy AI: Insights From an Empirical Mixed-Method Study
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Nicolas Scharowski
Michaela Benk
S. J. Kühne
Léane Wettstein
Florian Brühlmann
199
33
0
15 May 2023
What's the Meaning of Superhuman Performance in Today's NLU?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Simone Tedeschi
Johan Bos
T. Declerck
Jan Hajic
Daniel Hershcovich
...
Simon Krek
Steven Schockaert
Rico Sennrich
Ekaterina Shutova
Roberto Navigli
ELM, LM&MA, VLM, ReLM, LRM
309
37
0
15 May 2023
The Ethics of AI in Games
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023
Dávid Melhárt
Julian Togelius
Benedikte Mikkelsen
Christoffer Holmgård
Georgios N. Yannakakis
155
30
0
12 May 2023
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Rahul Aralikatte
Ziling Cheng
Sumanth Doddapaneni
Jackie C.K. Cheung
270
10
0
10 May 2023
When Do Neural Nets Outperform Boosted Trees on Tabular Data?
Neural Information Processing Systems (NeurIPS), 2023
Duncan C. McElfresh
Sujay Khandagale
Jonathan Valverde
C. VishakPrasad
Ben Feuer
Chinmay Hegde
Ganesh Ramakrishnan
Micah Goldblum
Colin White
LMTD
305
248
0
04 May 2023
AutoML-GPT: Automatic Machine Learning with GPT
Shujian Zhang
Chengyue Gong
Lemeng Wu
Xingchao Liu
Mi Zhou
LLMAG
317
89
0
04 May 2023
Judgment Sieve: Reducing Uncertainty in Group Judgments through Interventions Targeting Ambiguity versus Disagreement
Quan Ze Chen
Amy X. Zhang
199
12
0
02 May 2023
SoK: Log Based Transparency Enhancing Technologies
A. Hicks
200
3
0
02 May 2023
Racial Bias within Face Recognition: A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Seyma Yucer
Furkan Tektas
Noura Al Moubayed
T. Breckon
FaML
235
26
0
01 May 2023
Generating Process-Centric Explanations to Enable Contestability in Algorithmic Decision-Making: Challenges and Opportunities
Mireia Yurrita
Agathe Balayn
U. Gadiraju
183
3
0
01 May 2023
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kent K. Chang
Mackenzie Cramer
Sandeep Soni
David Bamman
RALM
591
163
0
28 Apr 2023