ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.09010
  4. Cited By
Datasheets for Datasets

Datasheets for Datasets

23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
ArXivPDFHTML

Papers citing "Datasheets for Datasets"

50 / 966 papers shown
Title
Realistic Synthetic Financial Transactions for Anti-Money Laundering
  Models
Realistic Synthetic Financial Transactions for Anti-Money Laundering Models
Erik Altman
Jovan Blanuvsa
Luc von Niederhäusern
Béni Egressy
Andreea Anghel
Kubilay Atasu
42
37
0
22 Jun 2023
Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities
Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities
Xudong Shen
H. Brown
Jiashu Tao
Martin Strobel
Yao Tong
Akshay Narayan
Harold Soh
Finale Doshi-Velez
27
3
0
22 Jun 2023
VisoGender: A dataset for benchmarking gender bias in image-text pronoun
  resolution
VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
S. Hall
F. G. Abrantes
Hanwen Zhu
Grace A. Sodunke
Aleksandar Shtedritski
Hannah Rose Kirk
CoGe
21
39
0
21 Jun 2023
Benchmark data to study the influence of pre-training on explanation
  performance in MR image classification
Benchmark data to study the influence of pre-training on explanation performance in MR image classification
Marta Oliveira
Rick Wilming
Benedict Clark
Céline Budding
Fabian Eitel
K. Ritter
Stefan Haufe
16
1
0
21 Jun 2023
An Overview of Catastrophic AI Risks
An Overview of Catastrophic AI Risks
Dan Hendrycks
Mantas Mazeika
Thomas Woodside
SILM
26
165
0
21 Jun 2023
Event Stream GPT: A Data Pre-processing and Modeling Library for
  Generative, Pre-trained Transformers over Continuous-time Sequences of
  Complex Events
Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events
Matthew B. A. McDermott
Bret A. Nestor
Peniel Argaw
I. Kohane
AI4TS
24
21
0
20 Jun 2023
Quilt-1M: One Million Image-Text Pairs for Histopathology
Quilt-1M: One Million Image-Text Pairs for Histopathology
Wisdom O. Ikezogwo
M. S. Seyfioglu
Fatemeh Ghezloo
Dylan Stefan Chan Geva
Fatwir Sheikh Mohammed
Pavan Kumar Anand
Ranjay Krishna
Linda G. Shapiro
CLIP
VLM
139
114
0
20 Jun 2023
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity
  Quantification
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification
Le-le Cao
Vilhelm von Ehrenheim
Mark Granroth-Wilding
Richard Anselmo Stahl
Andrew McCornack
Armin Catovic
Dhiana Deva Cavalcanti Rocha
35
3
0
18 Jun 2023
The Importance of Human-Labeled Data in the Era of LLMs
The Importance of Human-Labeled Data in the Era of LLMs
Yang Liu
ALM
17
8
0
18 Jun 2023
Reproducibility in NLP: What Have We Learned from the Checklist?
Reproducibility in NLP: What Have We Learned from the Checklist?
Ian H. Magnusson
Noah A. Smith
Jesse Dodge
20
11
0
16 Jun 2023
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes
  with Spatiotemporal Annotations of Sound Events
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Kazuki Shimada
A. Politis
Parthasaarathy Sudarsanam
D. Krause
Kengo Uchida
...
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Tuomas Virtanen
Yuki Mitsufuji
63
36
0
15 Jun 2023
Dissecting Multimodality in VideoQA Transformer Models by Impairing
  Modality Fusion
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion
Isha Rawal
Alexander Matyasko
Shantanu Jaiswal
Basura Fernando
Cheston Tan
21
1
0
15 Jun 2023
LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting
LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting
Xu Liu
Yutong Xia
Yuxuan Liang
Junfeng Hu
Yiwei Wang
Lei Bai
Chaoqin Huang
Zhenguang Liu
Bryan Hooi
Roger Zimmermann
AI4TS
14
64
0
14 Jun 2023
V-LoL: A Diagnostic Dataset for Visual Logical Learning
V-LoL: A Diagnostic Dataset for Visual Logical Learning
Lukas Helff
Wolfgang Stammer
Hikaru Shindo
D. Dhami
Kristian Kersting
NAI
19
3
0
13 Jun 2023
Unraveling the Interconnected Axes of Heterogeneity in Machine Learning
  for Democratic and Inclusive Advancements
Unraveling the Interconnected Axes of Heterogeneity in Machine Learning for Democratic and Inclusive Advancements
Maryam Molamohammadi
Afaf Taik
Nicolas Le Roux
G. Farnadi
29
1
0
11 Jun 2023
Evaluating the Social Impact of Generative AI Systems in Systems and
  Society
Evaluating the Social Impact of Generative AI Systems in Systems and Society
Irene Solaiman
Zeerak Talat
William Agnew
Lama Ahmad
Dylan K. Baker
...
Marie-Therese Png
Shubham Singh
A. Strait
Lukas Struppek
Arjun Subramonian
ELM
EGVM
31
104
0
09 Jun 2023
AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle
  Designs
AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs
Adam D. Cobb
Anirban Roy
Daniel Elenius
F. M. Heim
Brian Swenson
...
Theodore Bapty
Joseph Hite
K. Ramani
Christopher McComb
Susmit Jha
20
7
0
08 Jun 2023
Explainable Predictive Maintenance
Explainable Predictive Maintenance
Sepideh Pashami
Sławomir Nowaczyk
Yuantao Fan
Jakub Jakubowski
Nuno Paiva
...
Bruno Veloso
M. Sayed-Mouchaweh
L. Rajaoarisoa
Grzegorz J. Nalepa
João Gama
32
8
0
08 Jun 2023
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation
  of Videos
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Jielin Qiu
Jiacheng Zhu
William Jongwon Han
Aditesh Kumar
Karthik Mittal
...
Linjie Li
Jianfeng Wang
Ding Zhao
Bo Li
Lijuan Wang
VGen
16
5
0
07 Jun 2023
Art and the science of generative AI: A deeper dive
Art and the science of generative AI: A deeper dive
Ziv Epstein
Aaron Hertzmann
L. Herman
Robert Mahari
M. Frank
...
Jessica Fjeld
Hany Farid
Neil Leach
Alex Pentland
Olga Russakovsky
23
291
0
07 Jun 2023
Applying Standards to Advance Upstream & Downstream Ethics in Large
  Language Models
Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models
Jose Berengueres
Marybeth Sandell
27
0
0
06 Jun 2023
AVIDa-hIL6: A Large-Scale VHH Dataset Produced from an Immunized Alpaca
  for Predicting Antigen-Antibody Interactions
AVIDa-hIL6: A Large-Scale VHH Dataset Produced from an Immunized Alpaca for Predicting Antigen-Antibody Interactions
Hirofumi Tsuruta
Hiroyuki Yamazaki
R. Maeda
Ryotaro Tamura
Jennifer Wei
...
Poomarin Phloyphisut
H. Shimokawa
J. Ledsam
Lucy J. Colwell
Akihiro Imura
11
7
0
06 Jun 2023
AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms
AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms
Zana Buçinca
Chau Minh Pham
Maurice Jakesch
Marco Tulio Ribeiro
Alexandra Olteanu
Saleema Amershi
25
35
0
05 Jun 2023
NLPositionality: Characterizing Design Biases of Datasets and Models
NLPositionality: Characterizing Design Biases of Datasets and Models
Sebastin Santy
Jenny T Liang
Ronan Le Bras
Katharina Reinecke
Maarten Sap
30
77
0
02 Jun 2023
AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap
AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap
Q. V. Liao
J. Vaughan
38
158
0
02 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Multilingual Conceptual Coverage in Text-to-Image Models
Michael Stephen Saxon
William Yang Wang
EGVM
24
8
0
02 Jun 2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora
  with Web Data, and Web Data Only
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Guilherme Penedo
Quentin Malartic
Daniel Hesslow
Ruxandra-Aimée Cojocaru
Alessandro Cappelli
Hamza Alobeidli
B. Pannier
Ebtesam Almazrouei
Julien Launay
27
749
0
01 Jun 2023
Mitigating Inappropriateness in Image Generation: Can there be Value in
  Reflecting the World's Ugliness?
Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness?
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
EGVM
18
13
0
28 May 2023
Optimization's Neglected Normative Commitments
Optimization's Neglected Normative Commitments
Benjamin Laufer
T. Gilbert
Helen Nissenbaum
OffRL
21
4
0
27 May 2023
On Degrees of Freedom in Defining and Testing Natural Language
  Understanding
On Degrees of Freedom in Defining and Testing Natural Language Understanding
Saku Sugawara
S. Tsugita
ELM
26
1
0
24 May 2023
TalkUp: Paving the Way for Understanding Empowering Language
TalkUp: Paving the Way for Understanding Empowering Language
Lucille Njoo
Chan Young Park
Octavia Stappart
Marvin Thielk
Yi Chu
Yulia Tsvetkov
16
3
0
23 May 2023
PaLM 2 Technical Report
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
86
1,147
0
17 May 2023
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to
  Support Human-AI Scientific Writing
ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing
Hua Shen
Huang Chieh-Yang
Tongshuang Wu
Ting-Hao 'Kenneth' Huang
23
37
0
16 May 2023
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and
  Measurements of Performance
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Arjun Subramonian
Xingdi Yuan
Hal Daumé
Su Lin Blodgett
39
17
0
15 May 2023
DATED: Guidelines for Creating Synthetic Datasets for Engineering Design
  Applications
DATED: Guidelines for Creating Synthetic Datasets for Engineering Design Applications
Cyril Picard
Jürg Schiffmann
Faez Ahmed
32
8
0
15 May 2023
PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for
  Languages in India
PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India
Ashok Urlana
Pinzhen Chen
Zheng Zhao
Shay B. Cohen
Manish Shrivastava
Barry Haddow
29
9
0
15 May 2023
Certification Labels for Trustworthy AI: Insights From an Empirical
  Mixed-Method Study
Certification Labels for Trustworthy AI: Insights From an Empirical Mixed-Method Study
Nicolas Scharowski
Michaela Benk
S. J. Kühne
Léane Wettstein
Florian Brühlmann
30
12
0
15 May 2023
What's the Meaning of Superhuman Performance in Today's NLU?
What's the Meaning of Superhuman Performance in Today's NLU?
Simone Tedeschi
Johan Bos
T. Declerck
Jan Hajic
Daniel Hershcovich
...
Simon Krek
Steven Schockaert
Rico Sennrich
Ekaterina Shutova
Roberto Navigli
ELM
LM&MA
VLM
ReLM
LRM
34
26
0
15 May 2023
The Ethics of AI in Games
The Ethics of AI in Games
Dávid Melhárt
Julian Togelius
Benedikte Mikkelsen
Christoffer Holmgård
Georgios N. Yannakakis
25
24
0
12 May 2023
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages
Rahul Aralikatte
Ziling Cheng
Sumanth Doddapaneni
Jackie C.K. Cheung
40
8
0
10 May 2023
When Do Neural Nets Outperform Boosted Trees on Tabular Data?
When Do Neural Nets Outperform Boosted Trees on Tabular Data?
Duncan C. McElfresh
Sujay Khandagale
Jonathan Valverde
C. VishakPrasad
Ben Feuer
Chinmay Hegde
Ganesh Ramakrishnan
Micah Goldblum
Colin White
LMTD
26
130
0
04 May 2023
AutoML-GPT: Automatic Machine Learning with GPT
AutoML-GPT: Automatic Machine Learning with GPT
Shujian Zhang
Chengyue Gong
Lemeng Wu
Xingchao Liu
Mi Zhou
LLMAG
61
60
0
04 May 2023
Judgment Sieve: Reducing Uncertainty in Group Judgments through
  Interventions Targeting Ambiguity versus Disagreement
Judgment Sieve: Reducing Uncertainty in Group Judgments through Interventions Targeting Ambiguity versus Disagreement
Quan Ze Chen
Amy X. Zhang
32
7
0
02 May 2023
SoK: Log Based Transparency Enhancing Technologies
SoK: Log Based Transparency Enhancing Technologies
A. Hicks
26
1
0
02 May 2023
Racial Bias within Face Recognition: A Survey
Racial Bias within Face Recognition: A Survey
Seyma Yucer
Furkan Tektas
Noura Al Moubayed
T. Breckon
FaML
38
10
0
01 May 2023
Generating Process-Centric Explanations to Enable Contestability in
  Algorithmic Decision-Making: Challenges and Opportunities
Generating Process-Centric Explanations to Enable Contestability in Algorithmic Decision-Making: Challenges and Opportunities
Mireia Yurrita
Agathe Balayn
U. Gadiraju
26
2
0
01 May 2023
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Kent K. Chang
Mackenzie Cramer
Sandeep Soni
David Bamman
RALM
145
111
0
28 Apr 2023
Understanding accountability in algorithmic supply chains
Understanding accountability in algorithmic supply chains
Jennifer Cobbe
Michael Veale
Jatinder Singh
50
60
0
28 Apr 2023
A Group-Specific Approach to NLP for Hate Speech Detection
A Group-Specific Approach to NLP for Hate Speech Detection
Karina Halevy
10
1
0
21 Apr 2023
Auditing and Generating Synthetic Data with Controllable Trust
  Trade-offs
Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
Brian M. Belgodere
Pierre L. Dognin
Adam Ivankay
Igor Melnyk
Youssef Mroueh
...
Mattia Rigotti
Jerret Ross
Yair Schiff
Radhika Vedpathak
Richard A. Young
24
12
0
21 Apr 2023
Previous
123...8910...181920
Next