ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.09010
  4. Cited By
Datasheets for Datasets

Datasheets for Datasets

23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
ArXivPDFHTML

Papers citing "Datasheets for Datasets"

50 / 966 papers shown
Title
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge
  Conflicts from Wikipedia
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Yufang Hou
Alessandra Pascale
Javier Carnerero-Cano
T. Tchrakian
Radu Marinescu
Elizabeth M. Daly
Inkit Padhi
P. Sattigeri
41
6
0
19 Jun 2024
StableSemantics: A Synthetic Language-Vision Dataset of Semantic
  Representations in Naturalistic Images
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
Rushikesh Zawar
Shaurya Dewan
Andrew F. Luo
Margaret M. Henderson
Michael J. Tarr
Leila Wehbe
VGen
CoGe
36
1
0
19 Jun 2024
Is Your HD Map Constructor Reliable under Sensor Corruptions?
Is Your HD Map Constructor Reliable under Sensor Corruptions?
Xiaoshuai Hao
Mengchuan Wei
Yifan Yang
Haimei Zhao
Hui Zhang
Yi Zhou
Qiang Wang
Weiming Li
Lingdong Kong
Jing Zhang
3DV
42
8
0
18 Jun 2024
IDs for AI Systems
IDs for AI Systems
Alan Chan
Noam Kolt
Peter Wills
Usman Anwar
Christian Schroeder de Witt
Nitarshan Rajkumar
Lewis Hammond
David M. Krueger
Lennart Heim
Markus Anderljung
41
6
0
17 Jun 2024
Centering Policy and Practice: Research Gaps around Usable Differential
  Privacy
Centering Policy and Practice: Research Gaps around Usable Differential Privacy
Rachel Cummings
Jayshree Sarathy
33
7
0
17 Jun 2024
Extrinsic Evaluation of Cultural Competence in Large Language Models
Extrinsic Evaluation of Cultural Competence in Large Language Models
Shaily Bhatt
Fernando Diaz
ELM
EGVM
47
4
0
17 Jun 2024
They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate
  Associative Bias
They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias
Salma Abdel Magid
Jui-Hsien Wang
Kushal Kafle
Hanspeter Pfister
34
1
0
17 Jun 2024
Generalization and Knowledge Transfer in Abstract Visual Reasoning
  Models
Generalization and Knowledge Transfer in Abstract Visual Reasoning Models
Mikołaj Małkiński
Jacek Mañdziuk
32
0
0
16 Jun 2024
RUPBench: Benchmarking Reasoning Under Perturbations for Robustness
  Evaluation in Large Language Models
RUPBench: Benchmarking Reasoning Under Perturbations for Robustness Evaluation in Large Language Models
Yuqing Wang
Yun Zhao
LRM
AAML
ELM
27
1
0
16 Jun 2024
Rideshare Transparency: Translating Gig Worker Insights on AI Platform
  Design to Policy
Rideshare Transparency: Translating Gig Worker Insights on AI Platform Design to Policy
Varun Nagaraj Rao
Samantha Dalal
Eesha Agarwal
D. Calacci
Andrés Monroy-Hernández
24
2
0
16 Jun 2024
DocNet: Semantic Structure in Inductive Bias Detection Models
DocNet: Semantic Structure in Inductive Bias Detection Models
Jessica Zhu
Iain Cruickshank
Michel Cukier
34
0
0
16 Jun 2024
From Pixels to Prose: A Large Dataset of Dense Image Captions
From Pixels to Prose: A Large Dataset of Dense Image Captions
Vasu Singla
Kaiyu Yue
Sukriti Paul
Reza Shirkavand
Mayuka Jayawardhana
Alireza Ganjdanesh
Heng Huang
A. Bhatele
Gowthami Somepalli
Tom Goldstein
3DV
VLM
28
22
0
14 Jun 2024
BABILong: Testing the Limits of LLMs with Long Context
  Reasoning-in-a-Haystack
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Ivan Rodkin
Dmitry Sorokin
Artyom Sorokin
Mikhail Burtsev
RALM
ALM
LRM
ReLM
ELM
42
58
0
14 Jun 2024
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal
  Data
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data
Grigor Bezirganyan
Sana Sellami
Laure Berti-Équille
Sébastien Fournier
24
3
0
14 Jun 2024
TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and
  Heterogeneous Graphs
TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs
J. Gastinger
Shenyang Huang
Mikhail Galkin
Erfan Loghmani
Ali Parviz
...
Emanuele Rossi
Ioannis Koutis
Heiner Stuckenschmidt
Reihaneh Rabbany
Guillaume Rabusseau
46
6
0
14 Jun 2024
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming
Victor-Alexandru Pădurean
Adish Singla
ELM
46
3
0
14 Jun 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Holy Lovenia
Rahmad Mahendra
Salsabil Maulana Akbar
Lester James Validad Miranda
Jennifer Santoso
...
Genta Indra Winata
Ruochen Zhang
Fajri Koto
Zheng-Xin Yong
Samuel Cahyawijaya
77
9
0
14 Jun 2024
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Bahare Fatemi
Mehran Kazemi
Anton Tsitsulin
Karishma Malkan
Jinyeong Yim
John Palowitch
Sungyong Seo
Jonathan J. Halcrow
Bryan Perozzi
LRM
35
26
0
13 Jun 2024
SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image
  Super-Resolution
SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution
Soufiane Belharbi
Mara K M Whitford
Phuong Hoang
Shakeeb Murtaza
Luke McCaffrey
Eric Granger
33
0
0
13 Jun 2024
ECBD: Evidence-Centered Benchmark Design for NLP
ECBD: Evidence-Centered Benchmark Design for NLP
Yu Lu Liu
Su Lin Blodgett
Jackie Chi Kit Cheung
Q. Vera Liao
Alexandra Olteanu
Ziang Xiao
28
10
0
13 Jun 2024
DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
Mohamed Elrefaie
Florin Morar
Angela Dai
Faez Ahmed
PINN
AI4CE
66
14
0
13 Jun 2024
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition
Mehreen Saeed
Adrian Chan
Anupam Mijar
Joseph Moukarzel
Georges Habchi
Carlos Younes
Amin Elias
Chau-Wai Wong
Akram Khater
30
3
0
13 Jun 2024
Are Large Language Models Good Statisticians?
Are Large Language Models Good Statisticians?
Yizhang Zhu
Shiyin Du
Boyan Li
Yuyu Luo
Nan Tang
ELM
27
15
0
12 Jun 2024
DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with
  38 Subclasses
DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 38 Subclasses
Abdurrahim Yilmaz
Sirin Pekcan Yasar
G. Gencoglan
Burak Temelkuran
22
1
0
11 Jun 2024
A Taxonomy of Challenges to Curating Fair Datasets
A Taxonomy of Challenges to Curating Fair Datasets
Dora Zhao
M. Scheuerman
Pooja Chitre
Jerone T. A. Andrews
Georgia Panagiotidou
Shawn Walker
Kathleen H. Pine
Alice Xiang
39
2
0
10 Jun 2024
STimage-1K4M: A histopathology image-gene expression dataset for spatial
  transcriptomics
STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics
Jiawen Chen
Muqing Zhou
Wenrong Wu
Jinwei Zhang
Yun Li
Didong Li
22
6
0
10 Jun 2024
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in
  Low-Resource and Extinct Languages
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
Andrew M. Bean
Simi Hellsten
Harry Mayne
Jabez Magomere
Ethan A. Chi
Ryan A. Chi
Scott A. Hale
Hannah Rose Kirk
ELM
LRM
37
7
0
10 Jun 2024
Situated Ground Truths: Enhancing Bias-Aware AI by Situating Data Labels
  with SituAnnotate
Situated Ground Truths: Enhancing Bias-Aware AI by Situating Data Labels with SituAnnotate
Delfina Sol Martinez Pandiani
Valentina Presutti
22
1
0
10 Jun 2024
LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models
LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models
Lukas Helff
Felix Friedrich
Manuel Brack
Kristian Kersting
P. Schramowski
VLM
46
0
0
07 Jun 2024
Reconfiguring Participatory Design to Resist AI Realism
Reconfiguring Participatory Design to Resist AI Realism
Aakash Gautam
35
3
0
05 Jun 2024
A Standardized Machine-readable Dataset Documentation Format for
  Responsible AI
A Standardized Machine-readable Dataset Documentation Format for Responsible AI
Nitisha Jain
Mubashara Akhtar
Joan Giner-Miguelez
Rajat Shinde
Joaquin Vanschoren
...
Costanza Conforti
Michael Kuchnik
Lora Aroyo
Omar Benjelloun
Elena Simperl
19
2
0
04 Jun 2024
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Li Lin
Santosh
Xin Eric Wang
Shu Hu
Shu Hu
EGVM
81
11
0
02 Jun 2024
Gender Bias Detection in Court Decisions: A Brazilian Case Study
Gender Bias Detection in Court Decisions: A Brazilian Case Study
Raysa Benatti
F. Severi
Sandra Avila
Esther Luna Colombini
31
1
0
01 Jun 2024
WebUOT-1M: Advancing Deep Underwater Object Tracking with A
  Million-Scale Benchmark
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
Chunhui Zhang
Li Liu
Guanjie Huang
Hao-Kai Wen
Xi Zhou
Yanfeng Wang
45
8
0
30 May 2024
A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody
  Language Models
A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models
Hirofumi Tsuruta
Hiroyuki Yamazaki
R. Maeda
Ryotaro Tamura
Akihiro Imura
18
0
0
29 May 2024
Artificial Intelligence in Industry 4.0: A Review of Integration
  Challenges for Industrial Systems
Artificial Intelligence in Industry 4.0: A Review of Integration Challenges for Industrial Systems
Alexander Windmann
Philipp Wittenberg
Marvin Schieseck
Oliver Niggemann
AI4CE
22
4
0
28 May 2024
Trust and Terror: Hazards in Text Reveal Negatively Biased Credulity and
  Partisan Negativity Bias
Trust and Terror: Hazards in Text Reveal Negatively Biased Credulity and Partisan Negativity Bias
Keith Burghardt
D. Fessler
Chyna Tang
Anne C. Pisor
Kristina Lerman
30
0
0
28 May 2024
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in
  Text-to-Image Models
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in Text-to-Image Models
Hanjun Luo
Ziye Deng
Ruizhe Chen
Zuo-Qiang Liu
EGVM
33
9
0
28 May 2024
Privacy-Aware Visual Language Models
Privacy-Aware Visual Language Models
Laurens Samson
Nimrod Barazani
S. Ghebreab
Yukiyasu Asano
PILM
VLM
37
1
0
27 May 2024
Stop! In the Name of Flaws: Disentangling Personal Names and
  Sociodemographic Attributes in NLP
Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP
Vagrant Gautam
Arjun Subramonian
Anne Lauscher
O. Keyes
32
6
0
27 May 2024
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with
  LLM-Enhanced Cardiological Text
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text
Han Yu
Peikun Guo
Akane Sano
34
15
0
26 May 2024
Understanding Stakeholders' Perceptions and Needs Across the LLM Supply
  Chain
Understanding Stakeholders' Perceptions and Needs Across the LLM Supply Chain
Agathe Balayn
Lorenzo Corti
Fanny Rancourt
Fabio Casati
U. Gadiraju
21
5
0
25 May 2024
Paths of A Million People: Extracting Life Trajectories from Wikipedia
Paths of A Million People: Extracting Life Trajectories from Wikipedia
Ying Zhang
Xiaofeng Li
Zhaoyang Liu
Haipeng Zhang
11
0
0
25 May 2024
A Multilingual Similarity Dataset for News Article Frame
A Multilingual Similarity Dataset for News Article Frame
Xi Chen
Mattia Samory
Scott A. Hale
David Jurgens
Przemyslaw A. Grabowicz
11
0
0
22 May 2024
Pragmatic auditing: a pilot-driven approach for auditing Machine
  Learning systems
Pragmatic auditing: a pilot-driven approach for auditing Machine Learning systems
Djalel Benbouzid
Christiane Plociennik
Laura Lucaj
Mihai Maftei
Iris Merget
A. Burchardt
Marc P. Hauer
Abdeldjallil Naceri
Patrick van der Smagt
MLAU
33
0
0
21 May 2024
Cascade-based Randomization for Inferring Causal Effects under Diffusion
  Interference
Cascade-based Randomization for Inferring Causal Effects under Diffusion Interference
Zahra Fatemi
Jean Pouget-Abadie
Elena Zheleva
CML
19
0
0
20 May 2024
On Efficient and Statistical Quality Estimation for Data Annotation
On Efficient and Statistical Quality Estimation for Data Annotation
Jan-Christoph Klie
Juan Haladjian
Marc Kirchner
Rahul Nair
17
1
0
20 May 2024
Societal Adaptation to Advanced AI
Societal Adaptation to Advanced AI
Jamie Bernardi
Gabriel Mukobi
Hilary Greaves
Lennart Heim
Markus Anderljung
40
4
0
16 May 2024
Risks and Opportunities of Open-Source Generative AI
Risks and Opportunities of Open-Source Generative AI
Francisco Eiras
Aleksander Petrov
Bertie Vidgen
Christian Schroeder
Fabio Pizzati
...
Matthew Jackson
Phillip H. S. Torr
Trevor Darrell
Y. Lee
Jakob N. Foerster
40
18
0
14 May 2024
BLIP: Facilitating the Exploration of Undesirable Consequences of
  Digital Technologies
BLIP: Facilitating the Exploration of Undesirable Consequences of Digital Technologies
Rock Yuren Pang
Sebastin Santy
René Just
Katharina Reinecke
32
14
0
10 May 2024
Previous
12345...181920
Next