Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.09010
Cited By
v1
v2
v3
v4
v5
v6
v7
v8 (latest)
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Datasheets for Datasets"
50 / 1,069 papers shown
Societal Adaptation to Advanced AI
Jamie Bernardi
Gabriel Mukobi
Hilary Greaves
Lennart Heim
Markus Anderljung
435
13
0
16 May 2024
Risks and Opportunities of Open-Source Generative AI
Francisco Eiras
Aleksander Petrov
Bertie Vidgen
Christian Schroeder
Fabio Pizzati
...
Matthew Jackson
Phillip H. S. Torr
Trevor Darrell
Y. Lee
Jakob N. Foerster
425
24
0
14 May 2024
BLIP: Facilitating the Exploration of Undesirable Consequences of Digital Technologies
Rock Yuren Pang
Sebastin Santy
René Just
Katharina Reinecke
201
19
0
10 May 2024
Automatic Generation of Model and Data Cards: A Step Towards Responsible AI
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jiarui Liu
Wenkai Li
Zhijing Jin
Mona T. Diab
SyDa
299
11
0
10 May 2024
The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Eve Fleisig
Su Lin Blodgett
Dan Klein
Zeerak Talat
210
35
0
09 May 2024
Natural Language Processing RELIES on Linguistics
Computational Linguistics (CL), 2024
Juri Opitz
Shira Wein
Nathan Schneider
AI4CE
629
11
0
09 May 2024
Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames
Keith Burghardt
Kai Chen
Kristina Lerman
181
1
0
06 May 2024
Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows
Jasmine Y. Shih
Vishal Mohanty
Yannis Katsis
Hariharan Subramonyam
191
3
0
03 May 2024
Social Life Simulation for Non-Cognitive Skills Learning
Zihan Yan
Yaohong Xiang
Yun Huang
215
3
0
01 May 2024
Towards Scenario- and Capability-Driven Dataset Development and Evaluation: An Approach in the Context of Mapless Automated Driving
Felix Grün
Marcus Nolte
Markus Maurer
302
2
0
30 Apr 2024
OpenStreetView-5M: The Many Roads to Global Visual Geolocation
Guillaume Astruc
Nicolas Dufour
Ioannis Siglidis
Constantin Aronssohn
Nacim Bouia
...
Charles Raude
Elliot Vincent
Lintao Xu
Hongyu Zhou
Loic Landrieu
208
32
0
29 Apr 2024
Benchmarking Benchmark Leakage in Large Language Models
Ruijie Xu
Zengzhi Wang
Run-Ze Fan
Pengfei Liu
253
95
0
29 Apr 2024
Mapping the Potential of Explainable AI for Fairness Along the AI Lifecycle
Luca Deck
Astrid Schomacker
Timo Speith
Jakob Schöffer
Lena Kästner
Niklas Kühl
409
4
0
29 Apr 2024
Lazy Data Practices Harm Fairness Research
Jan Simson
Alessandro Fabris
Christoph Kern
198
10
0
26 Apr 2024
Near to Mid-term Risks and Opportunities of Open-Source Generative AI
Francisco Eiras
Aleksandar Petrov
Bertie Vidgen
Christian Schroeder de Witt
Fabio Pizzati
...
Paul Röttger
Juil Sock
Trevor Darrell
Y. Lee
Jakob N. Foerster
291
17
0
25 Apr 2024
Inside the echo chamber: Linguistic underpinnings of misinformation on Twitter
Xinyu Wang
Jiayi Li
Sarah Rajtmajer
83
3
0
24 Apr 2024
Modeling the Sacred: Considerations when Using Religious Texts in Natural Language Processing
Ben Hutchinson
294
0
0
23 Apr 2024
Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?
Shayne Longpre
Robert Mahari
Naana Obeng-Marnu
William Brannon
Tobin South
Katy Gero
Sandy Pentland
Jad Kabbara
297
21
0
19 Apr 2024
AI Competitions and Benchmarks: Dataset Development
Romain Egele
Julio C. S. Jacques Junior
Jan N. van Rijn
Isabelle M Guyon
Xavier Baró
Albert Clapés
Dali Wang
Sergio Escalera
T. Moeslund
Jun Wan
173
0
0
15 Apr 2024
Laissez-Faire Harms: Algorithmic Biases in Generative Language Models
Evan Shieh
Faye-Marie Vassel
Cassidy R. Sugimoto
T. Monroe-White
207
7
0
11 Apr 2024
Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in LLMs
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
Ahmed A. Agiza
Mohamed Mostagir
Sherief Reda
186
7
0
10 Apr 2024
Racial/Ethnic Categories in AI and Algorithmic Fairness: Why They Matter and What They Represent
Conference on Fairness, Accountability and Transparency (FAccT), 2024
Jennifer Mickel
127
12
0
10 Apr 2024
[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Leshem Choshen
Robert Bamler
Michael Y. Hu
Tal Linzen
Aaron Mueller
Candace Ross
Alex Warstadt
Ethan Gotlieb Wilcox
Adina Williams
Chengxu Zhuang
319
37
0
09 Apr 2024
Data Readiness for AI: A 360-Degree Survey
Kaveen Hiniduma
Suren Byna
J. L. Bez
186
20
0
08 Apr 2024
Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors
Chen Huang
Peixin Qin
Yang Deng
Wenqiang Lei
Jiancheng Lv
Tat-Seng Chua
410
15
0
04 Apr 2024
Responsible Reporting for Frontier AI Development
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
Noam Kolt
Markus Anderljung
Joslyn Barnhart
Asher Brass
K. Esvelt
Gillian K. Hadfield
Lennart Heim
Mikel Rodriguez
Jonas B. Sandbrink
Thomas Woodside
268
20
0
03 Apr 2024
Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs
Pengda Wang
Zilin Xiao
Hanjie Chen
Frederick L. Oswald
292
14
0
01 Apr 2024
Designing a User-centric Framework for Information Quality Ranking of Large-scale Street View Images
Tahiya Chowdhury
Ilan Mandel
Jorge Ortiz
Wendy Ju
130
0
0
30 Mar 2024
FISBe: A real-world benchmark dataset for instance segmentation of long-range thin filamentous structures
Lisa Mais
Peter Hirsch
Claire Managan
Ramya Kandarpa
J. L. Rumberger
Annika Reinke
Lena Maier-Hein
Gudrun Ihrke
Dagmar Kainmueller
151
0
0
29 Mar 2024
Benchmarking Object Detectors with COCO: A New Path Forward
Shweta Singh
Aayan Yadav
Jitesh Jain
Humphrey Shi
Justin Johnson
Karan Desai
170
23
0
27 Mar 2024
Decoding the Digital Fine Print: Navigating the potholes in Terms of service/ use of GenAI tools against the emerging need for Transparent and Trustworthy Tech Futures
Sundaraparipurnan Narayanan
135
0
0
26 Mar 2024
Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships
Rangel Daroya
Aaron Sun
Subhransu Maji
348
0
0
25 Mar 2024
Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels
Kexin Luo
Yue Mao
Bei Zhang
Sophie Hao
190
5
0
25 Mar 2024
"We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning
Shreya Shankar
Rolando Garcia
J. M. Hellerstein
Aditya G. Parameswaran
242
22
0
25 Mar 2024
InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection
International Conference on Web and Social Media (ICWSM), 2024
T. Bertaglia
Lily Heisig
Rishabh Kaushal
Adriana Iamnitchi
211
5
0
22 Mar 2024
Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Jeffrey Cheng
Marc Marone
Orion Weller
Dawn J Lawrie
Daniel Khashabi
Benjamin Van Durme
283
46
0
19 Mar 2024
From Melting Pots to Misrepresentations: Exploring Harms in Generative AI
Sanjana Gautam
Pranav Narayanan Venkit
Sourojit Ghosh
185
27
0
16 Mar 2024
Data Ethics Emergency Drill: A Toolbox for Discussing Responsible AI for Industry Teams
Vanessa Hanschke
Dylan Rees
Merve Alanyali
David Hopkinson
Paul Marshall
226
6
0
15 Mar 2024
Couler: Unified Machine Learning Workflow Optimization in Cloud
IEEE International Conference on Data Engineering (ICDE), 2024
Xiaoda Wang
Yuan-ju Tang
Tengda Guo
Bo Sang
Jingji Wu
Jian Sha
Ke Zhang
Jiang Qian
Mingjie Tang
170
0
0
12 Mar 2024
Elephants Never Forget: Testing Language Models for Memorization of Tabular Data
Sebastian Bordt
Harsha Nori
Rich Caruana
LMTD
229
21
0
11 Mar 2024
CommitBench: A Benchmark for Commit Message Generation
IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER), 2024
Maximilian Schall
Tamara Czinczoll
Gerard de Melo
191
9
0
08 Mar 2024
Position: Insights from Survey Methodology can Improve Training Data
Stephanie Eckman
Barbara Plank
Frauke Kreuter
SyDa
310
11
0
02 Mar 2024
Benchmarking zero-shot stance detection with FlanT5-XXL: Insights from training data, prompting, and decoding strategies into its near-SoTA performance
Rachith Aiyappa
Shruthi Senthilmani
Jisun An
Haewoon Kwak
Yong-Yeol Ahn
244
5
0
01 Mar 2024
Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence
Marios Constantinides
Mohammad Tahaei
Daniele Quercia
Simone Stumpf
Michael A. Madaio
...
Ewa Luger
Jess Holbrook
Michael J. Muller
Ilana Golbin Blumenfeld
Giada Pistilli
164
15
0
29 Feb 2024
The Situate AI Guidebook: Co-Designing a Toolkit to Support Multi-Stakeholder Early-stage Deliberations Around Public Sector AI Proposals
Anna Kawakami
Amanda Coston
Haiyi Zhu
Hoda Heidari
Kenneth Holstein
233
38
0
29 Feb 2024
DANSK and DaCy 2.6.0: Domain Generalization of Danish Named Entity Recognition
Kenneth Enevoldsen
Fredrik Jørgensen
Morten H Baglini
202
0
0
28 Feb 2024
An Integrated Data Processing Framework for Pretraining Foundation Models
Yiding Sun
Feng Wang
Yutao Zhu
Wayne Xin Zhao
Jiaxin Mao
275
6
0
26 Feb 2024
Foundation Model Transparency Reports
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Betty Xiong
Sayash Kapoor
Nestor Maslej
Arvind Narayanan
Abigail Z. Jacobs
256
24
0
26 Feb 2024
Towards Fair Graph Anomaly Detection: Problem, New Datasets, and Evaluation
Neng Kai Nigel Neo
Yeon-Chang Lee
Yiqiao Jin
Sang-Wook Kim
Srijan Kumar
298
4
0
25 Feb 2024
Farsight: Fostering Responsible AI Awareness During AI Application Prototyping
Zijie J. Wang
Chinmay Kulkarni
Lauren Wilcox
Michael Terry
Michael A. Madaio
317
71
0
23 Feb 2024
Previous
1
2
3
...
6
7
8
...
20
21
22
Next