arXiv:1803.09010
Datasheets for Datasets


23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford

Papers citing "Datasheets for Datasets"

50 / 966 papers shown
Automatic Generation of Model and Data Cards: A Step Towards Responsible AI
Jiarui Liu
Wenkai Li
Zhijing Jin
Mona T. Diab
SyDa
55
3
0
10 May 2024
The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
Eve Fleisig
Su Lin Blodgett
Dan Klein
Zeerak Talat
27
13
0
09 May 2024
Natural Language Processing RELIES on Linguistics
Juri Opitz
Shira Wein
Nathan Schneider
AI4CE
44
7
0
09 May 2024
Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames
Keith Burghardt
Kai Chen
Kristina Lerman
32
0
0
06 May 2024
Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows
Jasmine Y. Shih
Vishal Mohanty
Yannis Katsis
Hariharan Subramonyam
25
1
0
03 May 2024
Social Life Simulation for Non-Cognitive Skills Learning
Zihan Yan
Yaohong Xiang
Yun Huang
26
1
0
01 May 2024
Towards Scenario- and Capability-Driven Dataset Development and Evaluation: An Approach in the Context of Mapless Automated Driving
Felix Grün
Marcus Nolte
Markus Maurer
27
1
0
30 Apr 2024
OpenStreetView-5M: The Many Roads to Global Visual Geolocation
Guillaume Astruc
Nicolas Dufour
Ioannis Siglidis
Constantin Aronssohn
Nacim Bouia
...
Charles Raude
Elliot Vincent
Lintao Xu
Hongyu Zhou
Loic Landrieu
27
6
0
29 Apr 2024
Benchmarking Benchmark Leakage in Large Language Models
Ruijie Xu
Zengzhi Wang
Run-Ze Fan
Pengfei Liu
56
42
0
29 Apr 2024
Mapping the Potential of Explainable AI for Fairness Along the AI Lifecycle
Luca Deck
Astrid Schomacker
Timo Speith
Jakob Schöffer
Lena Kästner
Niklas Kühl
33
4
0
29 Apr 2024
Lazy Data Practices Harm Fairness Research
Jan Simson
Alessandro Fabris
Christoph Kern
18
5
0
26 Apr 2024
Near to Mid-term Risks and Opportunities of Open-Source Generative AI
Francisco Eiras
Aleksandar Petrov
Bertie Vidgen
Christian Schroeder de Witt
Fabio Pizzati
...
Paul Röttger
Philip H. S. Torr
Trevor Darrell
Y. Lee
Jakob N. Foerster
44
5
0
25 Apr 2024
Inside the echo chamber: Linguistic underpinnings of misinformation on Twitter
Xinyu Wang
Jiayi Li
Sarah Rajtmajer
34
0
0
24 Apr 2024
Modeling the Sacred: Considerations when Using Religious Texts in Natural Language Processing
Ben Hutchinson
85
0
0
23 Apr 2024
Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?
Shayne Longpre
Robert Mahari
Naana Obeng-Marnu
William Brannon
Tobin South
Katy Gero
Sandy Pentland
Jad Kabbara
56
5
0
19 Apr 2024
AI Competitions and Benchmarks: Dataset Development
Romain Egele
Julio C. S. Jacques Junior
Jan N. van Rijn
Isabelle M Guyon
Xavier Baró
Albert Clapés
Prasanna Balaprakash
Sergio Escalera
T. Moeslund
Jun Wan
42
0
0
15 Apr 2024
Laissez-Faire Harms: Algorithmic Biases in Generative Language Models
Evan Shieh
Faye-Marie Vassel
Cassidy R. Sugimoto
T. Monroe-White
30
3
0
11 Apr 2024
Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in LLMs
Ahmed A. Agiza
Mohamed Mostagir
Sherief Reda
25
7
0
10 Apr 2024
Racial/Ethnic Categories in AI and Algorithmic Fairness: Why They Matter and What They Represent
Jennifer Mickel
25
5
0
10 Apr 2024
[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Leshem Choshen
Ryan Cotterell
Michael Y. Hu
Tal Linzen
Aaron Mueller
Candace Ross
Alex Warstadt
Ethan Gotlieb Wilcox
Adina Williams
Chengxu Zhuang
26
22
0
09 Apr 2024
Data Readiness for AI: A 360-Degree Survey
Kaveen Hiniduma
Suren Byna
J. L. Bez
38
6
0
08 Apr 2024
Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors
Chen Huang
Peixin Qin
Yang Deng
Wenqiang Lei
Jiancheng Lv
Tat-Seng Chua
32
6
0
04 Apr 2024
Responsible Reporting for Frontier AI Development
Noam Kolt
Markus Anderljung
Joslyn Barnhart
Asher Brass
K. Esvelt
Gillian K. Hadfield
Lennart Heim
Mikel Rodriguez
Jonas B. Sandbrink
Thomas Woodside
42
13
0
03 Apr 2024
Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs
Pengda Wang
Zilin Xiao
Hanjie Chen
Frederick L. Oswald
27
6
0
01 Apr 2024
Designing a User-centric Framework for Information Quality Ranking of Large-scale Street View Images
Tahiya Chowdhury
Ilan Mandel
Jorge Ortiz
Wendy Ju
27
0
0
30 Mar 2024
FISBe: A real-world benchmark dataset for instance segmentation of long-range thin filamentous structures
Lisa Mais
Peter Hirsch
Claire Managan
Ramya Kandarpa
J. L. Rumberger
Annika Reinke
Lena Maier-Hein
Gudrun Ihrke
Dagmar Kainmueller
23
0
0
29 Mar 2024
Benchmarking Object Detectors with COCO: A New Path Forward
Shweta Singh
Aayan Yadav
Jitesh Jain
Humphrey Shi
Justin Johnson
Karan Desai
23
5
0
27 Mar 2024
Decoding the Digital Fine Print: Navigating the potholes in Terms of service/ use of GenAI tools against the emerging need for Transparent and Trustworthy Tech Futures
Sundaraparipurnan Narayanan
26
0
0
26 Mar 2024
Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships
Rangel Daroya
Aaron Sun
Subhransu Maji
27
0
0
25 Mar 2024
Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels
Kexin Luo
Yue Mao
Bei Zhang
Sophie Hao
24
1
0
25 Mar 2024
"We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning
Shreya Shankar
Rolando Garcia
J. M. Hellerstein
Aditya G. Parameswaran
22
10
0
25 Mar 2024
InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection
T. Bertaglia
Lily Heisig
Rishabh Kaushal
Adriana Iamnitchi
20
4
0
22 Mar 2024
Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Jeffrey Cheng
Marc Marone
Orion Weller
Dawn J Lawrie
Daniel Khashabi
Benjamin Van Durme
59
12
0
19 Mar 2024
From Melting Pots to Misrepresentations: Exploring Harms in Generative AI
Sanjana Gautam
Pranav Narayanan Venkit
Sourojit Ghosh
39
15
0
16 Mar 2024
Data Ethics Emergency Drill: A Toolbox for Discussing Responsible AI for Industry Teams
Vanessa Hanschke
Dylan Rees
Merve Alanyali
David Hopkinson
Paul Marshall
48
2
0
15 Mar 2024
Couler: Unified Machine Learning Workflow Optimization in Cloud
Xiaoda Wang
Yuan-ju Tang
Tengda Guo
Bo Sang
Jingji Wu
Jian Sha
Ke Zhang
Jiang Qian
Mingjie Tang
25
0
0
12 Mar 2024
Elephants Never Forget: Testing Language Models for Memorization of Tabular Data
Sebastian Bordt
Harsha Nori
Rich Caruana
LMTD
33
13
0
11 Mar 2024
CommitBench: A Benchmark for Commit Message Generation
Maximilian Schall
Tamara Czinczoll
Gerard de Melo
14
3
0
08 Mar 2024
Position: Insights from Survey Methodology can Improve Training Data
Stephanie Eckman
Barbara Plank
Frauke Kreuter
SyDa
33
3
0
02 Mar 2024
Benchmarking zero-shot stance detection with FlanT5-XXL: Insights from training data, prompting, and decoding strategies into its near-SoTA performance
Rachith Aiyappa
Shruthi Senthilmani
Jisun An
Haewoon Kwak
Yong-Yeol Ahn
24
3
0
01 Mar 2024
Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence
Marios Constantinides
Mohammad Tahaei
Daniele Quercia
Simone Stumpf
Michael A. Madaio
...
Ewa Luger
Jess Holbrook
Michael J. Muller
Ilana Golbin Blumenfeld
Giada Pistilli
49
7
0
29 Feb 2024
The Situate AI Guidebook: Co-Designing a Toolkit to Support Multi-Stakeholder Early-stage Deliberations Around Public Sector AI Proposals
Anna Kawakami
Amanda Coston
Haiyi Zhu
Hoda Heidari
Kenneth Holstein
36
23
0
29 Feb 2024
DANSK and DaCy 2.6.0: Domain Generalization of Danish Named Entity Recognition
K. Enevoldsen
Fredrik Jørgensen
Morten H Baglini
23
0
0
28 Feb 2024
An Integrated Data Processing Framework for Pretraining Foundation Models
Yiding Sun
Feng Wang
Yutao Zhu
Wayne Xin Zhao
Jiaxin Mao
67
4
0
26 Feb 2024
Foundation Model Transparency Reports
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Betty Xiong
Sayash Kapoor
Nestor Maslej
Arvind Narayanan
Percy Liang
32
15
0
26 Feb 2024
Towards Fair Graph Anomaly Detection: Problem, New Datasets, and Evaluation
Neng Kai Nigel Neo
Yeon-Chang Lee
Yiqiao Jin
Sang-Wook Kim
Srijan Kumar
41
2
0
25 Feb 2024
Farsight: Fostering Responsible AI Awareness During AI Application Prototyping
Zijie J. Wang
Chinmay Kulkarni
Lauren Wilcox
Michael Terry
Michael A. Madaio
38
43
0
23 Feb 2024
Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
Emily Reif
Crystal Qian
James Wexler
Minsuk Kahng
30
10
0
21 Feb 2024
Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia
Tzu-Sheng Kuo
Aaron L Halfaker
Zirui Cheng
Jiwoo Kim
Meng-Hsin Wu
Tongshuang Wu
Kenneth Holstein
Haiyi Zhu
57
21
0
21 Feb 2024
The METRIC-framework for assessing data quality for trustworthy AI in medicine: a systematic review
Daniel Schwabe
Katinka Becker
Martin Seyferth
Andreas Klass
Tobias Schäffter
29
20
0
21 Feb 2024