ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.12206
  4. Cited By
Improving Reproducibility in Machine Learning Research (A Report from
  the NeurIPS 2019 Reproducibility Program)
v1v2v3v4 (latest)

Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)

27 March 2020
Joelle Pineau
Philippe Vincent-Lamarre
Koustuv Sinha
V. Larivière
A. Beygelzimer
Florence dÁlché-Buc
E. Fox
Hugo Larochelle
ArXiv (abs)PDFHTML

Papers citing "Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)"

50 / 176 papers shown
Title
Towards Reliable and Generalizable Differentially Private Machine Learning (Extended Version)
Towards Reliable and Generalizable Differentially Private Machine Learning (Extended Version)
Wenxuan Bao
Vincent Bindschaedler
AAML
56
0
0
21 Aug 2025
STREAM (ChemBio): A Standard for Transparently Reporting Evaluations in AI Model Reports
STREAM (ChemBio): A Standard for Transparently Reporting Evaluations in AI Model Reports
Tegan McCaslin
Jide Alaga
Samira Nedungadi
Seth Donoughe
Tom Reed
Rishi Bommasani
Chris Painter
Luca Righetti
40
0
0
13 Aug 2025
KonfAI: A Modular and Fully Configurable Framework for Deep Learning in Medical Imaging
KonfAI: A Modular and Fully Configurable Framework for Deep Learning in Medical Imaging
Valentin Boussot
Jean-Louis Dillenseger
MedIm
20
0
0
13 Aug 2025
A Reproducible, Scalable Pipeline for Synthesizing Autoregressive Model Literature
A Reproducible, Scalable Pipeline for Synthesizing Autoregressive Model Literature
Faruk Alpay
Bugra Kilictas
Hamdi Alakkad
24
0
0
06 Aug 2025
Graph Lineages and Skeletal Graph Products
Graph Lineages and Skeletal Graph Products
Eric Mjolsness
Cory Braker Scott
AI4CE
47
0
0
31 Jul 2025
Reproducibility of Machine Learning-Based Fault Detection and Diagnosis for HVAC Systems in Buildings: An Empirical Study
Reproducibility of Machine Learning-Based Fault Detection and Diagnosis for HVAC Systems in Buildings: An Empirical Study
Adil Mukhtar
Michael Hadwiger
F. Wotawa
Gerald Schweiger
AI4CE
11
0
0
23 Jul 2025
Beware! The AI Act Can Also Apply to Your AI Research Practices
Beware! The AI Act Can Also Apply to Your AI Research Practices
Alina Wernick
Kristof Meding
55
0
0
03 Jun 2025
Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data
Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data
Poli A. Nemkova
Solomon Ubani
Mark V. Albert
AILaw
93
0
0
15 May 2025
Rethink Repeatable Measures of Robot Performance with Statistical Query
Rethink Repeatable Measures of Robot Performance with Statistical Query
Bowen Weng
L. Capito
Guillermo A. Castillo
Dylan Khor
139
1
0
13 May 2025
Visual Affordances: Enabling Robots to Understand Object Functionality
Visual Affordances: Enabling Robots to Understand Object Functionality
Tommaso Apicella
Alessio Xompero
Andrea Cavallaro
183
0
0
08 May 2025
Improving the Reproducibility of Deep Learning Software: An Initial Investigation through a Case Study Analysis
Improving the Reproducibility of Deep Learning Software: An Initial Investigation through a Case Study Analysis
Nikita Ravi
Abhinav Goel
James C. Davis
George K. Thiruvathukal
113
0
0
06 May 2025
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Minju Seo
Jinheon Baek
Seongyun Lee
Sung Ju Hwang
AI4CE
263
10
0
24 Apr 2025
LEMUR Neural Network Dataset: Towards Seamless AutoML
LEMUR Neural Network Dataset: Towards Seamless AutoML
Arash Torabi Goodarzi
Roman Kochnev
Waleed Khalid
Furui Qin
Tolgay Atinc Uzun
Yashkumar Sanjaybhai Dhameliya
Yash Kanubhai Kathiriya
Zofia Antonina Bentyn
D. Ignatov
Radu Timofte
125
3
0
14 Apr 2025
Reproducibility and Artifact Consistency of the SIGIR 2022 Recommender Systems Papers Based on Message Passing
Maurizio Ferrari Dacrema
Michael Benigni
Nicola Ferro
101
0
0
10 Mar 2025
LimeSoDa: A Dataset Collection for Benchmarking of Machine Learning Regressors in Digital Soil Mapping
LimeSoDa: A Dataset Collection for Benchmarking of Machine Learning Regressors in Digital Soil Mapping
J. Schmidinger
S. Vogel
V. Barkov
A.-D. Pham
R. Gebbers
...
P. Rosso
M. M. Costa
R. S. Zandonadi
J. Wetterlind
M. Atzmueller
166
1
0
27 Feb 2025
Beyond Release: Access Considerations for Generative AI Systems
Beyond Release: Access Considerations for Generative AI Systems
Irene Solaiman
Rishi Bommasani
Dan Hendrycks
Ariel Herbert-Voss
Yacine Jernite
Aviya Skowron
Andrew Trask
266
1
0
23 Feb 2025
Enhancing Code Consistency in AI Research with Large Language Models and Retrieval-Augmented Generation
Enhancing Code Consistency in AI Research with Large Language Models and Retrieval-Augmented Generation
Rajat Keshri
Arun George Zachariah
Michael Boone
172
0
0
02 Feb 2025
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam
  Timesteps
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps
Benjamin Ellis
Matthew Jackson
Andrei Lupu
Alexander David Goldie
Mattie Fellows
Shimon Whiteson
Jakob Foerster
185
4
0
22 Dec 2024
Benchmark Data Repositories for Better Benchmarking
Benchmark Data Repositories for Better Benchmarking
Rachel Longjohn
Markelle Kelly
Sameer Singh
Padhraic Smyth
151
6
0
31 Oct 2024
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel
  Governance Mechanisms
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms
Jordan Meyer
Nick Padgett
Cullen Miller
Laura Exline
117
7
0
30 Oct 2024
Mitigating Downstream Model Risks via Model Provenance
Mitigating Downstream Model Risks via Model Provenance
Keyu Wang
Abdullah Norozi Iranzad
Scott Schaffter
Doina Precup
Jonathan Lebensold
116
1
0
03 Oct 2024
OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic
  Foundation Models
OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models
Heng Yang
Jack Cole
Ke Li
80
0
0
02 Oct 2024
Beyond Prompts: Dynamic Conversational Benchmarking of Large Language
  Models
Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models
David Castillo-Bolado
Joseph Davidson
Finlay Gray
Marek Rosa
103
10
0
30 Sep 2024
Confidence intervals uncovered: Are we ready for real-world medical
  imaging AI?
Confidence intervals uncovered: Are we ready for real-world medical imaging AI?
Evangelia Christodoulou
Annika Reinke
Rola Houhou
P. Kalinowski
Selen Erkan
...
Paul F. Jäger
Annette Kopp-Schneider
Gaël Varoquaux
O. Colliot
Lena Maier-Hein
OOD
70
5
0
26 Sep 2024
Investigating the Impact of Randomness on Reproducibility in Computer
  Vision: A Study on Applications in Civil Engineering and Medicine
Investigating the Impact of Randomness on Reproducibility in Computer Vision: A Study on Applications in Civil Engineering and Medicine
Bahadır Eryılmaz
Osman Alperen Koras
Jorg Schlotterer
Christin Seifert
83
0
0
19 Sep 2024
CORE-Bench: Fostering the Credibility of Published Research Through a
  Computational Reproducibility Agent Benchmark
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark
Zachary S. Siegel
Sayash Kapoor
Nitya Nagdir
Benedikt Stroebl
Arvind Narayanan
116
23
0
17 Sep 2024
AI Research is not Magic, it has to be Reproducible and Responsible:
  Challenges in the AI field from the Perspective of its PhD Students
AI Research is not Magic, it has to be Reproducible and Responsible: Challenges in the AI field from the Perspective of its PhD Students
Andrea Hrckova
Jennifer Renoux
Rafael Tolosana Calasanz
Daniela Chuda
Martin Tamajka
Jakub Simko
64
0
0
13 Aug 2024
Saliency Detection in Educational Videos: Analyzing the Performance of
  Current Models, Identifying Limitations and Advancement Directions
Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions
Evelyn Navarrete
Ralph Ewerth
Anett Hoppe
77
1
0
08 Aug 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement
  Learning
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
85
2
0
26 Jul 2024
A Survey on Cell Nuclei Instance Segmentation and Classification:
  Leveraging Context and Attention
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention
João D. Nunes
D. Montezuma
Domingos Oliveira
Tania Pereira
Jaime S. Cardoso
128
0
0
26 Jul 2024
Questionable practices in machine learning
Questionable practices in machine learning
Gavin Leech
Juan J. Vazquez
Misha Yagudin
Niclas Kupper
Laurence Aitchison
148
6
0
17 Jul 2024
Generalizability of experimental studies
Generalizability of experimental studies
Federico Matteucci
Vadim Arzamasov
Jose Cribeiro-Ramallo
Marco Heyden
Konstantin Ntounas
Klemens Bohm
141
0
0
25 Jun 2024
Let Guidelines Guide You: A Prescriptive Guideline-Centered Data
  Annotation Methodology
Let Guidelines Guide You: A Prescriptive Guideline-Centered Data Annotation Methodology
Federico Ruggeri
Eleonora Misino
Arianna Muti
Katerina Korre
Paolo Torroni
Alberto Barrón-Cedeño
148
1
0
20 Jun 2024
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers
Harald Semmelrock
Tony Ross-Hellauer
Simone Kopeinik
Dieter Theiler
Armin Haberl
Stefan Thalmann
Dominik Kowald
212
17
0
20 Jun 2024
ChaosMining: A Benchmark to Evaluate Post-Hoc Local Attribution Methods
  in Low SNR Environments
ChaosMining: A Benchmark to Evaluate Post-Hoc Local Attribution Methods in Low SNR Environments
Ge Shi
Ziwen Kan
J. Smucny
Ian Davidson
124
0
0
17 Jun 2024
GECOBench: A Gender-Controlled Text Dataset and Benchmark for
  Quantifying Biases in Explanations
GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations
Rick Wilming
Artur Dox
Hjalmar Schulz
Marta Oliveira
Benedict Clark
Stefan Haufe
131
3
0
17 Jun 2024
Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP
  Research
Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research
Surangika Ranathunga
Nisansa de Silva
Dilith Jayakody
Aloka Fernando
96
4
0
10 Jun 2024
Position: Embracing Negative Results in Machine Learning
Position: Embracing Negative Results in Machine Learning
Florian Karl
Lukas Malte Kemeter
Gabriel Dax
Paulina Sierak
117
3
0
06 Jun 2024
Repeatable and Reliable Efforts of Accelerated Risk Assessment
Repeatable and Reliable Efforts of Accelerated Risk Assessment
L. Capito
Guillermo A. Castillo
Bowen Weng
84
2
0
30 May 2024
Transfer Learning with Informative Priors: Simple Baselines Better than
  Previously Reported
Transfer Learning with Informative Priors: Simple Baselines Better than Previously Reported
Ethan Harvey
Mikhail Petrov
Michael C. Hughes
BDL
72
3
0
24 May 2024
Position: Why We Must Rethink Empirical Research in Machine Learning
Position: Why We Must Rethink Empirical Research in Machine Learning
Moritz Herrmann
F. J. D. Lange
Katharina Eggensperger
Giuseppe Casalicchio
Marcel Wever
Matthias Feurer
David Rügamer
Eyke Hüllermeier
A. Boulesteix
Bernd Bischl
124
12
0
03 May 2024
A Partial Replication of MaskFormer in TensorFlow on TPUs for the
  TensorFlow Model Garden
A Partial Replication of MaskFormer in TensorFlow on TPUs for the TensorFlow Model Garden
Vishal Purohit
Wenxin Jiang
Akshath R. Ravikiran
James C. Davis
90
1
0
29 Apr 2024
Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic
  Review
Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review
Sergio A. Serrano
J. Martínez-Carranza
L. Sucar
129
4
0
26 Apr 2024
From Model Performance to Claim: How a Change of Focus in Machine Learning Replicability Can Help Bridge the Responsibility Gap
From Model Performance to Claim: How a Change of Focus in Machine Learning Replicability Can Help Bridge the Responsibility Gap
Tianqi Kou
163
3
0
19 Apr 2024
Optimization-Based System Identification and Moving Horizon Estimation
  Using Low-Cost Sensors for a Miniature Car-Like Robot
Optimization-Based System Identification and Moving Horizon Estimation Using Low-Cost Sensors for a Miniature Car-Like Robot
Sabrina Bodmer
Lukas Vogel
S. Muntwiler
Alexander Hansson
Tobias Bodewig
Jonas Wahlen
Melanie Zeilinger
Andrea Carron
82
5
0
12 Apr 2024
Reproducibility and Geometric Intrinsic Dimensionality: An Investigation
  on Graph Neural Network Research
Reproducibility and Geometric Intrinsic Dimensionality: An Investigation on Graph Neural Network Research
Tobias Hille
Maximilian Stubbemann
Tom Hanika
AI4CE
101
0
0
13 Mar 2024
Supervised machine learning for microbiomics: bridging the gap between
  current and best practices
Supervised machine learning for microbiomics: bridging the gap between current and best practices
Natasha K. Dudek
Mariam Chakhvadze
Saba Kobakhidze
Omar Kantidze
Yuriy Gankin
LM&MA
105
3
0
27 Feb 2024
SzCORE: A Seizure Community Open-source Research Evaluation framework
  for the validation of EEG-based automated seizure detection algorithms
SzCORE: A Seizure Community Open-source Research Evaluation framework for the validation of EEG-based automated seizure detection algorithms
Jonathan Dan
U. Pale
Alireza Amirshahi
William Cappelletti
T. Ingolfsson
...
Adriano Bernini
Luca Benini
S. Beniczky
David Atienza
P. Ryvlin
179
16
0
20 Feb 2024
Reproducibility, Replicability, and Transparency in Research: What 430
  Professors Think in Universities across the USA and India
Reproducibility, Replicability, and Transparency in Research: What 430 Professors Think in Universities across the USA and India
Tatiana Chakravorti
S. Koneru
Sarah Rajtmajer
AI4CE
94
2
0
13 Feb 2024
Investigating Reproducibility in Deep Learning-Based Software Fault
  Prediction
Investigating Reproducibility in Deep Learning-Based Software Fault Prediction
Adil Mukhtar
Dietmar Jannach
Franz Wotawa
AI4CE
83
0
0
08 Feb 2024
1234
Next