v1v2v3v4 (latest)

Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)

27 March 2020

Joelle Pineau

Philippe Vincent-Lamarre

Papers citing "Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)"

50 / 176 papers shown

Title
Towards Reliable and Generalizable Differentially Private Machine Learning (Extended Version) Wenxuan Bao Vincent Bindschaedler AAML 56 0 0 21 Aug 2025
STREAM (ChemBio): A Standard for Transparently Reporting Evaluations in AI Model Reports Tegan McCaslin Jide Alaga Samira Nedungadi Seth Donoughe Tom Reed Rishi Bommasani Chris Painter Luca Righetti 40 0 0 13 Aug 2025
KonfAI: A Modular and Fully Configurable Framework for Deep Learning in Medical Imaging Valentin Boussot Jean-Louis Dillenseger MedIm 20 0 0 13 Aug 2025
A Reproducible, Scalable Pipeline for Synthesizing Autoregressive Model Literature Faruk Alpay Bugra Kilictas Hamdi Alakkad 24 0 0 06 Aug 2025
Graph Lineages and Skeletal Graph Products Eric Mjolsness Cory Braker Scott AI4CE 47 0 0 31 Jul 2025
Reproducibility of Machine Learning-Based Fault Detection and Diagnosis for HVAC Systems in Buildings: An Empirical Study Adil Mukhtar Michael Hadwiger F. Wotawa Gerald Schweiger AI4CE 11 0 0 23 Jul 2025
Beware! The AI Act Can Also Apply to Your AI Research Practices Alina Wernick Kristof Meding 55 0 0 03 Jun 2025
Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data Poli A. Nemkova Solomon Ubani Mark V. Albert AILaw 93 0 0 15 May 2025
Rethink Repeatable Measures of Robot Performance with Statistical Query Bowen Weng L. Capito Guillermo A. Castillo Dylan Khor 139 1 0 13 May 2025
Visual Affordances: Enabling Robots to Understand Object Functionality Tommaso Apicella Alessio Xompero Andrea Cavallaro 183 0 0 08 May 2025
Improving the Reproducibility of Deep Learning Software: An Initial Investigation through a Case Study Analysis Nikita Ravi Abhinav Goel James C. Davis George K. Thiruvathukal 113 0 0 06 May 2025
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Minju Seo Jinheon Baek Seongyun Lee Sung Ju Hwang AI4CE 263 10 0 24 Apr 2025
LEMUR Neural Network Dataset: Towards Seamless AutoML Arash Torabi Goodarzi Roman Kochnev Waleed Khalid Furui Qin Tolgay Atinc Uzun Yashkumar Sanjaybhai Dhameliya Yash Kanubhai Kathiriya Zofia Antonina Bentyn D. Ignatov Radu Timofte 125 3 0 14 Apr 2025
Reproducibility and Artifact Consistency of the SIGIR 2022 Recommender Systems Papers Based on Message Passing Maurizio Ferrari Dacrema Michael Benigni Nicola Ferro 101 0 0 10 Mar 2025
LimeSoDa: A Dataset Collection for Benchmarking of Machine Learning Regressors in Digital Soil Mapping J. Schmidinger S. Vogel V. Barkov A.-D. Pham R. Gebbers ... P. Rosso M. M. Costa R. S. Zandonadi J. Wetterlind M. Atzmueller 166 1 0 27 Feb 2025
Beyond Release: Access Considerations for Generative AI Systems Irene Solaiman Rishi Bommasani Dan Hendrycks Ariel Herbert-Voss Yacine Jernite Aviya Skowron Andrew Trask 266 1 0 23 Feb 2025
Enhancing Code Consistency in AI Research with Large Language Models and Retrieval-Augmented Generation Rajat Keshri Arun George Zachariah Michael Boone 172 0 0 02 Feb 2025
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps Benjamin Ellis Matthew Jackson Andrei Lupu Alexander David Goldie Mattie Fellows Shimon Whiteson Jakob Foerster 185 4 0 22 Dec 2024
Benchmark Data Repositories for Better Benchmarking Rachel Longjohn Markelle Kelly Sameer Singh Padhraic Smyth 151 6 0 31 Oct 2024
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms Jordan Meyer Nick Padgett Cullen Miller Laura Exline 117 7 0 30 Oct 2024
Mitigating Downstream Model Risks via Model Provenance Keyu Wang Abdullah Norozi Iranzad Scott Schaffter Doina Precup Jonathan Lebensold 116 1 0 03 Oct 2024
OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models Heng Yang Jack Cole Ke Li 80 0 0 02 Oct 2024
Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models David Castillo-Bolado Joseph Davidson Finlay Gray Marek Rosa 103 10 0 30 Sep 2024
Confidence intervals uncovered: Are we ready for real-world medical imaging AI? Evangelia Christodoulou Annika Reinke Rola Houhou P. Kalinowski Selen Erkan ... Paul F. Jäger Annette Kopp-Schneider Gaël Varoquaux O. Colliot Lena Maier-Hein OOD 70 5 0 26 Sep 2024
Investigating the Impact of Randomness on Reproducibility in Computer Vision: A Study on Applications in Civil Engineering and Medicine Bahadır Eryılmaz Osman Alperen Koras Jorg Schlotterer Christin Seifert 83 0 0 19 Sep 2024
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark Zachary S. Siegel Sayash Kapoor Nitya Nagdir Benedikt Stroebl Arvind Narayanan 116 23 0 17 Sep 2024
AI Research is not Magic, it has to be Reproducible and Responsible: Challenges in the AI field from the Perspective of its PhD Students Andrea Hrckova Jennifer Renoux Rafael Tolosana Calasanz Daniela Chuda Martin Tamajka Jakub Simko 64 0 0 13 Aug 2024
Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions Evelyn Navarrete Ralph Ewerth Anett Hoppe 77 1 0 08 Aug 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning Andrew Patterson Samuel Neumann Raksha Kumaraswamy Martha White Adam White 85 2 0 26 Jul 2024
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention João D. Nunes D. Montezuma Domingos Oliveira Tania Pereira Jaime S. Cardoso 128 0 0 26 Jul 2024
Questionable practices in machine learning Gavin Leech Juan J. Vazquez Misha Yagudin Niclas Kupper Laurence Aitchison 148 6 0 17 Jul 2024
Generalizability of experimental studies Federico Matteucci Vadim Arzamasov Jose Cribeiro-Ramallo Marco Heyden Konstantin Ntounas Klemens Bohm 141 0 0 25 Jun 2024
Let Guidelines Guide You: A Prescriptive Guideline-Centered Data Annotation Methodology Federico Ruggeri Eleonora Misino Arianna Muti Katerina Korre Paolo Torroni Alberto Barrón-Cedeño 148 1 0 20 Jun 2024
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers Harald Semmelrock Tony Ross-Hellauer Simone Kopeinik Dieter Theiler Armin Haberl Stefan Thalmann Dominik Kowald 212 17 0 20 Jun 2024
ChaosMining: A Benchmark to Evaluate Post-Hoc Local Attribution Methods in Low SNR Environments Ge Shi Ziwen Kan J. Smucny Ian Davidson 124 0 0 17 Jun 2024
GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations Rick Wilming Artur Dox Hjalmar Schulz Marta Oliveira Benedict Clark Stefan Haufe 131 3 0 17 Jun 2024
Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research Surangika Ranathunga Nisansa de Silva Dilith Jayakody Aloka Fernando 96 4 0 10 Jun 2024
Position: Embracing Negative Results in Machine Learning Florian Karl Lukas Malte Kemeter Gabriel Dax Paulina Sierak 117 3 0 06 Jun 2024
Repeatable and Reliable Efforts of Accelerated Risk Assessment L. Capito Guillermo A. Castillo Bowen Weng 84 2 0 30 May 2024
Transfer Learning with Informative Priors: Simple Baselines Better than Previously Reported Ethan Harvey Mikhail Petrov Michael C. Hughes BDL 72 3 0 24 May 2024
Position: Why We Must Rethink Empirical Research in Machine Learning Moritz Herrmann F. J. D. Lange Katharina Eggensperger Giuseppe Casalicchio Marcel Wever Matthias Feurer David Rügamer Eyke Hüllermeier A. Boulesteix Bernd Bischl 124 12 0 03 May 2024
A Partial Replication of MaskFormer in TensorFlow on TPUs for the TensorFlow Model Garden Vishal Purohit Wenxin Jiang Akshath R. Ravikiran James C. Davis 90 1 0 29 Apr 2024
Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review Sergio A. Serrano J. Martínez-Carranza L. Sucar 129 4 0 26 Apr 2024
From Model Performance to Claim: How a Change of Focus in Machine Learning Replicability Can Help Bridge the Responsibility Gap Tianqi Kou 163 3 0 19 Apr 2024
Optimization-Based System Identification and Moving Horizon Estimation Using Low-Cost Sensors for a Miniature Car-Like Robot Sabrina Bodmer Lukas Vogel S. Muntwiler Alexander Hansson Tobias Bodewig Jonas Wahlen Melanie Zeilinger Andrea Carron 82 5 0 12 Apr 2024
Reproducibility and Geometric Intrinsic Dimensionality: An Investigation on Graph Neural Network Research Tobias Hille Maximilian Stubbemann Tom Hanika AI4CE 101 0 0 13 Mar 2024
Supervised machine learning for microbiomics: bridging the gap between current and best practices Natasha K. Dudek Mariam Chakhvadze Saba Kobakhidze Omar Kantidze Yuriy Gankin LM&MA 105 3 0 27 Feb 2024
SzCORE: A Seizure Community Open-source Research Evaluation framework for the validation of EEG-based automated seizure detection algorithms Jonathan Dan U. Pale Alireza Amirshahi William Cappelletti T. Ingolfsson ... Adriano Bernini Luca Benini S. Beniczky David Atienza P. Ryvlin 179 16 0 20 Feb 2024
Reproducibility, Replicability, and Transparency in Research: What 430 Professors Think in Universities across the USA and India Tatiana Chakravorti S. Koneru Sarah Rajtmajer AI4CE 94 2 0 13 Feb 2024
Investigating Reproducibility in Deep Learning-Based Software Fault Prediction Adil Mukhtar Dietmar Jannach Franz Wotawa AI4CE 83 0 0 08 Feb 2024