Papers citing 'Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets'

Title
Does a Large Language Model Really Speak in Human-Like Language? Mose Park Yunjin Choi Jong-June Jeon DeLMO 129 1 0 03 Jan 2025
Benchmark Data Repositories for Better BenchmarkingNeural Information Processing Systems (NeurIPS), 2024 Rachel Longjohn Markelle Kelly Sameer Singh Padhraic Smyth 231 10 0 31 Oct 2024
LMEMs for post-hoc analysis of HPO Benchmarking Anton Geburek Neeratyoy Mallik Danny Stoll Xavier Bouthillier Frank Hutter 135 1 0 05 Aug 2024
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers Harald Semmelrock Tony Ross-Hellauer Simone Kopeinik Dieter Theiler Armin Haberl Stefan Thalmann Dominik Kowald 398 29 0 20 Jun 2024
Small Effect Sizes in Malware Detection? Make Harder Train/Test Splits! Tirth Patel Fred Lu Edward Raff Charles K. Nicholas Cynthia Matuszek James Holt 140 3 0 25 Dec 2023
Faithful Model Evaluation for Model-Based Metrics Palash Goyal Qian Hu Rahul Gupta 70 1 0 19 Dec 2023
Data Similarity is Not Enough to Explain Language Model PerformanceConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Gregory Yauney Emily Reif David M. Mimno 189 9 0 15 Nov 2023
A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations Hang Chen Keqing Du Chenguang Li Xinyu Yang 282 3 0 02 Nov 2023
Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit TestsNeural Information Processing Systems (NeurIPS), 2023 Edward Raff James Holt 156 8 0 27 Oct 2023
Reproducibility in Machine Learning-Driven Research Harald Semmelrock Simone Kopeinik Dieter Theiler Tony Ross-Hellauer Dominik Kowald AI4CE 164 30 0 19 Jul 2023
The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond SociodemographicsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Matthias Orlikowski Paul Röttger Philipp Cimiano Italy 151 41 0 20 Jun 2023
A Two-Sided Discussion of Preregistration of NLP ResearchConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023 Anders Søgaard Daniel Hershcovich Miryam de Lhoneux OnRL AI4CE 190 4 0 20 Feb 2023
Towards Inferential Reproducibility of Machine Learning ResearchInternational Conference on Learning Representations (ICLR), 2023 Michael Hagmann Philipp Meier Stefan Riezler 408 3 0 08 Feb 2023
BMX: Boosting Natural Language Generation Metrics with ExplainabilityFindings (Findings), 2022 Christoph Leiter Hoang-Quan Nguyen Steffen Eger ELM 174 0 0 20 Dec 2022
BiasBed -- Rigorous Texture Bias EvaluationComputer Vision and Pattern Recognition (CVPR), 2022 Nikolai Kalischek Rodrigo Caye Daudt T. Peters Reinhard Furrer Jan Dirk Wegner Konrad Schindler 128 2 0 23 Nov 2022
Assessing Resource-Performance Trade-off of Natural Language Models using Data Envelopment Analysis Z. Zhou Alisha Zachariah D. Conathan Jeffery Kline 68 0 0 02 Nov 2022
Dialect-robust Evaluation of Generated TextAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Jiao Sun Thibault Sellam Elizabeth Clark Tu Vu Timothy Dozat Dan Garrette Aditya Siddhant Jacob Eisenstein Sebastian Gehrmann 174 25 0 02 Nov 2022
A Review and Roadmap of Deep Learning Causal Discovery in Different Variable Paradigms Hang Chen Keqing Du Xinyu Yang Chenguang Li CML 145 13 0 14 Sep 2022
Experimental Standards for Deep Learning in Natural Language Processing ResearchConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Dennis Ulmer Elisa Bassignana Max Müller-Eberstein Daniel Varab Mike Zhang Rob van der Goot Christian Hardmeier Barbara Plank 228 12 0 13 Apr 2022
A Siren Song of Open Source Reproducibility Edward Raff Andrew L. Farris 113 9 0 09 Apr 2022
Does the Market of Citations Reward Reproducible Work? Edward Raff HAI CML 120 15 0 08 Apr 2022
The worst of both worlds: A comparative analysis of errors in learning from data in psychology and machine learningAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022 Jessica Hullman Sayash Kapoor Priyanka Nanayakkara Andrew Gelman Arvind Narayanan 367 42 0 12 Mar 2022
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond Amir Feder Katherine A. Keith Emaad A. Manzoor Reid Pryzant Dhanya Sridhar ... Roi Reichart Margaret E. Roberts Brandon M Stewart Victor Veitch Diyi Yang CML 288 285 0 02 Sep 2021
The Benchmark Lottery Mostafa Dehghani Yi Tay A. Gritsenko Zhe Zhao N. Houlsby Fernando Diaz Donald Metzler Oriol Vinyals 215 104 0 14 Jul 2021
The Zero Resource Speech Challenge 2021: Spoken language modellingInterspeech (Interspeech), 2021 Ewan Dunbar Mathieu Bernard Nicolas Hamilakis Tu Nguyen Maureen de Seyssel Patricia Roze M. Rivière Eugene Kharitonov Emmanuel Dupoux 242 56 0 29 Apr 2021
LazyDAgger: Reducing Context Switching in Interactive Imitation Learning Ryan Hoque Ashwin Balakrishna Carl Putterman Michael Luo Daniel S. Brown Daniel Seita Brijen Thananjeyan Ellen R. Novoseller Ken Goldberg 397 59 0 31 Mar 2021
Accounting for Variance in Machine Learning BenchmarksConference on Machine Learning and Systems (MLSys), 2021 Xavier Bouthillier Pierre Delaunay Mirko Bronzi Assya Trofimov Brennan Nichyporuk ... Dmitriy Serdyuk Tal Arbel C. Pal Gaël Varoquaux Pascal Vincent 189 172 0 01 Mar 2021
Decoding EEG Brain Activity for Multi-Modal Natural Language ProcessingFrontiers in Human Neuroscience (Front Hum Neurosci), 2021 Nora Hollenstein Cédric Renggli B. Glaus Maria Barrett M. Troendle N. Langer Ce Zhang 267 43 0 17 Feb 2021
CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation Xinping Liu Zehong Cao Son N. Tran FedML 125 0 0 24 Sep 2020
On the Choice of Auxiliary Languages for Improved Sequence Tagging Lukas Lange Heike Adel Jannik Strötgen 107 5 0 19 May 2020
The Structured Weighted Violations MIRA Dor Ringel Rotem Dror Roi Reichart 61 0 0 09 May 2020
Quantifying the Semantic Core of Gender SystemsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019 Adina Williams Robert Bamler Lawrence Wolf-Sonkin Damián E. Blasi Hanna M. Wallach 110 18 0 29 Oct 2019
CogniVal: A Framework for Cognitive Word Embedding EvaluationConference on Computational Natural Language Learning (CoNLL), 2019 Nora Hollenstein A. D. L. Torre N. Langer Ce Zhang 210 72 0 19 Sep 2019
A Step Toward Quantifying Independently Reproducible Machine Learning ResearchNeural Information Processing Systems (NeurIPS), 2019 Edward Raff 141 145 0 14 Sep 2019
Show Your Work: Improved Reporting of Experimental ResultsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019 Jesse Dodge Suchin Gururangan Dallas Card Roy Schwartz Noah A. Smith 191 274 0 06 Sep 2019
Advancing NLP with Cognitive Language Processing Signals Nora Hollenstein Maria Barrett M. Troendle Francesco Bigiolli N. Langer Ce Zhang 190 40 0 04 Apr 2019

129

1

0

03 Jan 2025

Benchmark Data Repositories for Better BenchmarkingNeural Information Processing Systems (NeurIPS), 2024

231

10

0

31 Oct 2024

LMEMs for post-hoc analysis of HPO Benchmarking

Anton Geburek

Neeratyoy Mallik

Danny Stoll

Xavier Bouthillier

Frank Hutter

135

1

0

05 Aug 2024

Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers

398

29

0

20 Jun 2024

Small Effect Sizes in Malware Detection? Make Harder Train/Test Splits!

140

3

0

25 Dec 2023

Faithful Model Evaluation for Model-Based Metrics

Palash Goyal

Qian Hu

Rahul Gupta

70

1

0

19 Dec 2023

Data Similarity is Not Enough to Explain Language Model PerformanceConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Gregory Yauney

Emily Reif

David M. Mimno

189

9

0

15 Nov 2023

A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations

282

3

0

02 Nov 2023

Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit TestsNeural Information Processing Systems (NeurIPS), 2023

Edward Raff

James Holt

156

8

0

27 Oct 2023

Reproducibility in Machine Learning-Driven Research

164

30

0

19 Jul 2023

The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond SociodemographicsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Matthias Orlikowski

Paul Röttger

Philipp Cimiano

Italy

151

41

0

20 Jun 2023

A Two-Sided Discussion of Preregistration of NLP ResearchConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Anders Søgaard

Daniel Hershcovich

Miryam de Lhoneux

OnRL AI4CE

190

4

0

20 Feb 2023