Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1709.09500
Cited By
Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets
27 September 2017
Rotem Dror
G. Baumer
Marina Bogomolov
Roi Reichart
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets"
36 / 36 papers shown
Title
Does a Large Language Model Really Speak in Human-Like Language?
Mose Park
Yunjin Choi
Jong-June Jeon
DeLMO
129
1
0
03 Jan 2025
Benchmark Data Repositories for Better Benchmarking
Neural Information Processing Systems (NeurIPS), 2024
Rachel Longjohn
Markelle Kelly
Sameer Singh
Padhraic Smyth
231
10
0
31 Oct 2024
LMEMs for post-hoc analysis of HPO Benchmarking
Anton Geburek
Neeratyoy Mallik
Danny Stoll
Xavier Bouthillier
Frank Hutter
135
1
0
05 Aug 2024
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers
Harald Semmelrock
Tony Ross-Hellauer
Simone Kopeinik
Dieter Theiler
Armin Haberl
Stefan Thalmann
Dominik Kowald
398
29
0
20 Jun 2024
Small Effect Sizes in Malware Detection? Make Harder Train/Test Splits!
Tirth Patel
Fred Lu
Edward Raff
Charles K. Nicholas
Cynthia Matuszek
James Holt
140
3
0
25 Dec 2023
Faithful Model Evaluation for Model-Based Metrics
Palash Goyal
Qian Hu
Rahul Gupta
70
1
0
19 Dec 2023
Data Similarity is Not Enough to Explain Language Model Performance
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Gregory Yauney
Emily Reif
David M. Mimno
189
9
0
15 Nov 2023
A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations
Hang Chen
Keqing Du
Chenguang Li
Xinyu Yang
282
3
0
02 Nov 2023
Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit Tests
Neural Information Processing Systems (NeurIPS), 2023
Edward Raff
James Holt
156
8
0
27 Oct 2023
Reproducibility in Machine Learning-Driven Research
Harald Semmelrock
Simone Kopeinik
Dieter Theiler
Tony Ross-Hellauer
Dominik Kowald
AI4CE
164
30
0
19 Jul 2023
The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond Sociodemographics
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Matthias Orlikowski
Paul Röttger
Philipp Cimiano
Italy
151
41
0
20 Jun 2023
A Two-Sided Discussion of Preregistration of NLP Research
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Anders Søgaard
Daniel Hershcovich
Miryam de Lhoneux
OnRL
AI4CE
190
4
0
20 Feb 2023
Towards Inferential Reproducibility of Machine Learning Research
International Conference on Learning Representations (ICLR), 2023
Michael Hagmann
Philipp Meier
Stefan Riezler
408
3
0
08 Feb 2023
BMX: Boosting Natural Language Generation Metrics with Explainability
Findings (Findings), 2022
Christoph Leiter
Hoang-Quan Nguyen
Steffen Eger
ELM
174
0
0
20 Dec 2022
BiasBed -- Rigorous Texture Bias Evaluation
Computer Vision and Pattern Recognition (CVPR), 2022
Nikolai Kalischek
Rodrigo Caye Daudt
T. Peters
Reinhard Furrer
Jan Dirk Wegner
Konrad Schindler
128
2
0
23 Nov 2022
Assessing Resource-Performance Trade-off of Natural Language Models using Data Envelopment Analysis
Z. Zhou
Alisha Zachariah
D. Conathan
Jeffery Kline
68
0
0
02 Nov 2022
Dialect-robust Evaluation of Generated Text
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jiao Sun
Thibault Sellam
Elizabeth Clark
Tu Vu
Timothy Dozat
Dan Garrette
Aditya Siddhant
Jacob Eisenstein
Sebastian Gehrmann
174
25
0
02 Nov 2022
A Review and Roadmap of Deep Learning Causal Discovery in Different Variable Paradigms
Hang Chen
Keqing Du
Xinyu Yang
Chenguang Li
CML
145
13
0
14 Sep 2022
Experimental Standards for Deep Learning in Natural Language Processing Research
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Dennis Ulmer
Elisa Bassignana
Max Müller-Eberstein
Daniel Varab
Mike Zhang
Rob van der Goot
Christian Hardmeier
Barbara Plank
228
12
0
13 Apr 2022
A Siren Song of Open Source Reproducibility
Edward Raff
Andrew L. Farris
113
9
0
09 Apr 2022
Does the Market of Citations Reward Reproducible Work?
Edward Raff
HAI
CML
120
15
0
08 Apr 2022
The worst of both worlds: A comparative analysis of errors in learning from data in psychology and machine learning
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022
Jessica Hullman
Sayash Kapoor
Priyanka Nanayakkara
Andrew Gelman
Arvind Narayanan
367
42
0
12 Mar 2022
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
Amir Feder
Katherine A. Keith
Emaad A. Manzoor
Reid Pryzant
Dhanya Sridhar
...
Roi Reichart
Margaret E. Roberts
Brandon M Stewart
Victor Veitch
Diyi Yang
CML
288
285
0
02 Sep 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
215
104
0
14 Jul 2021
The Zero Resource Speech Challenge 2021: Spoken language modelling
Interspeech (Interspeech), 2021
Ewan Dunbar
Mathieu Bernard
Nicolas Hamilakis
Tu Nguyen
Maureen de Seyssel
Patricia Roze
M. Rivière
Eugene Kharitonov
Emmanuel Dupoux
242
56
0
29 Apr 2021
LazyDAgger: Reducing Context Switching in Interactive Imitation Learning
Ryan Hoque
Ashwin Balakrishna
Carl Putterman
Michael Luo
Daniel S. Brown
Daniel Seita
Brijen Thananjeyan
Ellen R. Novoseller
Ken Goldberg
397
59
0
31 Mar 2021
Accounting for Variance in Machine Learning Benchmarks
Conference on Machine Learning and Systems (MLSys), 2021
Xavier Bouthillier
Pierre Delaunay
Mirko Bronzi
Assya Trofimov
Brennan Nichyporuk
...
Dmitriy Serdyuk
Tal Arbel
C. Pal
Gaël Varoquaux
Pascal Vincent
189
172
0
01 Mar 2021
Decoding EEG Brain Activity for Multi-Modal Natural Language Processing
Frontiers in Human Neuroscience (Front Hum Neurosci), 2021
Nora Hollenstein
Cédric Renggli
B. Glaus
Maria Barrett
M. Troendle
N. Langer
Ce Zhang
267
43
0
17 Feb 2021
CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation
Xinping Liu
Zehong Cao
Son N. Tran
FedML
125
0
0
24 Sep 2020
On the Choice of Auxiliary Languages for Improved Sequence Tagging
Lukas Lange
Heike Adel
Jannik Strötgen
107
5
0
19 May 2020
The Structured Weighted Violations MIRA
Dor Ringel
Rotem Dror
Roi Reichart
61
0
0
09 May 2020
Quantifying the Semantic Core of Gender Systems
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Adina Williams
Robert Bamler
Lawrence Wolf-Sonkin
Damián E. Blasi
Hanna M. Wallach
110
18
0
29 Oct 2019
CogniVal: A Framework for Cognitive Word Embedding Evaluation
Conference on Computational Natural Language Learning (CoNLL), 2019
Nora Hollenstein
A. D. L. Torre
N. Langer
Ce Zhang
210
72
0
19 Sep 2019
A Step Toward Quantifying Independently Reproducible Machine Learning Research
Neural Information Processing Systems (NeurIPS), 2019
Edward Raff
141
145
0
14 Sep 2019
Show Your Work: Improved Reporting of Experimental Results
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Jesse Dodge
Suchin Gururangan
Dallas Card
Roy Schwartz
Noah A. Smith
191
274
0
06 Sep 2019
Advancing NLP with Cognitive Language Processing Signals
Nora Hollenstein
Maria Barrett
M. Troendle
Francesco Bigiolli
N. Langer
Ce Zhang
190
40
0
04 Apr 2019
1