Underspecification Presents Challenges for Credibility in Modern Machine Learning

6 November 2020
Alexander D'Amour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
Alex Beutel
Christina W. Chen
Jonathan Deaton
Jacob Eisenstein
Matthew D. Hoffman
F. Hormozdiari
N. Houlsby
Shaobo Hou
Ghassen Jerfel
Alan Karthikesalingam
Mario Lucic
Yi-An Ma
Cory Y. McLean
Diana Mincu
A. Mitani
Andrea Montanari
Zachary Nado
Vivek Natarajan
Christopher Nielson
T. Osborne
R. Raman
K. Ramasamy
Rory Sayres
Jessica Schrouff
Martin G. Seneviratne
Shannon Sequeira
Harini Suresh
Victor Veitch
Max Vladymyrov
Xuezhi Wang
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
    OffRL

Papers citing "Underspecification Presents Challenges for Credibility in Modern Machine Learning"

50 / 351 papers shown
Title
How Emotionally Stable is ALBERT? Testing Robustness with Stochastic Weight Averaging on a Sentiment Analysis Task
Urja Khurana
Eric T. Nalisnick
Antske Fokkens
MoMe
14
6
0
18 Nov 2021
Selective Ensembles for Consistent Predictions
Emily Black
Klas Leino
Matt Fredrikson
12
21
0
16 Nov 2021
MassFormer: Tandem Mass Spectrum Prediction for Small Molecules using Graph Transformers
A. Young
Bo Wang
Hannes L. Röst
21
5
0
08 Nov 2021
Partial Order in Chaos: Consensus on Feature Attributions in the Rashomon Set
Gabriel Laberge
Y. Pequignot
Alexandre Mathieu
Foutse Khomh
M. Marchand
FAtt
17
6
0
26 Oct 2021
Identifying and Benchmarking Natural Out-of-Context Prediction Problems
David Madras
D. Psaltis
CML
OOD
16
4
0
25 Oct 2021
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP
Andreas Fürst
Elisabeth Rumetshofer
Johannes Lehner
Viet-Hung Tran
Fei Tang
...
David P. Kreil
Michael K Kopp
G. Klambauer
Angela Bitto-Nemling
Sepp Hochreiter
VLM
CLIP
199
102
0
21 Oct 2021
No One Representation to Rule Them All: Overlapping Features of Training Methods
Raphael Gontijo-Lopes
Yann N. Dauphin
E. D. Cubuk
18
58
0
20 Oct 2021
Quantifying the Task-Specific Information in Text-Based Classifications
Zining Zhu
Aparna Balagopalan
Marzyeh Ghassemi
Frank Rudzicz
26
4
0
17 Oct 2021
Robustness Challenges in Model Distillation and Pruning for Natural Language Understanding
Mengnan Du
Subhabrata Mukherjee
Yu Cheng
Milad Shokouhi
Xia Hu
Ahmed Hassan Awadallah
44
13
0
16 Oct 2021
Dropout Prediction Uncertainty Estimation Using Neuron Activation Strength
Haichao Yu
Zhe Chen
Dong Lin
G. Shamir
Jie Han
UQCV
25
0
0
13 Oct 2021
Trivial or impossible -- dichotomous data difficulty masks model differences (on ImageNet and beyond)
Kristof Meding
Luca M. Schulze Buschoff
Robert Geirhos
Felix Wichmann
28
40
0
12 Oct 2021
Distinguishing rule- and exemplar-based generalization in learning systems
Ishita Dasgupta
Erin Grant
Thomas L. Griffiths
8
13
0
08 Oct 2021
Machine Learning Featurizations for AI Hacking of Political Systems
Nathan Sanders
B. Schneier
15
2
0
08 Oct 2021
Consistent Counterfactuals for Deep Models
Emily Black
Zifan Wang
Matt Fredrikson
Anupam Datta
BDL
OffRL
OOD
41
43
0
06 Oct 2021
Machine Learning Practices Outside Big Tech: How Resource Constraints Challenge Responsible Development
Aspen K. Hopkins
Serena Booth
24
45
0
06 Oct 2021
Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations
Andreas Triantafyllopoulos
M. Milling
K. Drossos
Björn W. Schuller
21
7
0
04 Oct 2021
Expected Validation Performance and Estimation of a Random Variable's Maximum
Jesse Dodge
Suchin Gururangan
Dallas Card
Roy Schwartz
Noah A. Smith
41
9
0
01 Oct 2021
Classification and Adversarial examples in an Overparameterized Linear Model: A Signal Processing Perspective
Adhyyan Narang
Vidya Muthukumar
A. Sahai
SILM
AAML
13
1
0
27 Sep 2021
Neural forecasting at scale
Philippe Chatigny
Shengrui Wang
Jean-Marc Patenaude
Boris N. Oreshkin
AI4TS
20
1
0
20 Sep 2021
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning
Marc Tanti
Lonneke van der Plas
Claudia Borg
Albert Gatt
18
10
0
14 Sep 2021
Assessing the Reliability of Word Embedding Gender Bias Measures
Yupei Du
Qixiang Fang
D. Nguyen
46
21
0
10 Sep 2021
Desiderata for Representation Learning: A Causal Perspective
Yixin Wang
Michael I. Jordan
CML
25
80
0
08 Sep 2021
Fishr: Invariant Gradient Variances for Out-of-Distribution Generalization
Alexandre Ramé
Corentin Dancette
Matthieu Cord
OOD
28
204
0
07 Sep 2021
Robust fine-tuning of zero-shot models
Mitchell Wortsman
Gabriel Ilharco
Jong Wook Kim
Mike Li
Simon Kornblith
...
Raphael Gontijo-Lopes
Hannaneh Hajishirzi
Ali Farhadi
Hongseok Namkoong
Ludwig Schmidt
VLM
23
688
0
04 Sep 2021
Artificial Intelligence in Dry Eye Disease
Andrea Storås
Inga Strümke
Michael A. Riegler
J. Grauslund
Hugo Lewi Hammer
Anis Yazidi
P. Halvorsen
K. Gundersen
T. Utheim
C. Jackson
MedIm
17
38
0
02 Sep 2021
Network Generalization Prediction for Safety Critical Tasks in Novel Operating Domains
Molly O'Brien
Michael Medoff
Julia V. Bukowski
Gregory Hager
OOD
13
3
0
17 Aug 2021
Robustness testing of AI systems: A case study for traffic sign recognition
Christian Berghoff
Pavol Bielik
Matthias Neu
Petar Tsankov
Arndt von Twickel
AAML
8
13
0
13 Aug 2021
Beyond Fairness Metrics: Roadblocks and Challenges for Ethical AI in Practice
Jiahao Chen
Victor Storchan
Eren Kurshan
9
10
0
11 Aug 2021
Grounding Representation Similarity with Statistical Testing
Frances Ding
Jean-Stanislas Denain
Jacob Steinhardt
11
30
0
03 Aug 2021
Your fairness may vary: Pretrained language model fairness in toxic text classification
Ioana Baldini
Dennis L. Wei
K. Ramamurthy
Mikhail Yurochkin
Moninder Singh
12
53
0
03 Aug 2021
Artificial Intelligence in Healthcare: Lost In Translation?
V. Madai
David C. Higgins
11
4
0
28 Jul 2021
An Instance-Dependent Simulation Framework for Learning with Label Noise
Keren Gu
Xander Masotto
Vandana Bachani
Balaji Lakshminarayanan
Jack Nikodem
Dong Yin
NoLa
11
24
0
23 Jul 2021
Responsible and Regulatory Conform Machine Learning for Medicine: A Survey of Challenges and Solutions
Eike Petersen
Yannik Potdevin
Esfandiar Mohammadi
Stephan Zidowitz
Sabrina Breyer
...
Sandra Henn
Ludwig Pechmann
M. Leucker
P. Rostalski
Christian Herzog
FaML
AILaw
OOD
19
21
0
20 Jul 2021
Visual Representation Learning Does Not Generalize Strongly Within the Same Domain
Lukas Schott
Julius von Kügelgen
Frederik Träuble
Peter V. Gehler
Chris Russell
Matthias Bethge
Bernhard Schölkopf
Francesco Locatello
Wieland Brendel
OOD
DRL
29
66
0
17 Jul 2021
Artificial Intelligence in PET: an Industry Perspective
Arkadiusz Sitek
Sangtae Ahn
E. Asma
A. Chandler
Alvin Ihsani
S. Prevrhal
Arman Rahmim
Babak Saboury
K. Thielemans
14
5
0
14 Jul 2021
A Topological-Framework to Improve Analysis of Machine Learning Model Performance
Henry Kvinge
Colby Wight
Sarah Akers
Scott Howland
W. Choi
Xiaolong Ma
Luke J. Gosink
E. Jurrus
K. Kappagantula
Tegan H. Emerson
29
0
0
09 Jul 2021
Accuracy on the Line: On the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization
John Miller
Rohan Taori
Aditi Raghunathan
Shiori Sagawa
Pang Wei Koh
Vaishaal Shankar
Percy Liang
Y. Carmon
Ludwig Schmidt
OODD
OOD
14
266
0
09 Jul 2021
Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification
Alkis Gotovos
R. Burkholz
John Quackenbush
Stefanie Jegelka
11
8
0
06 Jul 2021
The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models
G. d'Eon
Jason d'Eon
J. R. Wright
Kevin Leyton-Brown
20
74
0
01 Jul 2021
The MultiBERTs: BERT Reproductions for Robustness Analysis
Thibault Sellam
Steve Yadlowsky
Jason W. Wei
Naomi Saphra
Alexander D'Amour
...
Iulia Turc
Jacob Eisenstein
Dipanjan Das
Ian Tenney
Ellie Pavlick
22
93
0
30 Jun 2021
Randomness In Neural Network Training: Characterizing The Impact of Tooling
Donglin Zhuang
Xingyao Zhang
S. Song
Sara Hooker
17
75
0
22 Jun 2021
Disentangling Identifiable Features from Noisy Data with Structured Nonlinear ICA
Hermanni Hälvä
Sylvain Le Corff
Luc Lehéricy
Jonathan So
Yongjie Zhu
Elisabeth Gassiat
Aapo Hyvärinen
CML
29
64
0
17 Jun 2021
Controlling Neural Networks with Rule Representations
Sungyong Seo
Sercan Ö. Arik
Jinsung Yoon
Xiang Zhang
Kihyuk Sohn
Tomas Pfister
OOD
AI4CE
19
35
0
14 Jun 2021
Extracting Global Dynamics of Loss Landscape in Deep Learning Models
Mohammed Eslami
Hamed Eramian
Marcio Gameiro
W. Kalies
Konstantin Mischaikow
11
1
0
14 Jun 2021
Characterizing the risk of fairwashing
Ulrich Aivodji
Hiromi Arai
Sébastien Gambs
Satoshi Hara
18
27
0
14 Jun 2021
Ex uno plures: Splitting One Model into an Ensemble of Subnetworks
Zhilu Zhang
Vianne R. Gao
M. Sabuncu
UQCV
17
6
0
09 Jun 2021
Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions
Daniel Rosenberg
Itai Gat
Amir Feder
Roi Reichart
AAML
34
16
0
08 Jun 2021
Meta-Learning to Compositionally Generalize
Henry Conklin
Bailin Wang
Kenny Smith
Ivan Titov
OOD
26
73
0
08 Jun 2021
Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning
Zachary Nado
Neil Band
Mark Collier
Josip Djolonga
Michael W. Dusenberry
...
D. Sculley
Balaji Lakshminarayanan
Jasper Snoek
Y. Gal
Dustin Tran
UQCV
ELM
19
96
0
07 Jun 2021
Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference
Hai Hu
He Zhou
Zuoyu Tian
Yiwen Zhang
Yina Ma
Yanting Li
Yixin Nie
Kyle Richardson
19
11
0
07 Jun 2021