ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.03395
  4. Cited By
Underspecification Presents Challenges for Credibility in Modern Machine
  Learning

Underspecification Presents Challenges for Credibility in Modern Machine Learning

6 November 2020
Alexander DÁmour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
Alex Beutel
Christina W. Chen
Jonathan Deaton
Jacob Eisenstein
Matthew D. Hoffman
F. Hormozdiari
N. Houlsby
Shaobo Hou
Ghassen Jerfel
Alan Karthikesalingam
Mario Lucic
Yi-An Ma
Cory Y. McLean
Diana Mincu
A. Mitani
Andrea Montanari
Zachary Nado
Vivek Natarajan
Christopher Nielson
T. Osborne
R. Raman
K. Ramasamy
Rory Sayres
Jessica Schrouff
Martin G. Seneviratne
Shannon Sequeira
Harini Suresh
Victor Veitch
Max Vladymyrov
Xuezhi Wang
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
    OffRL
ArXivPDFHTML

Papers citing "Underspecification Presents Challenges for Credibility in Modern Machine Learning"

50 / 351 papers shown
Title
Mitigating and Evaluating Static Bias of Action Representations in the
  Background and the Foreground
Mitigating and Evaluating Static Bias of Action Representations in the Background and the Foreground
Haoxin Li
Yuan Liu
Hanwang Zhang
Boyang Li
25
15
0
23 Nov 2022
ModelDiff: A Framework for Comparing Learning Algorithms
ModelDiff: A Framework for Comparing Learning Algorithms
Harshay Shah
Sung Min Park
Andrew Ilyas
A. Madry
SyDa
46
26
0
22 Nov 2022
Instability in clinical risk stratification models using deep learning
Instability in clinical risk stratification models using deep learning
D. Martinez
A. Yakubovich
Martin G. Seneviratne
Á. Lelkes
Akshit Tyagi
...
N. L. Downing
Ron C. Li
Keith Morse
N. Shah
Ming-Jun Chen
OOD
14
2
0
20 Nov 2022
Mechanistic Mode Connectivity
Mechanistic Mode Connectivity
Ekdeep Singh Lubana
Eric J. Bigelow
Robert P. Dick
David M. Krueger
Hidenori Tanaka
27
45
0
15 Nov 2022
Capabilities for Better ML Engineering
Capabilities for Better ML Engineering
Chenyang Yang
Rachel A. Brower-Sinning
Grace A. Lewis
Christian Kastner
Tongshuang Wu
19
3
0
11 Nov 2022
Deep Learning based Computer Vision Methods for Complex Traffic
  Environments Perception: A Review
Deep Learning based Computer Vision Methods for Complex Traffic Environments Perception: A Review
Talha Azfar
Jinlong Li
Hongkai Yu
R. Cheu
Yisheng Lv
Ruimin Ke
20
21
0
09 Nov 2022
Dealing with Drift of Adaptation Spaces in Learning-based Self-Adaptive
  Systems using Lifelong Self-Adaptation
Dealing with Drift of Adaptation Spaces in Learning-based Self-Adaptive Systems using Lifelong Self-Adaptation
Omid Gheibi
Danny Weyns
8
3
0
04 Nov 2022
PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales
PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales
Peifeng Wang
Aaron Chan
Filip Ilievski
Muhao Chen
Xiang Ren
LRM
ReLM
21
59
0
03 Nov 2022
Debiasing Masks: A New Framework for Shortcut Mitigation in NLU
Debiasing Masks: A New Framework for Shortcut Mitigation in NLU
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
AAML
31
16
0
28 Oct 2022
LMPriors: Pre-Trained Language Models as Task-Specific Priors
LMPriors: Pre-Trained Language Models as Task-Specific Priors
Kristy Choi
Chris Cundy
Sanjari Srivastava
Stefano Ermon
BDL
48
36
0
22 Oct 2022
Exploring Predictive Uncertainty and Calibration in NLP: A Study on the
  Impact of Method & Data Scarcity
Exploring Predictive Uncertainty and Calibration in NLP: A Study on the Impact of Method & Data Scarcity
Dennis Ulmer
J. Frellsen
Christian Hardmeier
179
22
0
20 Oct 2022
Machine Learning for a Sustainable Energy Future
Machine Learning for a Sustainable Energy Future
Zhenpeng Yao
Yanwei Lum
Andrew K. Johnston
L. M. Mejia-Mendoza
Xiaoxia Zhou
Yonggang Wen
Alán Aspuru-Guzik
E. Sargent
Z. Seh
6
209
0
19 Oct 2022
Predicting Fine-Tuning Performance with Probing
Predicting Fine-Tuning Performance with Probing
Zining Zhu
Soroosh Shahtalebi
Frank Rudzicz
26
9
0
13 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks
Underspecification in Scene Description-to-Depiction Tasks
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
DiffM
66
32
0
11 Oct 2022
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut
  Learning in VQA
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Q. Si
Fandong Meng
Mingyu Zheng
Zheng Lin
Yuanxin Liu
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
24
20
0
10 Oct 2022
Goal Misgeneralization: Why Correct Specifications Aren't Enough For
  Correct Goals
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals
Rohin Shah
Vikrant Varma
Ramana Kumar
Mary Phuong
Victoria Krakovna
J. Uesato
Zachary Kenton
19
67
0
04 Oct 2022
Underspecification in Language Modeling Tasks: A Causality-Informed
  Study of Gendered Pronoun Resolution
Underspecification in Language Modeling Tasks: A Causality-Informed Study of Gendered Pronoun Resolution
Emily McMilin
13
0
0
30 Sep 2022
Fairness and robustness in anti-causal prediction
Fairness and robustness in anti-causal prediction
Maggie Makar
Alexander DÁmour
OOD
27
10
0
20 Sep 2022
Measuring Interventional Robustness in Reinforcement Learning
Measuring Interventional Robustness in Reinforcement Learning
Katherine Avery
Jack Kenney
Pracheta Amaranath
Erica Cai
David D. Jensen
13
0
0
19 Sep 2022
Exploring the Whole Rashomon Set of Sparse Decision Trees
Exploring the Whole Rashomon Set of Sparse Decision Trees
Rui Xin
Chudi Zhong
Zhi Chen
Takuya Takagi
Margo Seltzer
Cynthia Rudin
33
53
0
16 Sep 2022
On the Factory Floor: ML Engineering for Industrial-Scale Ads
  Recommendation Models
On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models
Rohan Anil
S. Gadanho
Danya Huang
Nijith Jacob
Zhuoshu Li
...
Cristina Pop
Kevin Regan
G. Shamir
Rakesh Shivanna
Qiqi Yan
3DV
8
41
0
12 Sep 2022
Bias Challenges in Counterfactual Data Augmentation
Bias Challenges in Counterfactual Data Augmentation
S Chandra Mouli
Yangze Zhou
Bruno Ribeiro
CML
OOD
OODD
37
4
0
12 Sep 2022
Reconciling Individual Probability Forecasts
Reconciling Individual Probability Forecasts
Aaron Roth
A. Tolbert
S. Weinstein
14
14
0
04 Sep 2022
ID and OOD Performance Are Sometimes Inversely Correlated on Real-world
  Datasets
ID and OOD Performance Are Sometimes Inversely Correlated on Real-world Datasets
Damien Teney
Yong Lin
Seong Joon Oh
Ehsan Abbasnejad
OOD
362
47
0
01 Sep 2022
Gaussian Process Surrogate Models for Neural Networks
Gaussian Process Surrogate Models for Neural Networks
Michael Y. Li
Erin Grant
Thomas L. Griffiths
BDL
SyDa
30
7
0
11 Aug 2022
Quality Not Quantity: On the Interaction between Dataset Design and
  Robustness of CLIP
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP
Thao Nguyen
Gabriel Ilharco
Mitchell Wortsman
Sewoong Oh
Ludwig Schmidt
CLIP
VLM
38
97
0
10 Aug 2022
Algorithmic Fairness in Business Analytics: Directions for Research and
  Practice
Algorithmic Fairness in Business Analytics: Directions for Research and Practice
Maria De-Arteaga
Stefan Feuerriegel
M. Saar-Tsechansky
FaML
14
42
0
22 Jul 2022
Detecting Shortcut Learning for Fair Medical AI using Shortcut Testing
Detecting Shortcut Learning for Fair Medical AI using Shortcut Testing
Alex Brown
Nenad Tomašev
Jan Freyberg
Yuan Liu
Alan Karthikesalingam
Jessica Schrouff
8
50
0
21 Jul 2022
The Birth of Bias: A case study on the evolution of gender bias in an
  English language model
The Birth of Bias: A case study on the evolution of gender bias in an English language model
Oskar van der Wal
Jaap Jumelet
K. Schulz
Willem H. Zuidema
24
16
0
21 Jul 2022
Assaying Out-Of-Distribution Generalization in Transfer Learning
Assaying Out-Of-Distribution Generalization in Transfer Learning
F. Wenzel
Andrea Dittadi
Peter V. Gehler
Carl-Johann Simon-Gabriel
Max Horn
...
Chris Russell
Thomas Brox
Bernt Schiele
Bernhard Schölkopf
Francesco Locatello
OOD
OODD
AAML
49
71
0
19 Jul 2022
Selection Bias Induced Spurious Correlations in Large Language Models
Selection Bias Induced Spurious Correlations in Large Language Models
Emily McMilin
20
7
0
18 Jul 2022
Segmenting white matter hyperintensities on isotropic three-dimensional
  Fluid Attenuated Inversion Recovery magnetic resonance images: Assessing deep
  learning tools on norwegian imaging database
Segmenting white matter hyperintensities on isotropic three-dimensional Fluid Attenuated Inversion Recovery magnetic resonance images: Assessing deep learning tools on norwegian imaging database
M. Røvang
P. Selnes
B. MacIntosh
I. Groote
Lene Paalhaugen
Sudre Carole
T. Fladby
A. Bjørnerud
14
1
0
18 Jul 2022
Plex: Towards Reliability using Pretrained Large Model Extensions
Plex: Towards Reliability using Pretrained Large Model Extensions
Dustin Tran
J. Liu
Michael W. Dusenberry
Du Phan
Mark Collier
...
D. Sculley
Y. Gal
Zoubin Ghahramani
Jasper Snoek
Balaji Lakshminarayanan
VLM
23
124
0
15 Jul 2022
PIAT: Physics Informed Adversarial Training for Solving Partial
  Differential Equations
PIAT: Physics Informed Adversarial Training for Solving Partial Differential Equations
S. Shekarpaz
Mohammad Azizmalayeri
M. Rohban
15
4
0
14 Jul 2022
Predicting is not Understanding: Recognizing and Addressing
  Underspecification in Machine Learning
Predicting is not Understanding: Recognizing and Addressing Underspecification in Machine Learning
Damien Teney
Maxime Peyrard
Ehsan Abbasnejad
25
29
0
06 Jul 2022
Counterbalancing Teacher: Regularizing Batch Normalized Models for
  Robustness
Counterbalancing Teacher: Regularizing Batch Normalized Models for Robustness
Saeid Asgari Taghanaki
A. Gholami
Fereshte Khani
Kristy Choi
Linh-Tam Tran
Ran Zhang
Aliasghar Khani
4
0
0
04 Jul 2022
Auditing Visualizations: Transparency Methods Struggle to Detect
  Anomalous Behavior
Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior
Jean-Stanislas Denain
Jacob Steinhardt
AAML
15
7
0
27 Jun 2022
Gated Domain Units for Multi-source Domain Generalization
Gated Domain Units for Multi-source Domain Generalization
Simon Foll
Alina Dubatovka
Eugen Ernst
Siu Lun Chau
Martin Maritsch
Patrik Okanovic
Gudrun Thater
J. M. Buhmann
Felix Wortmann
Krikamol Muandet
OOD
33
3
0
24 Jun 2022
On Specifying for Trustworthiness
On Specifying for Trustworthiness
Dhaminda B. Abeywickrama
A. Bennaceur
Greg Chance
Y. Demiris
Anastasia Kordoni
...
S. Ramamoorthy
Jan Oliver Ringert
James Wilson
Shane Windsor
Kerstin Eder
14
19
0
22 Jun 2022
Performance Prediction Under Dataset Shift
Performance Prediction Under Dataset Shift
Simona Maggio
Victor Bouvier
L. Dreyfus-Schmidt
OOD
AI4TS
16
2
0
21 Jun 2022
Identifiability of deep generative models without auxiliary information
Identifiability of deep generative models without auxiliary information
Bohdan Kivva
Goutham Rajendran
Pradeep Ravikumar
Bryon Aragam
DRL
18
48
0
20 Jun 2022
Disentangling Model Multiplicity in Deep Learning
Disentangling Model Multiplicity in Deep Learning
Ari Heljakka
Martin Trapp
Juho Kannala
Arno Solin
18
4
0
17 Jun 2022
Efficiently Training Low-Curvature Neural Networks
Efficiently Training Low-Curvature Neural Networks
Suraj Srinivas
Kyle Matoba
Himabindu Lakkaraju
F. Fleuret
AAML
23
15
0
14 Jun 2022
Invariant Structure Learning for Better Generalization and Causal
  Explainability
Invariant Structure Learning for Better Generalization and Causal Explainability
Yunhao Ge
Sercan Ö. Arik
Jinsung Yoon
Ao Xu
Laurent Itti
Tomas Pfister
OOD
CML
18
2
0
13 Jun 2022
OOD Augmentation May Be at Odds with Open-Set Recognition
OOD Augmentation May Be at Odds with Open-Set Recognition
Mohammad Azizmalayeri
M. Rohban
6
9
0
09 Jun 2022
Certifying Data-Bias Robustness in Linear Regression
Certifying Data-Bias Robustness in Linear Regression
Anna P. Meyer
Aws Albarghouthi
Loris Dántoni
27
3
0
07 Jun 2022
Metrics reloaded: Recommendations for image analysis validation
Metrics reloaded: Recommendations for image analysis validation
Lena Maier-Hein
Annika Reinke
Patrick Godau
M. Tizabi
Florian Buettner
...
Aleksei Tiulpin
Sotirios A. Tsaftaris
Ben Van Calster
Gaël Varoquaux
Paul F. Jäger
22
214
0
03 Jun 2022
Generalization for multiclass classification with overparameterized
  linear models
Generalization for multiclass classification with overparameterized linear models
Vignesh Subramanian
Rahul Arya
A. Sahai
AI4CE
19
9
0
03 Jun 2022
Rashomon Capacity: A Metric for Predictive Multiplicity in
  Classification
Rashomon Capacity: A Metric for Predictive Multiplicity in Classification
Hsiang Hsu
Flavio du Pin Calmon
17
38
0
02 Jun 2022
Predictive Multiplicity in Probabilistic Classification
Predictive Multiplicity in Probabilistic Classification
J. Watson-Daniels
David C. Parkes
Berk Ustun
9
38
0
02 Jun 2022
Previous
12345678
Next