ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.03395
  4. Cited By
Underspecification Presents Challenges for Credibility in Modern Machine
  Learning
v1v2 (latest)

Underspecification Presents Challenges for Credibility in Modern Machine Learning

6 November 2020
Alexander DÁmour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
Alex Beutel
Christina W. Chen
Jonathan Deaton
Jacob Eisenstein
Matthew D. Hoffman
F. Hormozdiari
N. Houlsby
Shaobo Hou
Ghassen Jerfel
Alan Karthikesalingam
Mario Lucic
Yi-An Ma
Cory Y. McLean
Diana Mincu
A. Mitani
Andrea Montanari
Zachary Nado
Vivek Natarajan
Christopher Nielson
T. Osborne
R. Raman
K. Ramasamy
Rory Sayres
Jessica Schrouff
Martin G. Seneviratne
Shannon Sequeira
Harini Suresh
Victor Veitch
Max Vladymyrov
Xuezhi Wang
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Underspecification Presents Challenges for Credibility in Modern Machine Learning"

50 / 377 papers shown
Many Ways to be Right: Rashomon Sets for Concept-Based Neural Networks
Many Ways to be Right: Rashomon Sets for Concept-Based Neural Networks
Shihan Feng
Cheng Zhang
Michael Xi
Ethan Hsu
Lesia Semenova
Chudi Zhong
137
1
0
24 Nov 2025
SORTeD Rashomon Sets of Sparse Decision Trees: Anytime Enumeration
SORTeD Rashomon Sets of Sparse Decision Trees: Anytime Enumeration
Elif Arslan
J. G. M. van der Linden
Serge Hoogendoorn
Marco Rinaldi
Emir Demirović
132
0
0
05 Nov 2025
Accounting for Underspecification in Statistical Claims of Model Superiority
Accounting for Underspecification in Statistical Claims of Model Superiority
Thomas Sanchez
Pedro M. Gordaliza
Meritxell Bach Cuadra
94
0
0
04 Nov 2025
Hybrid Explanation-Guided Learning for Transformer-Based Chest X-Ray Diagnosis
Hybrid Explanation-Guided Learning for Transformer-Based Chest X-Ray Diagnosis
Shelley Zixin Shu
Haozhe Luo
Alexander Poellinger
Mauricio Reyes
ViTMedIm
169
0
0
14 Oct 2025
MaP: A Unified Framework for Reliable Evaluation of Pre-training Dynamics
MaP: A Unified Framework for Reliable Evaluation of Pre-training Dynamics
Jiapeng Wang
Changxin Tian
Kunlong Chen
Ziqi Liu
Jiaxin Mao
Wayne Xin Zhao
Zhiqiang Zhang
Jun Zhou
117
1
0
10 Oct 2025
Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization
Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization
Antoine Maier
Aude Maier
Tom David
131
0
0
03 Oct 2025
The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks
The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks
Arda Uzunoglu
Tianjian Li
Daniel Khashabi
178
0
0
30 Sep 2025
Probabilistic Runtime Verification, Evaluation and Risk Assessment of Visual Deep Learning Systems
Probabilistic Runtime Verification, Evaluation and Risk Assessment of Visual Deep Learning Systems
Birk Torpmann-Hagen
Pål Halvorsen
Michael A. Riegler
Dag Johansen
139
0
0
23 Sep 2025
KANO: Kolmogorov-Arnold Neural Operator
KANO: Kolmogorov-Arnold Neural Operator
Jin Lee
Ziming Liu
Xinling Yu
Yixuan Wang
Haewon Jeong
Murphy Yuezhen Niu
Zheng Zhang
237
1
0
20 Sep 2025
From Distributional to Quantile Neural Basis Models: the case of Electricity Price Forecasting
From Distributional to Quantile Neural Basis Models: the case of Electricity Price Forecasting
A. Brusaferri
Danial Ramin
A. Ballarino
AI4TS
123
0
0
17 Sep 2025
"A 6 or a 9?": Ensemble Learning Through the Multiplicity of Performant Models and Explanations
"A 6 or a 9?": Ensemble Learning Through the Multiplicity of Performant Models and ExplanationsACM Transactions on Knowledge Discovery from Data (TKDD), 2025
Gianlucca L. Zuin
Adriano Veloso
189
0
0
11 Sep 2025
ACE and Diverse Generalization via Selective Disagreement
ACE and Diverse Generalization via Selective Disagreement
Oliver Daniels
Stuart Armstrong
Alexandre Maranhao
Mahirah Fairuz Rahman
Benjamin M. Marlin
Rebecca Gorman
OODD
248
0
0
09 Sep 2025
On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study
On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study
Jacqueline Jil Vallon
William Overman
Wanqiao Xu
Neil Panjwani
Xi Ling
...
Geoffrey Sonn
Sandy Srinivas
E. Pollom
Mark K. Buyyounouski
Mohsen Bayati
151
1
0
04 Sep 2025
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
David Heineman
Valentin Hofmann
Ian H. Magnusson
Yuling Gu
Noah A. Smith
Hannaneh Hajishirzi
Kyle Lo
Jesse Dodge
ALM
168
6
0
18 Aug 2025
Grounding Natural Language for Multi-agent Decision-Making with Multi-agentic LLMs
Grounding Natural Language for Multi-agent Decision-Making with Multi-agentic LLMs
Dom Huh
P. Mohapatra
LLMAGLM&Ro
79
0
0
10 Aug 2025
Charting 15 years of progress in deep learning for speech emotion recognition: A replication study
Charting 15 years of progress in deep learning for speech emotion recognition: A replication study
Andreas Triantafyllopoulos
A. Batliner
B. Schuller
AI4TS
186
0
0
04 Aug 2025
Graph Lineages and Skeletal Graph Products
Graph Lineages and Skeletal Graph Products
Eric Mjolsness
Cory Braker Scott
AI4CE
192
0
0
31 Jul 2025
Observational Multiplicity
Observational Multiplicity
Erin E. George
Deanna Needell
Berk Ustun
157
1
0
30 Jul 2025
On Arbitrary Predictions from Equally Valid Models
On Arbitrary Predictions from Equally Valid Models
Sarah Lockfisch
Kristian Schwethelm
Martin Menten
R. Braren
Daniel Rueckert
Alexander Ziller
Georgios Kaissis
178
0
0
25 Jul 2025
What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models
What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models
Keyon Vafa
Peter G. Chang
Ashesh Rambachan
S. Mullainathan
690
23
0
09 Jul 2025
Selecting for Less Discriminatory Algorithms: A Relational Search Framework for Navigating Fairness-Accuracy Trade-offs in Practice
Selecting for Less Discriminatory Algorithms: A Relational Search Framework for Navigating Fairness-Accuracy Trade-offs in Practice
Hana Samad
Michael Akinwumi
Jameel Khan
Christoph Mügge-Durum
Emmanuel O. Ogundimu
225
1
0
02 Jun 2025
Be.FM: Open Foundation Models for Human Behavior
Be.FM: Open Foundation Models for Human Behavior
Yutong Xie
Zhuoheng Li
Xiyuan Wang
Yijun Pan
Qijia Liu
...
Xingjian Zhang
Jin Huang
Walter Yuan
Matthew O Jackson
Qiaozhu Mei
AI4CE
153
3
0
29 May 2025
Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects
Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects
Reva Schwartz
Rumman Chowdhury
Akash Kundu
Heather Frase
Marzieh Fadaee
...
Andrew Thompson
Maya Carlyle
Qinghua Lu
Matthew Holmes
Theodora Skeadas
387
7
0
24 May 2025
Small-to-Large Generalization: Data Influences Models Consistently Across Scale
Small-to-Large Generalization: Data Influences Models Consistently Across Scale
Alaa Khaddaj
Logan Engstrom
Aleksander Madry
TDIAI4CE
327
1
0
22 May 2025
What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts
What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts
Chenyang Yang
Y. Shi
Qianou Ma
Michael Xieyang Liu
Jane Hsieh
Tongshuang Wu
460
15
0
19 May 2025
Toward Adaptive Categories: Dimensional Governance for Agentic AI
Toward Adaptive Categories: Dimensional Governance for Agentic AI
Zeynep Engin
David Hand
331
5
0
16 May 2025
Perils of Label Indeterminacy: A Case Study on Prediction of Neurological Recovery After Cardiac Arrest
Perils of Label Indeterminacy: A Case Study on Prediction of Neurological Recovery After Cardiac ArrestConference on Fairness, Accountability and Transparency (FAccT), 2025
Jakob Schoeffer
Maria De-Arteaga
Jonathan Elmer
988
2
0
05 Apr 2025
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training RunsInternational Conference on Learning Representations (ICLR), 2025
Oskar van der Wal
Pietro Lesci
Max Muller-Eberstein
Naomi Saphra
Hailey Schoelkopf
Willem H. Zuidema
Stella Biderman
LRM
428
19
0
12 Mar 2025
The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition
The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour RecognitionComputer Vision and Pattern Recognition (CVPR), 2025
Otto Brookes
Maksim Kukushkin
Majid Mirmehdi
Colleen Stephens
Paula Dieguez
...
Lukas Boesch
Thomas Schmid
M. Arandjelovic
H. Kühl
T. Burghardt
325
2
0
28 Feb 2025
Societal Alignment Frameworks Can Improve LLM Alignment
Karolina Stañczak
Nicholas Meade
Mehar Bhatia
Hattie Zhou
Konstantin Böttinger
...
Timothy P. Lillicrap
Ana Marasović
Sylvie Delacroix
Gillian K. Hadfield
Siva Reddy
1.0K
5
0
27 Feb 2025
Random Scaling of Emergent Capabilities
Random Scaling of Emergent Capabilities
Rosie Zhao
Tian Qin
David Alvarez-Melis
Sham Kakade
Sham Kakade
LRM
444
2
0
24 Feb 2025
Machine Learning Should Maximize Welfare, but Not by (Only) Maximizing Accuracy
Machine Learning Should Maximize Welfare, but Not by (Only) Maximizing Accuracy
Nir Rosenfeld
Haifeng Xu
FaMLHAI
363
2
0
17 Feb 2025
Be Intentional About Fairness!: Fairness, Size, and Multiplicity in the Rashomon Set
Gordon Dai
Pavan Ravishankar
Rachel Yuan
Daniel B. Neill
Emily Black
204
11
0
28 Jan 2025
The Curious Case of Arbitrariness in Machine Learning
Prakhar Ganesh
Afaf Taik
G. Farnadi
425
6
0
28 Jan 2025
Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction
Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction
P. Melki
Lionel Bombrun
Boubacar Diallo
Jérôme Dias
Jean-Pierre da Costa
214
1
0
13 Jan 2025
Test-Time Alignment via Hypothesis Reweighting
Test-Time Alignment via Hypothesis Reweighting
Yoonho Lee
Jonathan Williams
Henrik Marklund
Archit Sharma
E. Mitchell
Anikait Singh
Chelsea Finn
317
8
0
11 Dec 2024
Attuned to Change: Causal Fine-Tuning under Latent-Confounded Shifts
Attuned to Change: Causal Fine-Tuning under Latent-Confounded Shifts
Jialin Yu
Yuxiang Zhou
Yulan He
Nevin L. Zhang
Ricardo Silva
Philip Torr
Ricardo M. A. Silva
423
0
0
18 Oct 2024
From Transparency to Accountability and Back: A Discussion of Access and
  Evidence in AI Auditing
From Transparency to Accountability and Back: A Discussion of Access and Evidence in AI AuditingConference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2024
Sarah H. Cen
Rohan Alur
296
11
0
07 Oct 2024
Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Ann Huang
Satpreet H. Singh
Flavio Martinelli
Kanaka Rajan
428
7
0
04 Oct 2024
OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?
OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?
Liangze Jiang
Damien Teney
OODDOOD
587
2
0
03 Oct 2024
Perceptions of the Fairness Impacts of Multiplicity in Machine Learning
Perceptions of the Fairness Impacts of Multiplicity in Machine LearningInternational Conference on Human Factors in Computing Systems (CHI), 2024
Anna P. Meyer
Yea-Seul Kim
Aws Albarghouthi
Loris DÁntoni
FaML
161
7
0
18 Sep 2024
Self-Supervised Learning for Building Robust Pediatric Chest X-ray
  Classification Models
Self-Supervised Learning for Building Robust Pediatric Chest X-ray Classification Models
Sheng Cheng
Zbigniew A. Starosolski
Devika Subramanian
SSL
293
0
0
30 Aug 2024
UTrack: Multi-Object Tracking with Uncertain Detections
UTrack: Multi-Object Tracking with Uncertain Detections
Edgardo Solano-Carrillo
Felix Sattler
Antje Alex
Alexander Klein
Bruno Pereira Costa
Ángel Bueno Rodríguez
Jannis Stoppe
VOT
331
5
0
30 Aug 2024
Can Optimization Trajectories Explain Multi-Task Transfer?
Can Optimization Trajectories Explain Multi-Task Transfer?
David Mueller
Mark Dredze
Nicholas Andrews
428
2
0
26 Aug 2024
Assessing Robustness of Machine Learning Models using Covariate
  Perturbations
Assessing Robustness of Machine Learning Models using Covariate Perturbations
Arun Prakash
A. Bhattacharyya
Eric Heim
Vijayan N. Nair Model
OODAAML
161
1
0
02 Aug 2024
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft
  Agent
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent
Karolis Jucys
George Adamopoulos
Mehrab Hamidi
Stephanie Milani
Mohammad Reza Samsami
Artem Zholus
Sonia Joseph
Blake A. Richards
Irina Rish
Özgür Simsek
311
4
0
16 Jul 2024
Amazing Things Come From Having Many Good Models
Amazing Things Come From Having Many Good Models
Cynthia Rudin
Chudi Zhong
Lesia Semenova
Margo Seltzer
Ronald E. Parr
Jiachang Liu
Srikar Katta
Jon Donnelly
Harry Chen
Zachery Boner
312
62
0
05 Jul 2024
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept
  Space
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space
Core Francisco Park
Maya Okawa
Andrew Lee
Ekdeep Singh Lubana
Hidenori Tanaka
406
30
0
27 Jun 2024
Aligning Model Properties via Conformal Risk Control
Aligning Model Properties via Conformal Risk Control
William Overman
Jacqueline Jil Vallon
Mohsen Bayati
239
7
0
26 Jun 2024
Tree-based variational inference for Poisson log-normal models
Tree-based variational inference for Poisson log-normal models
Alexandre Chaussard
Anna Bonnet
Elisabeth Gassiat
Sylvain Le Corff
357
3
0
25 Jun 2024
12345678
Next
Page 1 of 8