Underspecification Presents Challenges for Credibility in Modern Machine Learning

6 November 2020

Matthew D. Hoffman

Alan Karthikesalingam

Martin G. Seneviratne

Papers citing "Underspecification Presents Challenges for Credibility in Modern Machine Learning"

50 / 377 papers shown

Many Ways to be Right: Rashomon Sets for Concept-Based Neural Networks

161

24 Nov 2025

SORTeD Rashomon Sets of Sparse Decision Trees: Anytime Enumeration

Elif Arslan

J. G. M. van der Linden

Serge Hoogendoorn

Marco Rinaldi

Emir Demirović

149

05 Nov 2025

Accounting for Underspecification in Statistical Claims of Model Superiority

Thomas Sanchez

Pedro M. Gordaliza

Meritxell Bach Cuadra

121

04 Nov 2025

Hybrid Explanation-Guided Learning for Transformer-Based Chest X-Ray Diagnosis

179

14 Oct 2025

MaP: A Unified Framework for Reliable Evaluation of Pre-training Dynamics

147

10 Oct 2025

Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization

Antoine Maier

Aude Maier

Tom David

158

03 Oct 2025

The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks

Arda Uzunoglu

Tianjian Li

Daniel Khashabi

203

30 Sep 2025

Probabilistic Runtime Verification, Evaluation and Risk Assessment of Visual Deep Learning Systems

157

23 Sep 2025

KANO: Kolmogorov-Arnold Neural Operator

270

20 Sep 2025

From Distributional to Quantile Neural Basis Models: the case of Electricity Price Forecasting

174

17 Sep 2025

"A 6 or a 9?": Ensemble Learning Through the Multiplicity of Performant Models and Explanations

ACM Transactions on Knowledge Discovery from Data (TKDD), 2025

Gianlucca L. Zuin

Adriano Veloso

199

11 Sep 2025

ACE and Diverse Generalization via Selective Disagreement

Oliver Daniels

Stuart Armstrong

Alexandre Maranhao

Mahirah Fairuz Rahman

Benjamin M. Marlin

Rebecca Gorman

OODD

268

09 Sep 2025

On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study

Jacqueline Jil Vallon

...

196

04 Sep 2025

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation

268

18 Aug 2025

Grounding Natural Language for Multi-agent Decision-Making with Multi-agentic LLMs

Dom Huh

P. Mohapatra

LLMAG LM&Ro

10 Aug 2025

Charting 15 years of progress in deep learning for speech emotion recognition: A replication study

Andreas Triantafyllopoulos

A. Batliner

B. Schuller

AI4TS

201

04 Aug 2025

Graph Lineages and Skeletal Graph Products

Eric Mjolsness

Cory Braker Scott

AI4CE

235

31 Jul 2025

Observational Multiplicity

Erin E. George

Deanna Needell

Berk Ustun

197

30 Jul 2025

On Arbitrary Predictions from Equally Valid Models

203

25 Jul 2025

What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models

755

09 Jul 2025

Selecting for Less Discriminatory Algorithms: A Relational Search Framework for Navigating Fairness-Accuracy Trade-offs in Practice

Hana Samad

Michael Akinwumi

Jameel Khan

Christoph Mügge-Durum

Emmanuel O. Ogundimu

277

02 Jun 2025

Be.FM: Open Foundation Models for Human Behavior

...

181

29 May 2025

Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects

...

466

24 May 2025

Small-to-Large Generalization: Data Influences Models Consistently Across Scale

359

22 May 2025

What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts

487

19 May 2025

Toward Adaptive Categories: Dimensional Governance for Agentic AI

Zeynep Engin

David Hand

368

16 May 2025

Perils of Label Indeterminacy: A Case Study on Prediction of Neurological Recovery After Cardiac Arrest

Conference on Fairness, Accountability and Transparency (FAccT), 2025

Jakob Schoeffer

Maria De-Arteaga

Jonathan Elmer

1.0K

05 Apr 2025

PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs

International Conference on Learning Representations (ICLR), 2025

456

12 Mar 2025

The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition

Computer Vision and Pattern Recognition (CVPR), 2025

...

391

28 Feb 2025

Societal Alignment Frameworks Can Improve LLM Alignment

...

1.1K

27 Feb 2025

Random Scaling of Emergent Capabilities

489

24 Feb 2025

Machine Learning Should Maximize Welfare, but Not by (Only) Maximizing Accuracy

Nir Rosenfeld

Haifeng Xu

FaML HAI

391

17 Feb 2025

Be Intentional About Fairness!: Fairness, Size, and Multiplicity in the Rashomon Set

213

28 Jan 2025

The Curious Case of Arbitrariness in Machine Learning

Prakhar Ganesh

Afaf Taik

G. Farnadi

431

28 Jan 2025

Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction

236

13 Jan 2025

Test-Time Alignment via Hypothesis Reweighting

333

11 Dec 2024

Attuned to Change: Causal Fine-Tuning under Latent-Confounded Shifts

453

18 Oct 2024

From Transparency to Accountability and Back: A Discussion of Access and Evidence in AI Auditing

Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO), 2024

Sarah H. Cen

Rohan Alur

322

07 Oct 2024

Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks

452

04 Oct 2024

OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?

Liangze Jiang

Damien Teney

OODD OOD

622

03 Oct 2024

Perceptions of the Fairness Impacts of Multiplicity in Machine Learning

International Conference on Human Factors in Computing Systems (CHI), 2024

Anna P. Meyer

Yea-Seul Kim

Aws Albarghouthi

Loris D'Antoni

FaML

182

18 Sep 2024

Self-Supervised Learning for Building Robust Pediatric Chest X-ray Classification Models

Sheng Cheng

Zbigniew A. Starosolski

Devika Subramanian

SSL

318

30 Aug 2024

UTrack: Multi-Object Tracking with Uncertain Detections

Edgardo Solano-Carrillo

Ángel Bueno Rodríguez

Jannis Stoppe

VOT

361

30 Aug 2024

Can Optimization Trajectories Explain Multi-Task Transfer?

David Mueller

Mark Dredze

Nicholas Andrews

459

26 Aug 2024

Assessing Robustness of Machine Learning Models using Covariate Perturbations

Arun Prakash

A. Bhattacharyya

Eric Heim

Vijayan N. Nair

OOD AAML

171

02 Aug 2024

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent

Mohammad Reza Samsami

344

16 Jul 2024

Amazing Things Come From Having Many Good Models

Cynthia Rudin

Jon Donnelly

365

05 Jul 2024

Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space

460

27 Jun 2024

Aligning Model Properties via Conformal Risk Control

William Overman

Jacqueline Jil Vallon

Mohsen Bayati

264

26 Jun 2024

Tree-based variational inference for Poisson log-normal models

386

25 Jun 2024