Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.03395
Cited By
Underspecification Presents Challenges for Credibility in Modern Machine Learning
6 November 2020
Alexander DÁmour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
Alex Beutel
Christina W. Chen
Jonathan Deaton
Jacob Eisenstein
Matthew D. Hoffman
F. Hormozdiari
N. Houlsby
Shaobo Hou
Ghassen Jerfel
Alan Karthikesalingam
Mario Lucic
Yi-An Ma
Cory Y. McLean
Diana Mincu
A. Mitani
Andrea Montanari
Zachary Nado
Vivek Natarajan
Christopher Nielson
T. Osborne
R. Raman
K. Ramasamy
Rory Sayres
Jessica Schrouff
Martin G. Seneviratne
Shannon Sequeira
Harini Suresh
Victor Veitch
Max Vladymyrov
Xuezhi Wang
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Underspecification Presents Challenges for Credibility in Modern Machine Learning"
50 / 351 papers shown
Title
Perils of Label Indeterminacy: A Case Study on Prediction of Neurological Recovery After Cardiac Arrest
Jakob Schoeffer
Maria De-Arteaga
Jonathan Elmer
92
0
0
05 Apr 2025
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Oskar van der Wal
Pietro Lesci
Max Muller-Eberstein
Naomi Saphra
Hailey Schoelkopf
Willem H. Zuidema
Stella Biderman
LRM
56
0
0
12 Mar 2025
The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition
Otto Brookes
Maksim Kukushkin
Majid Mirmehdi
Colleen Stephens
Paula Dieguez
...
Lukas Boesch
Thomas Schmid
M. Arandjelovic
H. Kühl
T. Burghardt
46
0
0
28 Feb 2025
Societal Alignment Frameworks Can Improve LLM Alignment
Karolina Stañczak
Nicholas Meade
Mehar Bhatia
Hattie Zhou
Konstantin Böttinger
...
Timothy P. Lillicrap
Ana Marasović
Sylvie Delacroix
Gillian K. Hadfield
Siva Reddy
107
0
0
27 Feb 2025
Distributional Scaling Laws for Emergent Capabilities
Rosie Zhao
Tian Qin
David Alvarez-Melis
Sham Kakade
Naomi Saphra
LRM
37
0
0
24 Feb 2025
Less is More for Synthetic Speech Detection in the Wild
Ashi Garg
Zexin Cai
Henry Li Xinyuan
Leibny Paola García-Perera
Kevin Duh
Sanjeev Khudanpur
Matthew Wiesner
Nicholas Andrews
74
0
0
17 Feb 2025
Machine Learning Should Maximize Welfare, Not (Only) Accuracy
Nir Rosenfeld
Haifeng Xu
HAI
FaML
71
1
0
17 Feb 2025
Be Intentional About Fairness!: Fairness, Size, and Multiplicity in the Rashomon Set
Gordon Dai
Pavan Ravishankar
Rachel Yuan
Daniel B. Neill
Emily Black
34
0
0
28 Jan 2025
The Curious Case of Arbitrariness in Machine Learning
Prakhar Ganesh
Afaf Taik
G. Farnadi
59
2
0
28 Jan 2025
Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction
P. Melki
Lionel Bombrun
Boubacar Diallo
Jérôme Dias
Jean-Pierre da Costa
41
0
0
13 Jan 2025
Test-Time Alignment via Hypothesis Reweighting
Yoonho Lee
Jonathan Williams
Henrik Marklund
Archit Sharma
E. Mitchell
Anikait Singh
Chelsea Finn
91
3
0
11 Dec 2024
Fine-Tuning Pre-trained Language Models for Robust Causal Representation Learning
Jialin Yu
Yuxiang Zhou
Yulan He
Nevin L. Zhang
Ricardo Silva
31
0
0
18 Oct 2024
From Transparency to Accountability and Back: A Discussion of Access and Evidence in AI Auditing
Sarah H. Cen
Rohan Alur
19
1
0
07 Oct 2024
Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Ann Huang
Satpreet H. Singh
Kanaka Rajan
14
0
0
04 Oct 2024
OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?
Liangze Jiang
Damien Teney
OODD
OOD
28
1
0
03 Oct 2024
Perceptions of the Fairness Impacts of Multiplicity in Machine Learning
Anna P. Meyer
Yea-Seul Kim
Aws Albarghouthi
Loris DÁntoni
FaML
24
1
0
18 Sep 2024
Self-Supervised Learning for Building Robust Pediatric Chest X-ray Classification Models
Sheng Cheng
Zbigniew A. Starosolski
Devika Subramanian
SSL
29
0
0
30 Aug 2024
UTrack: Multi-Object Tracking with Uncertain Detections
Edgardo Solano-Carrillo
Felix Sattler
Antje Alex
Alexander Klein
Bruno Pereira Costa
Ángel Bueno Rodríguez
Jannis Stoppe
VOT
32
1
0
30 Aug 2024
Can Optimization Trajectories Explain Multi-Task Transfer?
David Mueller
Mark Dredze
Nicholas Andrews
53
1
0
26 Aug 2024
Assessing Robustness of Machine Learning Models using Covariate Perturbations
Arun Prakash
A. Bhattacharyya
Eric Heim
Vijayan N. Nair Model
OOD
AAML
31
1
0
02 Aug 2024
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent
Karolis Jucys
George Adamopoulos
Mehrab Hamidi
Stephanie Milani
Mohammad Reza Samsami
Artem Zholus
Sonia Joseph
Blake A. Richards
Irina Rish
Özgür Simsek
34
2
0
16 Jul 2024
Amazing Things Come From Having Many Good Models
Cynthia Rudin
Chudi Zhong
Lesia Semenova
Margo Seltzer
Ronald E. Parr
Jiachang Liu
Srikar Katta
Jon Donnelly
Harry Chen
Zachery Boner
26
23
0
05 Jul 2024
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space
Core Francisco Park
Maya Okawa
Andrew Lee
Ekdeep Singh Lubana
Hidenori Tanaka
52
7
0
27 Jun 2024
Aligning Model Properties via Conformal Risk Control
William Overman
Jacqueline Jil Vallon
Mohsen Bayati
33
2
0
26 Jun 2024
Tree-based variational inference for Poisson log-normal models
Alexandre Chaussard
Anna Bonnet
Elisabeth Gassiat
Sylvain Le Corff
22
0
0
25 Jun 2024
MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data
Renqing Luo
Yuhan Xu
33
0
0
24 Jun 2024
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding
Ukyo Honda
Tatsushi Oka
Peinan Zhang
Masato Mita
42
1
0
17 Jun 2024
Teleporter Theory: A General and Simple Approach for Modeling Cross-World Counterfactual Causality
Jiangmeng Li
Bin Qin
Qirui Ji
Yi Li
Wenwen Qiang
Jianwen Cao
Fanjiang Xu
44
0
0
17 Jun 2024
Management Decisions in Manufacturing using Causal Machine Learning -- To Rework, or not to Rework?
Philipp Schwarz
Oliver Schacht
Sven Klaassen
Daniel Grünbaum
Sebastian Imhof
Martin Spindler
CML
17
0
0
17 Jun 2024
How Does Distribution Matching Help Domain Generalization: An Information-theoretic Analysis
Yuxin Dong
Tieliang Gong
Hong Chen
Shuangyong Song
Weizhan Zhang
Chen Li
OOD
37
0
0
14 Jun 2024
The Penalized Inverse Probability Measure for Conformal Classification
P. Melki
Lionel Bombrun
Boubacar Diallo
Jérôme Dias
Jean-Pierre da Costa
37
2
0
13 Jun 2024
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition
Andreas Triantafyllopoulos
A. Batliner
Simon Rampp
M. Milling
Björn Schuller
VLM
18
0
0
10 Jun 2024
On Affine Homotopy between Language Encoders
Robin SM Chan
Reda Boumasmoud
Anej Svete
Yuxin Ren
Qipeng Guo
...
Shauli Ravfogel
Mrinmaya Sachan
Bernhard Schölkopf
Mennatallah El-Assady
Ryan Cotterell
38
3
0
04 Jun 2024
Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities
G. Farnadi
Mohammad Havaei
Negar Rostamzadeh
32
2
0
03 Jun 2024
Reconciling Model Multiplicity for Downstream Decision Making
Ally Yalei Du
Dung Daniel Ngo
Zhiwei Steven Wu
21
5
0
30 May 2024
AI Risk Management Should Incorporate Both Safety and Security
Xiangyu Qi
Yangsibo Huang
Yi Zeng
Edoardo Debenedetti
Jonas Geiping
...
Chaowei Xiao
Bo-wen Li
Dawn Song
Peter Henderson
Prateek Mittal
AAML
43
10
0
29 May 2024
The Cost of Arbitrariness for Individuals: Examining the Legal and Technical Challenges of Model Multiplicity
Prakhar Ganesh
Ihsan Ibrahim Daldaban
Ignacio Cofone
G. Farnadi
54
2
0
28 May 2024
Learning from Uncertain Data: From Possible Worlds to Possible Models
Jiongli Zhu
Su Feng
Boris Glavic
Babak Salimi
14
0
0
28 May 2024
Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables
James Hinns
David Martens
41
2
0
24 May 2024
Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural Networks
Jingchi Jiang
Rujia Shen
Boran Wang
Yi Guan
OffRL
BDL
26
1
0
23 May 2024
Agent Design Pattern Catalogue: A Collection of Architectural Patterns for Foundation Model based Agents
Yue Liu
Sin Kit Lo
Qinghua Lu
Liming Zhu
Dehai Zhao
Xiwei Xu
Stefan Harrer
Jon Whittle
LLMAG
AI4CE
25
10
0
16 May 2024
Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data
Leonhard Hennicke
C. Adriano
Holger Giese
Jan Mathias Koehler
Lukas Schott
DiffM
45
2
0
06 May 2024
Position: Why We Must Rethink Empirical Research in Machine Learning
Moritz Herrmann
F. J. D. Lange
Katharina Eggensperger
Giuseppe Casalicchio
Marcel Wever
Matthias Feurer
David Rügamer
Eyke Hüllermeier
A. Boulesteix
Bernd Bischl
44
6
0
03 May 2024
On the Rashomon ratio of infinite hypothesis sets
Evzenie Coupkova
Mireille Boutin
26
1
0
27 Apr 2024
From Model Performance to Claim: How a Change of Focus in Machine Learning Replicability Can Help Bridge the Responsibility Gap
Tianqi Kou
32
0
0
19 Apr 2024
Beyond development: Challenges in deploying machine learning models for structural engineering applications
M. Z. Esteghamati
Brennan Bean
Henry V. Burton
M. Z. Naser
AI4CE
21
1
0
18 Apr 2024
Machine Learning Robustness: A Primer
Houssem Ben Braiek
Foutse Khomh
AAML
OOD
32
5
0
01 Apr 2024
Specification Overfitting in Artificial Intelligence
Benjamin Roth
Pedro Henrique Luz de Araujo
Yuxi Xia
Saskia Kaltenbrunner
Christoph Korab
56
0
0
13 Mar 2024
Fusing Climate Data Products using a Spatially Varying Autoencoder
Jacob A. Johnson
Matthew J. Heaton
William F. Christensen
Lynsie R. Warr
S. Rupper
AI4CE
16
0
0
12 Mar 2024
Calibrating Large Language Models Using Their Generations Only
Dennis Ulmer
Martin Gubri
Hwaran Lee
Sangdoo Yun
Seong Joon Oh
UQLM
411
18
1
09 Mar 2024
1
2
3
4
5
6
7
8
Next