Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.04689
Cited By
Multicalibration for Confidence Scoring in LLMs
6 April 2024
Gianluca Detommaso
Martín Bertrán
Riccardo Fogliato
Aaron Roth
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multicalibration for Confidence Scoring in LLMs"
14 / 14 papers shown
Title
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
Toghrul Abbasli
Kentaroh Toyoda
Yuan Wang
Leon Witt
Muhammad Asif Ali
Yukai Miao
Dan Li
Qingsong Wei
UQCV
79
0
0
25 Apr 2025
Calibrating Bayesian Learning via Regularization, Confidence Minimization, and Selective Inference
Jiayi Huang
Sangwoo Park
Osvaldo Simeone
56
2
0
03 Jan 2025
A Survey of Calibration Process for Black-Box LLMs
Liangru Xie
Hui Liu
Jingying Zeng
Xianfeng Tang
Yan Han
Chen Luo
Jing Huang
Zhen Li
Suhang Wang
Qi He
74
1
0
17 Dec 2024
Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
Paulius Rauba
Nabeel Seedat
Krzysztof Kacprzyk
M. Schaar
AI4CE
42
0
0
31 Oct 2024
Learning to Route LLMs with Confidence Tokens
Yu-Neng Chuang
Helen Zhou
Prathusha Kameswara Sarma
Parikshit Gopalan
John Boccio
Sara Bolouki
Xia Hu
20
0
0
17 Oct 2024
Ordinal Preference Optimization: Aligning Human Preferences via NDCG
Yang Zhao
Yixin Wang
Mingzhang Yin
24
2
0
06 Oct 2024
Cost-Effective Hallucination Detection for LLMs
Simon Valentin
Jinmiao Fu
Gianluca Detommaso
Shaoyuan Xu
Giovanni Zappella
Bryan Wang
HILM
24
4
0
31 Jul 2024
Large language model validity via enhanced conformal prediction methods
John J. Cherian
Isaac Gibbs
Emmanuel J. Candès
18
19
0
14 Jun 2024
When is Multicalibration Post-Processing Necessary?
Dutch Hansen
Siddartha Devic
Preetum Nakkiran
Vatsal Sharan
23
4
0
10 Jun 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
25
2
0
22 Mar 2024
Conformal Language Modeling
Victor Quach
Adam Fisch
Tal Schuster
Adam Yala
J. Sohn
Tommi Jaakkola
Regina Barzilay
74
55
0
16 Jun 2023
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul
Adian Liusie
Mark J. F. Gales
HILM
LRM
145
386
0
15 Mar 2023
Batch Multivalid Conformal Prediction
Christopher Jung
Georgy Noarov
Ramya Ramalingam
Aaron Roth
52
48
0
30 Sep 2022
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation
Tianyu Liu
Yizhe Zhang
Chris Brockett
Yi Mao
Zhifang Sui
Weizhu Chen
W. Dolan
HILM
209
140
0
18 Apr 2021
1