Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.19145
Cited By
Do Large Language Models (Really) Need Statistical Foundations?
25 May 2025
Weijie Su
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Do Large Language Models (Really) Need Statistical Foundations?"
50 / 73 papers shown
Title
PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective
Tim Tsz-Kit Lau
Qi Long
Weijie Su
5
1
0
27 May 2025
Text Generation Beyond Discrete Token Sampling
Yufan Zhuang
Liyuan Liu
Chandan Singh
Jingbo Shang
Jianfeng Gao
OOD
90
1
0
20 May 2025
Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach
Jiancong Xiao
Bojian Hou
Zhanliang Wang
Ruochen Jin
Q. Long
Weijie Su
Li Shen
57
1
0
04 May 2025
SConU: Selective Conformal Uncertainty in Large Language Models
Zhiyuan Wang
Qingni Wang
Yue Zhang
Tianlong Chen
Xiaofeng Zhu
Xiaoshuang Shi
Kaidi Xu
78
6
0
19 Apr 2025
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Yang Yue
Zhiqi Chen
Rui Lu
Andrew Zhao
Zhaokai Wang
Yang Yue
Shiji Song
Gao Huang
ReLM
LRM
96
55
0
18 Apr 2025
You've Changed: Detecting Modification of Black-Box Large Language Models
Alden Dima
James R. Foulds
Shimei Pan
Philip G. Feldman
62
1
0
14 Apr 2025
Conditional Data Synthesis Augmentation
Xinyu Tian
Xiaotong Shen
37
1
0
10 Apr 2025
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
Kai Ye
Hongyi Zhou
Jin Zhu
Francesco Quinzan
C. Shi
48
3
0
03 Apr 2025
Large Language Models Pass the Turing Test
Cameron R. Jones
Benjamin K. Bergen
ALM
ELM
94
8
0
31 Mar 2025
Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey
Xiaoou Liu
Tiejin Chen
Longchao Da
Chacha Chen
Zhen Lin
Hua Wei
HILM
90
6
0
20 Mar 2025
Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium
Kaizhao Liu
Qi Long
Zhekun Shi
Weijie J. Su
Jiancong Xiao
45
5
0
14 Mar 2025
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
Gokul Swamy
Sanjiban Choudhury
Wen Sun
Zhiwei Steven Wu
J. Andrew Bagnell
OffRL
93
13
0
03 Mar 2025
Large Language Diffusion Models
Shen Nie
Fengqi Zhu
Zebin You
Xiaolu Zhang
Jingyang Ou
Jun Hu
Jun Zhou
Yankai Lin
Ji-Rong Wen
Chongxuan Li
148
38
0
14 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
157
1,368
0
22 Jan 2025
Transformers Simulate MLE for Sequence Generation in Bayesian Networks
Yuan Cao
Yihan He
Dennis Wu
Hong-Yu Chen
Jianqing Fan
Han Liu
41
0
0
05 Jan 2025
A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models
Yinpeng Cai
Lexin Li
Linjun Zhang
368
1
0
05 Jan 2025
Robust Detection of Watermarks for Large Language Models Under Human Edits
Xiang Li
Feng Ruan
Huiyuan Wang
Qi Long
Weijie J. Su
WaLM
81
3
0
21 Nov 2024
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
Evan Miller
ELM
45
22
0
01 Nov 2024
Universality of the
π
2
/
6
π^2/6
π
2
/6
Pathway in Avoiding Model Collapse
Apratim Dey
D. Donoho
95
8
0
30 Oct 2024
Learning the Bitter Lesson: Empirical Evidence from 20 Years of CVPR Proceedings
Mojtaba Yousefi
Jack Collins
60
1
0
12 Oct 2024
Veridical Data Science for Medical Foundation Models
Ahmed Alaa
Bin Yu
AI4CE
16
0
0
15 Sep 2024
Synthetic continued pretraining
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL
SyDa
61
12
0
11 Sep 2024
From optimal score matching to optimal sampling
Zehao Dou
Subhodh Kotekal
Zhehao Xu
Harrison H. Zhou
DiffM
18
9
0
11 Sep 2024
Improving Pretraining Data Using Perplexity Correlations
Tristan Thrush
Christopher Potts
Tatsunori Hashimoto
68
19
0
09 Sep 2024
RegMix: Data Mixture as Regression for Language Model Pre-training
Qian Liu
Xiaosen Zheng
Niklas Muennighoff
Guangtao Zeng
Longxu Dou
Tianyu Pang
Jing Jiang
Min Lin
MoE
99
48
1
01 Jul 2024
Understanding and Mitigating Tokenization Bias in Language Models
Buu Phan
Marton Havasi
Matthew Muckley
Karen Ullrich
69
5
0
24 Jun 2024
Nemotron-4 340B Technical Report
Nvidia
:
Bo Adler
Niket Agarwal
Ashwath Aithal
...
Jimmy Zhang
Jing Zhang
Vivienne Zhang
Yian Zhang
Chen Zhu
81
60
0
17 Jun 2024
Quantifying Variance in Evaluation Benchmarks
Lovish Madaan
Aaditya K. Singh
Rylan Schaeffer
Andrew Poulton
Sanmi Koyejo
Pontus Stenetorp
Sharan Narang
Dieuwke Hupkes
56
12
0
14 Jun 2024
Large language model validity via enhanced conformal prediction methods
John J. Cherian
Isaac Gibbs
Emmanuel J. Candès
49
28
0
14 Jun 2024
To Believe or Not to Believe Your LLM
Yasin Abbasi-Yadkori
Ilja Kuzborskij
András György
Csaba Szepesvári
UQCV
78
47
0
04 Jun 2024
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
Jiancong Xiao
Ziniu Li
Xingyu Xie
E. Getzen
Cong Fang
Qi Long
Weijie J. Su
59
16
0
26 May 2024
Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees
Yu Gui
Ying Jin
Zhimei Ren
MedIm
138
18
0
16 May 2024
Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks
Lujing Zhang
Aaron Roth
Linjun Zhang
FaML
92
8
0
03 May 2024
Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning
Ruiqi Zhang
Licong Lin
Yu Bai
Song Mei
MU
95
150
0
08 Apr 2024
Uncertainty in Language Models: Assessment through Rank-Calibration
Xinmeng Huang
Shuo Li
Mengxin Yu
Matteo Sesia
Hamed Hassani
Insup Lee
Osbert Bastani
Yan Sun
53
17
0
04 Apr 2024
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data
Matthias Gerstgrasser
Rylan Schaeffer
Apratim Dey
Rafael Rafailov
Henry Sleight
...
Andrey Gromov
Daniel A. Roberts
Diyi Yang
D. Donoho
Oluwasanmi Koyejo
73
61
0
01 Apr 2024
tinyBenchmarks: evaluating LLMs with fewer examples
Felipe Maia Polo
Lucas Weber
Leshem Choshen
Yuekai Sun
Gongjun Xu
Mikhail Yurochkin
ELM
60
85
0
22 Feb 2024
Language Models with Conformal Factuality Guarantees
Christopher Mohri
Tatsunori Hashimoto
HILM
135
40
0
15 Feb 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
73
144
0
26 Jan 2024
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Damai Dai
Chengqi Deng
Chenggang Zhao
R. X. Xu
Huazuo Gao
...
Panpan Huang
Fuli Luo
Chong Ruan
Zhifang Sui
W. Liang
MoE
52
271
0
11 Jan 2024
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Albert Gu
Tri Dao
Mamba
58
2,552
0
01 Dec 2023
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Lei Huang
Weijiang Yu
Weitao Ma
Weihong Zhong
Zhangyin Feng
...
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
LRM
HILM
74
805
0
09 Nov 2023
Simplifying Transformer Blocks
Bobby He
Thomas Hofmann
51
34
0
03 Nov 2023
Large Language Model Unlearning
Yuanshun Yao
Xiaojun Xu
Yang Liu
MU
67
119
0
14 Oct 2023
Gender bias and stereotypes in Large Language Models
Hadas Kotek
Rikker Dockum
David Q. Sun
66
220
0
28 Aug 2023
What Should Data Science Education Do with Large Language Models?
Xinming Tu
James Zou
Weijie J. Su
Linjun Zhang
AI4Ed
52
35
0
06 Jul 2023
RWKV: Reinventing RNNs for the Transformer Era
Bo Peng
Eric Alcaide
Quentin G. Anthony
Alon Albalak
Samuel Arcadinho
...
Qihang Zhao
P. Zhou
Qinghua Zhou
Jian Zhu
Rui-Jie Zhu
158
581
0
22 May 2023
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining
Sang Michael Xie
Hieu H. Pham
Xuanyi Dong
Nan Du
Hanxiao Liu
Yifeng Lu
Percy Liang
Quoc V. Le
Tengyu Ma
Adams Wei Yu
MoMe
MoE
86
195
0
17 May 2023
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Ronen Eldan
Yuan-Fang Li
SyDa
LRM
46
255
0
12 May 2023
Whose Opinions Do Language Models Reflect?
Shibani Santurkar
Esin Durmus
Faisal Ladhak
Cinoo Lee
Percy Liang
Tatsunori Hashimoto
57
409
0
30 Mar 2023
1
2
Next