ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 0907.3740
  4. Cited By
Empirical Bernstein Bounds and Sample Variance Penalization

Empirical Bernstein Bounds and Sample Variance Penalization

Annual Conference Computational Learning Theory (COLT), 2009
21 July 2009
Andreas Maurer
Massimiliano Pontil
ArXiv (abs)PDFHTML

Papers citing "Empirical Bernstein Bounds and Sample Variance Penalization"

50 / 335 papers shown
Vector-valued self-normalized concentration inequalities beyond sub-Gaussianity
Vector-valued self-normalized concentration inequalities beyond sub-Gaussianity
Diego Martinez-Taboada
Tomás González
Aaditya Ramdas
121
3
0
05 Nov 2025
Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live
Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live
Hanchen Li
Qiuyang Mang
Runyuan He
Qizheng Zhang
Huanzhi Mao
Xiaokun Chen
Alvin Cheung
Joseph E. Gonzalez
Ion Stoica
Ion Stoica
477
12
0
04 Nov 2025
Towards Scalable Oversight via Partitioned Human Supervision
Towards Scalable Oversight via Partitioned Human Supervision
Ren Yin
Takashi Ishida
Masashi Sugiyama
207
0
0
26 Oct 2025
Fair Representation Learning with Controllable High Confidence Guarantees via Adversarial Inference
Fair Representation Learning with Controllable High Confidence Guarantees via Adversarial Inference
Yuhong Luo
Austin Hoag
Xintong Wang
Philip S Thomas
Przemyslaw A. Grabowicz
FaML
327
0
0
23 Oct 2025
Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach
Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach
Sebastian Reboul
Hélène Halconruy
Randal Douc
OffRL
167
0
0
22 Oct 2025
Fast Best-in-Class Regret for Contextual Bandits
Fast Best-in-Class Regret for Contextual Bandits
Samuel Girard
Aurélien Bibaut
Houssam Zenati
Nathan Kallus
Houssam Zenati
OffRL
166
0
0
17 Oct 2025
Risk-Aware Reinforcement Learning with Bandit-Based Adaptation for Quadrupedal Locomotion
Risk-Aware Reinforcement Learning with Bandit-Based Adaptation for Quadrupedal Locomotion
Yuanhong Zeng
Anushri Dixit
OffRL
130
0
0
16 Oct 2025
SGM: A Statistical Godel Machine for Risk-Controlled Recursive Self-Modification
SGM: A Statistical Godel Machine for Risk-Controlled Recursive Self-Modification
X. Wu
Shenqin Yin
Yanlan Kang
Xinhang Zhang
Qianya Xu
Zeping Chen
Wenqiang Zhang
161
3
0
11 Oct 2025
Generative Evolutionary Meta-Solver (GEMS): Scalable Surrogate-Free Multi-Agent Reinforcement Learning
Generative Evolutionary Meta-Solver (GEMS): Scalable Surrogate-Free Multi-Agent Reinforcement Learning
Alakh Sharma
Gaurish Trivedi
Kartikey Singh Bhandari
Yash Sinha
Dhruv Kumar
Pratik Narang
Jagat Sesh Challa
142
0
0
27 Sep 2025
Hybrid Safety Verification of Multi-Agent Systems using $ψ$-Weighted CBFs and PAC Guarantees
Hybrid Safety Verification of Multi-Agent Systems using ψψψ-Weighted CBFs and PAC Guarantees
Venkat Margapuri
Garik Kazanjian
Naren Kosaraju
173
0
0
24 Sep 2025
Sample Efficient Certification of Discrete-Time Control Barrier Functions
Sample Efficient Certification of Discrete-Time Control Barrier Functions
Sampath Kumar Mulagaleti
Andrea Del Prete
104
0
0
04 Sep 2025
ORVIT: Near-Optimal Online Distributionally Robust Reinforcement Learning
ORVIT: Near-Optimal Online Distributionally Robust Reinforcement Learning
Debamita Ghosh
George Atia
Yue Wang
OffRLOOD
440
3
0
05 Aug 2025
Sample-Efficient Distributionally Robust Multi-Agent Reinforcement Learning via Online Interaction
Sample-Efficient Distributionally Robust Multi-Agent Reinforcement Learning via Online Interaction
Zain Ulabedeen Farhat
Debamita Ghosh
George Atia
Yue Wang
233
2
0
04 Aug 2025
How Much Is Too Much? Adaptive, Context-Aware Risk Detection in Naturalistic Driving
How Much Is Too Much? Adaptive, Context-Aware Risk Detection in Naturalistic Driving
Amir Hossein Kalantari
Eleonora Papadimitriou
Arkady Zgonnikov
Amir Pooyan Afghari
328
0
0
26 Jul 2025
Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability
Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability
Dirk van der Hoeven
Julia Olkhovskaia
T. Erven
196
2
0
23 Jul 2025
The Sample Complexity of Parameter-Free Stochastic Convex Optimization
The Sample Complexity of Parameter-Free Stochastic Convex Optimization
Jared Lawrence
Ari Kalinsky
Hannah Bradfield
Y. Carmon
Oliver Hinder
263
0
0
12 Jun 2025
Performative Risk Control: Calibrating Models for Reliable Deployment under Performativity
Performative Risk Control: Calibrating Models for Reliable Deployment under Performativity
Victor Li
Baiting Chen
Yuzhen Mao
Qi Lei
Zhun Deng
246
0
0
30 May 2025
STaR-Bets: Sequential Target-Recalculating Bets for Tighter Confidence Intervals
STaR-Bets: Sequential Target-Recalculating Bets for Tighter Confidence Intervals
Václav Voráček
Francesco Orabona
289
3
0
28 May 2025
Adaptive Prediction-Powered AutoEval with Reliability and Efficiency Guarantees
Adaptive Prediction-Powered AutoEval with Reliability and Efficiency Guarantees
Sangwoo Park
Matteo Zecchin
Osvaldo Simeone
386
4
0
24 May 2025
Rethink Repeatable Measures of Robot Performance with Statistical Query
Rethink Repeatable Measures of Robot Performance with Statistical Query
Bowen Weng
L. Capito
Guillermo A. Castillo
Dylan Khor
495
1
0
13 May 2025
Lower Bounds on the MMSE of Adversarially Inferring Sensitive Features
Lower Bounds on the MMSE of Adversarially Inferring Sensitive Features
Monica Welfert
Nathan Stromberg
Mario Díaz
Lalitha Sankar
AAML
431
0
0
13 May 2025
Online Episodic Convex Reinforcement Learning
Online Episodic Convex Reinforcement Learning
B. Moreno
Khaled Eldowa
Pierre Gaillard
Margaux Brégère
Nadia Oudjane
OffRL
356
0
0
12 May 2025
Compute-Optimal LLMs Provably Generalize Better With Scale
Compute-Optimal LLMs Provably Generalize Better With ScaleInternational Conference on Learning Representations (ICLR), 2025
Marc Finzi
Sanyam Kapoor
Diego Granziol
Anming Gu
Christopher De Sa
J. Zico Kolter
Andrew Gordon Wilson
498
5
0
21 Apr 2025
Ensuring Safety in an Uncertain Environment: Constrained MDPs via Stochastic Thresholds
Ensuring Safety in an Uncertain Environment: Constrained MDPs via Stochastic Thresholds
Qian Zuo
Fengxiang He
384
0
0
07 Apr 2025
Bridging the Theoretical Gap in Randomized Smoothing
Bridging the Theoretical Gap in Randomized SmoothingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Blaise Delattre
Paul Caillon
Quentin Barthélemy
Erwan Fagnou
Alexandre Allauzen
AAML
479
1
0
03 Apr 2025
Estimating stationary mass, frequency by frequency
Estimating stationary mass, frequency by frequencyAnnual Conference Computational Learning Theory (COLT), 2025
Milind Nakul
Vidya Muthukumar
A. Pananjady
682
2
0
17 Mar 2025
Seldonian Reinforcement Learning for Ad Hoc Teamwork
Seldonian Reinforcement Learning for Ad Hoc Teamwork
Edoardo Zorzi
A. Castellini
Leonidas Bakopoulos
Georgios Chalkiadakis
Alessandro Farinelli
OffRL
357
1
0
05 Mar 2025
A Refined Analysis of UCBVI
A Refined Analysis of UCBVI
Simone Drago
Marco Mussi
Alberto Maria Metelli
415
1
0
24 Feb 2025
On Agnostic PAC Learning in the Small Error Regime
On Agnostic PAC Learning in the Small Error Regime
Julian Asilis
Mikael Møller Høgsgaard
Grigoris Velegkas
308
0
0
13 Feb 2025
Near-Optimal Reinforcement Learning with Shuffle Differential Privacy
Shaojie Bai
Mohammad Sadegh Talebi
Chengcheng Zhao
Peng Cheng
Jiming Chen
OffRL
518
0
0
18 Nov 2024
Towards Harmless Rawlsian Fairness Regardless of Demographic Prior
Towards Harmless Rawlsian Fairness Regardless of Demographic PriorNeural Information Processing Systems (NeurIPS), 2024
Xuanqian Wang
Jing Li
Ivor Tsang
Yew-Soon Ong
440
2
0
04 Nov 2024
Federated UCBVI: Communication-Efficient Federated Regret Minimization
  with Heterogeneous Agents
Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous AgentsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Safwan Labbi
D. Tiapkin
Lorenzo Mancini
Paul Mangold
Eric Moulines
FedML
322
5
0
30 Oct 2024
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging
  Small LMs
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
A. S. Rawat
Veeranjaneyulu Sadhanala
Afshin Rostamizadeh
Ayan Chakrabarti
Wittawat Jitkrittum
...
Rakesh Shivanna
Sashank J. Reddi
A. Menon
Rohan Anil
Sanjiv Kumar
552
14
0
24 Oct 2024
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive
  Approach
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive ApproachNeural Information Processing Systems (NeurIPS), 2024
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
190
3
0
17 Oct 2024
Linguistically Grounded Analysis of Language Models using Shapley Head Values
Linguistically Grounded Analysis of Language Models using Shapley Head ValuesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Marcell Richard Fekete
Johannes Bjerva
524
1
0
17 Oct 2024
Instrumental variables: A non-asymptotic viewpoint
Instrumental variables: A non-asymptotic viewpoint
Eric Xia
Martin J. Wainwright
Whitney Newey
206
2
0
02 Oct 2024
Empirical Bernstein in smooth Banach spaces
Empirical Bernstein in smooth Banach spaces
Diego Martinez-Taboada
Aaditya Ramdas
355
12
0
09 Sep 2024
Randomization Techniques to Mitigate the Risk of Copyright Infringement
Randomization Techniques to Mitigate the Risk of Copyright Infringement
Wei-Ning Chen
Peter Kairouz
Sewoong Oh
Zheng Xu
AAML
265
1
0
21 Aug 2024
Efficient Reinforcement Learning in Probabilistic Reward Machines
Efficient Reinforcement Learning in Probabilistic Reward MachinesAAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaofeng Lin
Xuezhou Zhang
312
3
0
19 Aug 2024
Making Robust Generalizers Less Rigid with Loss Concentration
Making Robust Generalizers Less Rigid with Loss Concentration
Matthew J. Holland
Toma Hamada
OOD
437
0
0
07 Aug 2024
Early Stopping Based on Repeated Significance
Early Stopping Based on Repeated SignificanceBigData Congress [Services Society] (BSS), 2024
Eric Bax
Arundhyoti Sarkar
Alex Shtoff
255
1
0
01 Aug 2024
How to Shrink Confidence Sets for Many Equivalent Discrete
  Distributions?
How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?
Odalric-Ambrym Maillard
M. S. Talebi
178
0
0
22 Jul 2024
Semi-Variance Reduction for Fair Federated Learning
Semi-Variance Reduction for Fair Federated Learning
Saber Malekmohammadi
Yaoliang Yu
FedML
467
2
0
23 Jun 2024
A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning
A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning
Gianluca Drappo
Alberto Maria Metelli
Marcello Restelli
143
0
0
21 Jun 2024
A Multi-Armed Bandit Approach to Online Selection and Evaluation of Generative Models
A Multi-Armed Bandit Approach to Online Selection and Evaluation of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
321
0
0
11 Jun 2024
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection
  and Learning
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and LearningNeural Information Processing Systems (NeurIPS), 2024
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
OffRL
364
13
0
23 May 2024
DirMixE: Harnessing Test Agnostic Long-tail Recognition with Hierarchical Label Vartiations
DirMixE: Harnessing Test Agnostic Long-tail Recognition with Hierarchical Label VartiationsInternational Conference on Machine Learning (ICML), 2024
Zhiyong Yang
Qianqian Xu
Zitai Wang
Sicong Li
Boyu Han
Shilong Bao
268
18
0
13 May 2024
Optimistic Regret Bounds for Online Learning in Adversarial Markov
  Decision Processes
Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision ProcessesConference on Uncertainty in Artificial Intelligence (UAI), 2024
Sang Bin Moon
Abolfazl Hashemi
252
2
0
03 May 2024
Multi-Objective Recommendation via Multivariate Policy Learning
Multi-Objective Recommendation via Multivariate Policy LearningACM Conference on Recommender Systems (RecSys), 2024
Olivier Jeunen
Jatin Mandav
Ivan Potapov
Nakul Agarwal
Sourabh Vaid
Wenzhe Shi
Aleksei Ustimenko
OffRL
283
12
0
03 May 2024
An Information Theoretic Perspective on Conformal Prediction
An Information Theoretic Perspective on Conformal PredictionNeural Information Processing Systems (NeurIPS), 2024
Alvaro H.C. Correia
F. V. Massoli
Christos Louizos
Arash Behboodi
519
19
0
03 May 2024
1234567
Next
Page 1 of 7