v1v2 (latest)

One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities

23 September 2016

Michalis K. Titsias

UQCV

ArXiv (abs)PDF HTML

Papers citing "One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities"

32 / 32 papers shown

Improved Stochastic Optimization of LogSumExp

208

29 Sep 2025

Bayesian Principles Improve Prompt Learning In Vision-Language ModelsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

403

19 Apr 2025

Accelerating Convergence in Bayesian Few-Shot ClassificationInternational Conference on Machine Learning (ICML), 2024

Tianjun Ke

Haoqun Cao

Feng Zhou

385

02 May 2024

HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption

316

21 Mar 2024

Convex Bounds on the Softmax Function with Applications to Robustness VerificationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Min Wu

179

03 Mar 2023

On the inconsistency of separable losses for structured predictionConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Caio Corro

271

25 Jan 2023

LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition

Yuguang Yang

Yu Pan

Jingjing Yin

Heng Lu

300

05 Dec 2022

Hyperbolic Cosine Transformer for LiDAR 3D Object Detection

Fanhang Yang

162

10 Nov 2022

The Devil in Linear TransformerConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Zhen Qin

Lingpeng Kong

318

112

19 Oct 2022

Neural Architecture Search on Efficient Transformers and Beyond

Zhen Qin

280

28 Jul 2022

Enhancing Classifier Conservativeness and Robustness by PolynomialityComputer Vision and Pattern Recognition (CVPR), 2022

Ziqi Wang

Marco Loog

AAML

230

23 Mar 2022

cosFormer: Rethinking Softmax in AttentionInternational Conference on Learning Representations (ICLR), 2022

Zhen Qin

Lingpeng Kong

453

300

17 Feb 2022

Understanding Negative Samples in Instance Discriminative Self-supervised Representation LearningNeural Information Processing Systems (NeurIPS), 2021

Kento Nozawa

Issei Sato

SSL

584

13 Feb 2021

Statistical optimality and stability of tangent transform algorithms in logit modelsJournal of machine learning research (JMLR), 2020

I. Ghosh

A. Bhattacharya

D. Pati

314

25 Oct 2020

BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit SignalsKnowledge Discovery and Data Mining (KDD), 2020

299

28 Aug 2020

Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes

Jake C. Snell

R. Zemel

363

20 Jul 2020

Preferential Batch Bayesian OptimizationInternational Workshop on Machine Learning for Signal Processing (MLSP), 2020

E. Siivola

Akash Kumar Dhaka

Michael Riis Andersen

Javier I. González

Pablo G. Moreno

Aki Vehtari

290

25 Mar 2020

Fast Predictive Uncertainty for Classification with Bayesian Deep NetworksConference on Uncertainty in Artificial Intelligence (UAI), 2020

531

02 Mar 2020

Extreme Classification via Adversarial Softmax ApproximationInternational Conference on Learning Representations (ICLR), 2020

Kushagra Pandey

Stephan Mandt

225

15 Feb 2020

End to end learning and optimization on graphsNeural Information Processing Systems (NeurIPS), 2019

319

124

31 May 2019

Multi-Class Gaussian Process Classification Made Conjugate: Efficient Inference via Data AugmentationConference on Uncertainty in Artificial Intelligence (UAI), 2019

239

23 May 2019

Latent Variable Session-Based Recommendation

D. Rohde

Stephen Bonner

BDL

359

24 Apr 2019

Multimodal Explanations by Predicting Counterfactuality in Videos

195

04 Dec 2018

Sigsoftmax: Reanalysis of the Softmax Bottleneck

324

28 May 2018

Unbiased scalable softmax optimization

Francois Fagan

G. Iyengar

146

22 Mar 2018

Augment and Reduce: Stochastic Inference for Large Categorical Distributions

327

12 Feb 2018

Physics-constrained, data-driven discovery of coarse-grained dynamics

L. Felsberger

P. Koutsourelakis

AI4CE

233

11 Feb 2018

SHOPPER: A Probabilistic Model of Consumer Choice with Substitutes and Complements

Francisco J. R. Ruiz

Susan Athey

David M. Blei

674

09 Nov 2017

Candidates vs. Noises Estimation for Large Multi-Class Classification Problem

Lei Han

Yiheng Huang

Tong Zhang

180

02 Nov 2017

On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning

Bolin Gao

Lacra Pavel

FAtt

455

371

03 Apr 2017

Generative and Discriminative Text Classification with Recurrent Neural Networks

307

213

06 Mar 2017

Aggressive Sampling for Multi-class to Binary Reduction with Applications to Text ClassificationNeural Information Processing Systems (NeurIPS), 2017

392

23 Jan 2017