One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities

23 September 2016
Michalis K. Titsias
    UQCV

Papers citing "One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities"

32 / 32 papers shown
Improved Stochastic Optimization of LogSumExp
E. Gladin
Alexey Kroshnin
Jia Jie Zhu
Pavel Dvurechensky
208
1
0
29 Sep 2025
Bayesian Principles Improve Prompt Learning In Vision-Language Models
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Mingyu Kim
Jongwoo Ko
Mijung Park
VLM
403
1
0
19 Apr 2025
Accelerating Convergence in Bayesian Few-Shot Classification
International Conference on Machine Learning (ICML), 2024
Tianjun Ke
Haoqun Cao
Feng Zhou
385
2
0
02 May 2024
HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption
Seewoo Lee
Garam Lee
Jung Woo Kim
Junbum Shin
Mun-Kyu Lee
316
52
0
21 Mar 2024
Convex Bounds on the Softmax Function with Applications to Robustness Verification
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Dennis L. Wei
Haoze Wu
Min Wu
Pin-Yu Chen
Clark W. Barrett
E. Farchi
UQCV, AAML
179
16
0
03 Mar 2023
On the inconsistency of separable losses for structured prediction
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Caio Corro
271
3
0
25 Jan 2023
LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition
Yuguang Yang
Yu Pan
Jingjing Yin
Heng Lu
300
5
0
05 Dec 2022
Hyperbolic Cosine Transformer for LiDAR 3D Object Detection
Jigang Tong
Fanhang Yang
Sen Yang
Enzeng Dong
Shengzhi Du
Xing-jun Wang
Xianlin Yi
3DPC, ViT
162
1
0
10 Nov 2022
The Devil in Linear Transformer
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhen Qin
Xiaodong Han
Weixuan Sun
Dongxu Li
Lingpeng Kong
Nick Barnes
Yiran Zhong
318
112
0
19 Oct 2022
Neural Architecture Search on Efficient Transformers and Beyond
Zexiang Liu
Dong Li
Kaiyue Lu
Zhen Qin
Weixuan Sun
Jiacheng Xu
Yiran Zhong
280
23
0
28 Jul 2022
Enhancing Classifier Conservativeness and Robustness by Polynomiality
Computer Vision and Pattern Recognition (CVPR), 2022
Ziqi Wang
Marco Loog
AAML
230
3
0
23 Mar 2022
cosFormer: Rethinking Softmax in Attention
International Conference on Learning Representations (ICLR), 2022
Zhen Qin
Weixuan Sun
Huicai Deng
Dongxu Li
Yunshen Wei
Baohong Lv
Junjie Yan
Lingpeng Kong
Yiran Zhong
453
300
0
17 Feb 2022
Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning
Neural Information Processing Systems (NeurIPS), 2021
Kento Nozawa
Issei Sato
SSL
584
53
0
13 Feb 2021
Statistical optimality and stability of tangent transform algorithms in logit models
Journal of Machine Learning Research (JMLR), 2020
I. Ghosh
A. Bhattacharya
D. Pati
314
5
0
25 Oct 2020
BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Knowledge Discovery and Data Mining (KDD), 2020
Otmane Sakhi
Stephen Bonner
D. Rohde
Flavian Vasile
299
37
0
28 Aug 2020
Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes
Jake C. Snell
R. Zemel
363
68
0
20 Jul 2020
Preferential Batch Bayesian Optimization
International Workshop on Machine Learning for Signal Processing (MLSP), 2020
E. Siivola
Akash Kumar Dhaka
Michael Riis Andersen
Javier I. González
Pablo G. Moreno
Aki Vehtari
290
25
0
25 Mar 2020
Fast Predictive Uncertainty for Classification with Bayesian Deep Networks
Conference on Uncertainty in Artificial Intelligence (UAI), 2020
Marius Hobbhahn
Agustinus Kristiadi
Philipp Hennig
BDL, UQCV
531
40
0
02 Mar 2020
Extreme Classification via Adversarial Softmax Approximation
International Conference on Learning Representations (ICLR), 2020
Kushagra Pandey
Stephan Mandt
225
25
0
15 Feb 2020
End to end learning and optimization on graphs
Neural Information Processing Systems (NeurIPS), 2019
Bryan Wilder
Eric Ewing
B. Dilkina
Milind Tambe
GNN
319
124
0
31 May 2019
Multi-Class Gaussian Process Classification Made Conjugate: Efficient Inference via Data Augmentation
Conference on Uncertainty in Artificial Intelligence (UAI), 2019
Théo Galy-Fajou
F. Wenzel
Christian Donner
Manfred Opper
239
32
0
23 May 2019
Latent Variable Session-Based Recommendation
D. Rohde
Stephen Bonner
BDL
359
3
0
24 Apr 2019
Multimodal Explanations by Predicting Counterfactuality in Videos
Atsushi Kanehira
Kentaro Takemoto
S. Inayoshi
Tatsuya Harada
195
42
0
04 Dec 2018
Sigsoftmax: Reanalysis of the Softmax Bottleneck
Sekitoshi Kanai
Yasuhiro Fujiwara
Yuki Yamanaka
S. Adachi
324
78
0
28 May 2018
Unbiased scalable softmax optimization
Francois Fagan
G. Iyengar
146
14
0
22 Mar 2018
Augment and Reduce: Stochastic Inference for Large Categorical Distributions
Francisco J. R. Ruiz
Michalis K. Titsias
Adji Bousso Dieng
David M. Blei
BDL
327
22
0
12 Feb 2018
Physics-constrained, data-driven discovery of coarse-grained dynamics
L. Felsberger
P. Koutsourelakis
AI4CE
233
20
0
11 Feb 2018
SHOPPER: A Probabilistic Model of Consumer Choice with Substitutes and Complements
Francisco J. R. Ruiz
Susan Athey
David M. Blei
674
96
0
09 Nov 2017
Candidates vs. Noises Estimation for Large Multi-Class Classification Problem
Lei Han
Yiheng Huang
Tong Zhang
180
3
0
02 Nov 2017
On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning
Bolin Gao
Lacra Pavel
FAtt
455
371
0
03 Apr 2017
Generative and Discriminative Text Classification with Recurrent Neural Networks
Dani Yogatama
Chris Dyer
Wang Ling
Phil Blunsom
307
213
0
06 Mar 2017
Aggressive Sampling for Multi-class to Binary Reduction with Applications to Text Classification
Neural Information Processing Systems (NeurIPS), 2017
Bikash Joshi
Massih-Reza Amini
Ioannis Partalas
F. Iutzeler
Yury Maximov
MQ
392
16
0
23 Jan 2017