ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.06059
  4. Cited By
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for
  Sampling Sequences Without Replacement

Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement

14 March 2019
W. Kool
H. V. Hoof
Max Welling
ArXivPDFHTML

Papers citing "Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement"

36 / 36 papers shown
Title
Optimizing Temperature for Language Models with Multi-Sample Inference
Optimizing Temperature for Language Models with Multi-Sample Inference
Weihua Du
Yiming Yang
Sean Welleck
56
2
0
07 Feb 2025
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling
Emir Ceyani
Han Xie
Baturalp Buyukates
Carl Yang
Salman Avestimehr
FedML
95
0
0
22 Jan 2025
GraphXForm: Graph transformer for computer-aided molecular design
GraphXForm: Graph transformer for computer-aided molecular design
Jonathan Pirnay
Jan G. Rittig
Alexander B. Wolf
Martin Grohe
Jakob Burger
Alexander Mitsos
D. G. Grimm
AI4CE
49
1
0
03 Nov 2024
Self-evolving Agents with reflective and memory-augmented abilities
Self-evolving Agents with reflective and memory-augmented abilities
Xuechen Liang
Yangfan He
Yinghui Xia
Xinyuan Song
Jianhui Wang
...
Keqin Li
Jiaqi Chen
Jinsong Yang
Siyuan Chen
Tianyu Shi
LLMAG
KELM
CLL
33
2
0
01 Sep 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni
Yulin Wang
Renping Zhou
Rui Lu
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Yuan Yao
Gao Huang
25
7
0
31 Aug 2024
Coupling without Communication and Drafter-Invariant Speculative Decoding
Coupling without Communication and Drafter-Invariant Speculative Decoding
Majid Daliri
Christopher Musco
A. Suresh
46
1
0
15 Aug 2024
Informed Correctors for Discrete Diffusion Models
Informed Correctors for Discrete Diffusion Models
Yixiu Zhao
Jiaxin Shi
Lester W. Mackey
Scott W. Linderman
Lester Mackey
Scott Linderman
42
9
0
30 Jul 2024
Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders
Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders
Christian Toth
Christian Knoll
Franz Pernkopf
Robert Peharz
CML
41
1
0
22 Feb 2024
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Yuu Jinnai
Ukyo Honda
Tetsuro Morimura
Peinan Zhang
26
6
0
10 Jan 2024
Unsupervised Extractive Summarization with Learnable Length Control
  Strategies
Unsupervised Extractive Summarization with Learnable Length Control Strategies
Renlong Jie
Xiaojun Meng
Xin Jiang
Qun Liu
19
1
0
12 Dec 2023
Everything Perturbed All at Once: Enabling Differentiable Graph Attacks
Everything Perturbed All at Once: Enabling Differentiable Graph Attacks
Haoran Liu
Bokun Wang
Jianling Wang
Xiangjue Dong
Tianbao Yang
James Caverlee
AAML
26
3
0
29 Aug 2023
Order-based Structure Learning with Normalizing Flows
Order-based Structure Learning with Normalizing Flows
Hamidreza Kamkari
Vahid Balazadeh Meresht
Vahid Zehtab
Rahul G. Krishnan
CML
21
1
0
14 Aug 2023
Policy-Based Self-Competition for Planning Problems
Policy-Based Self-Competition for Planning Problems
Jonathan Pirnay
Q. Göttl
Jakob Burger
D. G. Grimm
32
3
0
07 Jun 2023
A Reparameterized Discrete Diffusion Model for Text Generation
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
28
57
0
11 Feb 2023
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text
  Generation via Concentrating Attention
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
Wenhao Li
Xiaoyuan Yi
Jinyi Hu
Maosong Sun
Xing Xie
21
0
0
14 Nov 2022
Truncation Sampling as Language Model Desmoothing
Truncation Sampling as Language Model Desmoothing
John Hewitt
Christopher D. Manning
Percy Liang
BDL
36
75
0
27 Oct 2022
Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models
Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models
Luke Vilnis
Yury Zemlyanskiy
Patrick C. Murray
Alexandre Passos
Sumit Sanghai
54
9
0
18 Oct 2022
Is margin all you need? An extensive empirical study of active learning
  on tabular data
Is margin all you need? An extensive empirical study of active learning on tabular data
Dara Bahri
Heinrich Jiang
Tal Schuster
Afshin Rostamizadeh
LMTD
37
10
0
07 Oct 2022
SIMPLE: A Gradient Estimator for $k$-Subset Sampling
SIMPLE: A Gradient Estimator for kkk-Subset Sampling
Kareem Ahmed
Zhe Zeng
Mathias Niepert
Guy Van den Broeck
BDL
26
24
0
04 Oct 2022
Variational Open-Domain Question Answering
Variational Open-Domain Question Answering
Valentin Liévin
Andreas Geert Motzfeldt
Ida Riis Jensen
Ole Winther
OOD
BDL
26
8
0
23 Sep 2022
Sparse Graph Learning from Spatiotemporal Time Series
Sparse Graph Learning from Spatiotemporal Time Series
Andrea Cini
Daniele Zambon
C. Alippi
CML
AI4TS
35
18
0
26 May 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
24
6
0
11 Apr 2022
Learning Group Importance using the Differentiable Hypergeometric
  Distribution
Learning Group Importance using the Differentiable Hypergeometric Distribution
Thomas M. Sutter
Laura Manduchi
Alain Ryser
Julia E. Vogt
36
7
0
03 Mar 2022
Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement
  Learning
Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement Learning
Tristan S. W. Stevens
Nishith Chennakeshava
F. D. Bruijn
M. Pekar
Ruud J. G. van Sloun
11
4
0
24 Jan 2022
A Review of the Gumbel-max Trick and its Extensions for Discrete
  Stochasticity in Machine Learning
A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning
Iris A. M. Huijben
W. Kool
Max B. Paulus
Ruud J. G. van Sloun
24
92
0
04 Oct 2021
Conditional Poisson Stochastic Beam Search
Conditional Poisson Stochastic Beam Search
Clara Meister
Afra Amini
Tim Vieira
Ryan Cotterell
24
10
0
22 Sep 2021
Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
Yuanyi Zhong
Bodi Yuan
Hong Wu
Zhiqiang Yuan
Jian Peng
Yu-xiong Wang
30
167
0
20 Aug 2021
Differentiable Subset Pruning of Transformer Heads
Differentiable Subset Pruning of Transformer Heads
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
37
53
0
10 Aug 2021
Straight to the Gradient: Learning to Use Novel Tokens for Neural Text
  Generation
Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation
Xiang Lin
Simeng Han
Shafiq R. Joty
12
24
0
14 Jun 2021
NWT: Towards natural audio-to-video generation with representation
  learning
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
8
12
0
08 Jun 2021
Incremental Sampling Without Replacement for Sequence Models
Incremental Sampling Without Replacement for Sequence Models
Kensen Shi
David Bieber
Charles Sutton
VLM
SyDa
BDL
16
24
0
21 Feb 2020
Estimating Gradients for Discrete Random Variables by Sampling without
  Replacement
Estimating Gradients for Discrete Random Variables by Sampling without Replacement
W. Kool
H. V. Hoof
Max Welling
BDL
15
49
0
14 Feb 2020
Adversarial Filters of Dataset Biases
Adversarial Filters of Dataset Biases
Ronan Le Bras
Swabha Swayamdipta
Chandra Bhagavatula
Rowan Zellers
Matthew E. Peters
Ashish Sabharwal
Yejin Choi
29
220
0
10 Feb 2020
Paraphrase Generation with Latent Bag of Words
Paraphrase Generation with Latent Bag of Words
Yao Fu
Yansong Feng
John P. Cunningham
BDL
25
91
0
07 Jan 2020
Classical Structured Prediction Losses for Sequence to Sequence Learning
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
48
185
0
14 Nov 2017
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,740
0
26 Sep 2016
1