ResearchTrend.AI
Sequence-to-Sequence Learning as Beam-Search Optimization

9 June 2016
Sam Wiseman
Alexander M. Rush

Papers citing "Sequence-to-Sequence Learning as Beam-Search Optimization"

50 / 276 papers shown
Title
Parallel Scaling Law for Language Models
Mouxiang Chen
Binyuan Hui
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
Jianling Sun
Junyang Lin
Zhongxin Liu
MoE
LRM
37
0
0
15 May 2025
Latent Beam Diffusion Models for Decoding Image Sequences
Guilherme Fernandes
Vasco Ramos
Regev Cohen
Idan Szpektor
João Magalhães
78
0
0
26 Mar 2025
LREF: A Novel LLM-based Relevance Framework for E-commerce
Tian Tang
Zhixing Tian
Zhenyu Zhu
Chenyang Wang
Haiqing Hu
Guoyu Tang
Lin Liu
Sulong Xu
60
0
0
12 Mar 2025
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective
Xianliang Li
Jun Luo
Zhiwei Zheng
Hanxiao Wang
Li Luo
Lingkun Wen
Linlong Wu
Sheng Xu
72
0
0
29 Nov 2024
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Miguel Moura Ramos
Tomás Almeida
Daniel Vareta
Filipe Azevedo
Sweta Agrawal
Patrick Fernandes
André F. T. Martins
31
1
0
08 Nov 2024
FastSurvival: Hidden Computational Blessings in Training Cox Proportional Hazards Models
Jiachang Liu
Rui Zhang
Cynthia Rudin
29
0
0
24 Oct 2024
Self-Explained Keywords Empower Large Language Models for Code Generation
Lishui Fan
Mouxiang Chen
Zhongxin Liu
40
1
0
21 Oct 2024
Positive Text Reframing under Multi-strategy Optimization
Shutong Jia
Biwei Cao
Qingqing Gao
Jiuxin Cao
Bo Liu
23
1
0
25 Jul 2024
Improving Autoregressive Training with Dynamic Oracles
Jianing Yang
Harshine Visvanathan
Yilin Wang
Xinyi Hu
Matthew R. Gormley
54
0
0
13 Jun 2024
RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in LVLMs
Sangmin Woo
Jaehyuk Jang
Donguk Kim
Yubin Choi
Changick Kim
42
1
0
28 May 2024
The Road Less Scheduled
Aaron Defazio
Xingyu Yang
Harsh Mehta
Konstantin Mishchenko
Ahmed Khaled
Ashok Cutkosky
33
45
0
24 May 2024
Optimal Kernel Tuning Parameter Prediction using Deep Sequence Models
Khawir Mahmood
Jehandad Khan
Hammad Afzal
21
0
0
15 Apr 2024
Investigating the Performance of Language Models for Completing Code in Functional Programming Languages: a Haskell Case Study
Tim van Dam
Frank van der Heijden
Philippe de Bekker
Berend Nieuwschepen
Marc Otten
M. Izadi
41
5
0
22 Mar 2024
JumpCoder: Go Beyond Autoregressive Coder via Online Modification
Mouxiang Chen
Hao Tian
Zhongxin Liu
Xiaoxue Ren
Jianling Sun
SyDa
KELM
40
2
0
15 Jan 2024
Locally Optimal Descent for Dynamic Stepsize Scheduling
Gilad Yehudai
Alon Cohen
Amit Daniely
Yoel Drori
Tomer Koren
Mariano Schain
34
0
0
23 Nov 2023
Aligning Neural Machine Translation Models: Human Feedback in Training and Inference
Miguel Moura Ramos
Patrick Fernandes
António Farinhas
André F. T. Martins
ALM
22
15
0
15 Nov 2023
Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment
Geyang Guo
Ranchi Zhao
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
ALM
34
27
0
07 Nov 2023
KeyGen2Vec: Learning Document Embedding via Multi-label Keyword Generation in Question-Answering
Iftitahu Ni'mah
Samaneh Khoshrou
Vlado Menkovski
Mykola Pechenizkiy
17
0
0
30 Oct 2023
FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering
Md. Rafi Ur Rashid
Vishnu Asutosh Dasu
Kang Gu
Najrin Sultana
Shagufta Mehnaz
AAML
FedML
46
10
0
24 Oct 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li
Yiran Liu
Xingxing Zhang
Wei Lu
Furu Wei
ALM
32
3
0
20 Oct 2023
Formally Specifying the High-Level Behavior of LLM-Based Agents
M. Crouse
Ibrahim Abdelaziz
Ramón Fernández Astudillo
Kinjal Basu
Soham Dan
Sadhana Kumaravel
Achille Fokoue
Pavan Kapanipathi
Salim Roukos
Luis A. Lastras
LLMAG
20
8
0
12 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
90
12
0
12 Oct 2023
Bilevel Scheduled Sampling for Dialogue Generation
Jiawen Liu
Kan Li
25
0
0
05 Sep 2023
Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers
Jiawen Xie
Pengyu Cheng
Xiao Liang
Yong Dai
Nan Du
40
7
0
25 Aug 2023
O-1: Self-training with Oracle and 1-best Hypothesis
M. Baskar
Andrew Rosenberg
Bhuvana Ramabhadran
Kartik Audhkhasi
VLM
22
0
0
14 Aug 2023
Learning to Generate Better Than Your LLM
Jonathan D. Chang
Kianté Brantley
Rajkumar Ramamurthy
Dipendra Kumar Misra
Wen Sun
19
41
0
20 Jun 2023
On Learning to Summarize with Large Language Models as References
Yixin Liu
Kejian Shi
Katherine S He
Longtian Ye
Alexander R. Fabbri
Pengfei Liu
Dragomir R. Radev
Arman Cohan
ELM
28
71
0
23 May 2023
Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
Hong Liu
Z. Lv
Zhijian Ou
Wenbo Zhao
Qing Xiao
22
0
0
22 May 2023
ACRoBat: Optimizing Auto-batching of Dynamic Deep Learning at Compile Time
Pratik Fegade
Tianqi Chen
Phillip B. Gibbons
T. Mowry
35
2
0
17 May 2023
HPE: Answering Complex Questions over Text by Hybrid Question Parsing and Execution
Ye Liu
Semih Yavuz
Rui Meng
Dragomir R. Radev
Caiming Xiong
Yingbo Zhou
32
8
0
12 May 2023
ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base
Siyu Yuan
Jiangjie Chen
Changzhi Sun
Jiaqing Liang
Yanghua Xiao
Deqing Yang
39
16
0
10 May 2023
Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap between Training and Inference!
Zecheng Tang
Pinzheng Wang
Keyan Zhou
Juntao Li
Ziqiang Cao
M. Zhang
DiffM
31
11
0
08 May 2023
OKRidge: Scalable Optimal k-Sparse Ridge Regression
Jiachang Liu
Sam Rosen
Chudi Zhong
Cynthia Rudin
22
4
0
13 Apr 2023
Better Language Models of Code through Self-Improvement
H. To
Nghi D. Q. Bui
Jingnan Guo
T. Nguyen
SyDa
39
15
0
02 Apr 2023
Transformer-based Planning for Symbolic Regression
Parshin Shojaee
Kazem Meidani
A. Farimani
Chandan K. Reddy
47
33
0
13 Mar 2023
NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation
Quchen Fu
Zhongwei Teng
Marco Georgaklis
Jules White
C. Schmidt
25
6
0
15 Feb 2023
Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing
Han He
Jinho D. Choi
36
4
0
05 Feb 2023
Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective
Michael E. Sander
J. Puigcerver
Josip Djolonga
Gabriel Peyré
Mathieu Blondel
21
18
0
02 Feb 2023
Learning-Rate-Free Learning by D-Adaptation
Aaron Defazio
Konstantin Mishchenko
24
77
0
18 Jan 2023
Active Learning for Neural Machine Translation
Neeraj Vashistha
Kritika Singh
Ramakant Shakya
13
0
0
30 Dec 2022
Claim Optimization in Computational Argumentation
Gabriella Skitalinskaya
Maximilian Spliethöver
Henning Wachsmuth
30
7
0
17 Dec 2022
Momentum Calibration for Text Generation
Xingxing Zhang
Yiran Liu
Xun Wang
Pengcheng He
Yang Yu
Si-Qing Chen
Wayne Xiong
Furu Wei
66
8
0
08 Dec 2022
Contrastive Decoding: Open-ended Text Generation as Optimization
Xiang Lisa Li
Ari Holtzman
Daniel Fried
Percy Liang
Jason Eisner
Tatsunori Hashimoto
Luke Zettlemoyer
M. Lewis
33
323
0
27 Oct 2022
SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser
Yue Zhang
Bo Zhang
Zhenghua Li
Rasna Goyal
Chen Li
Min Zhang
27
42
0
22 Oct 2022
Shift-Reduce Task-Oriented Semantic Parsing with Stack-Transformers
Daniel Fernández-González
31
0
0
21 Oct 2022
AugCSE: Contrastive Sentence Embedding with Diverse Augmentations
Zilu Tang
Muhammed Yusuf Kocyigit
Derry Wijaya
35
8
0
20 Oct 2022
Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks
Xuming Hu
Yong-jia Jiang
Aiwei Liu
Zhongqiang Huang
Pengjun Xie
Fei Huang
Lijie Wen
Philip S. Yu
36
13
0
19 Oct 2022
Teacher Forcing Recovers Reward Functions for Text Generation
Yongchang Hao
Yuxin Liu
Lili Mou
OffRL
32
11
0
17 Oct 2022
Hierarchical Few-Shot Object Detection: Problem, Benchmark and Method
Lu Zhang
Yang Wang
Jiaogen Zhou
Chenbo Zhang
Yinglu Zhang
Jihong Guan
Yatao Bian
Shuigeng Zhou
29
9
0
08 Oct 2022
Medical Image Captioning via Generative Pretrained Transformers
Alexander Selivanov
Oleg Y. Rogov
Daniil Chesakov
Artem Shelmanov
Irina Fedulova
Dmitry V. Dylov
MedIm
54
55
0
28 Sep 2022