Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1409.3215
Cited By
Sequence to Sequence Learning with Neural Networks
10 September 2014
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sequence to Sequence Learning with Neural Networks"
50 / 4,872 papers shown
Title
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management
Wonbeom Lee
Jungi Lee
Junghwan Seo
Jaewoong Sim
RALM
34
75
0
28 Jun 2024
DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems
Kexiong Yu
Hang Zhao
Yuhang Huang
Renjiao Yi
Kai Xu
Chenyang Zhu
51
0
0
28 Jun 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CML
LRM
47
4
0
27 Jun 2024
CharED: Character-wise Ensemble Decoding for Large Language Models
Kevin Gu
Eva Tuecke
Dmitriy Katz
R. Horesh
David Alvarez-Melis
Mikhail Yurochkin
33
2
0
25 Jun 2024
Towards LLM-Powered Ambient Sensor Based Multi-Person Human Activity Recognition
Xi Chen
Julien Cumin
F. Ramparany
Dominique Vaufreydaz
34
1
0
25 Jun 2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Sean Welleck
Amanda Bertsch
Matthew Finlayson
Hailey Schoelkopf
Alex Xie
Graham Neubig
Ilia Kulikov
Zaid Harchaoui
33
51
0
24 Jun 2024
Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach
Yuxuan Wan
Chaozheng Wang
Yi Dong
Wenxuan Wang
Shuqing Li
Yintong Huo
M. Lyu
3DV
76
10
0
24 Jun 2024
F-FOMAML: GNN-Enhanced Meta-Learning for Peak Period Demand Forecasting with Proxy Data
Zexing Xu
Linjun Zhang
Sitan Yang
Rasoul Etesami
Hanghang Tong
Huan Zhang
Jiawei Han
AI4TS
36
3
0
23 Jun 2024
Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization
Kshitij Bhatta
Geigh Zollicoffer
Manish Bhattarai
Phil Romero
C. Negre
Anders M. N. Niklasson
A. Adedoyin
19
0
0
23 Jun 2024
Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification
Honori Udo
Takafumi Koshinaka
VLM
43
0
0
22 Jun 2024
Depth
F
1
F_1
F
1
: Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability
Parker Seegmiller
Joseph Gatto
S. Preum
VLM
37
0
0
20 Jun 2024
A Unified Framework for Combinatorial Optimization Based on Graph Neural Networks
Yaochu Jin
Xueming Yan
Shiqing Liu
Xiangyu Wang
51
3
0
19 Jun 2024
SAGDFN: A Scalable Adaptive Graph Diffusion Forecasting Network for Multivariate Time Series Forecasting
Yue Jiang
Xiucheng Li
Yile Chen
Shuai Liu
Weilong Kong
Antonis F. Lentzakis
Gao Cong
AI4TS
AI4CE
31
1
0
18 Jun 2024
GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting
Fan Zhou
Chen Pan
Lintao Ma
Yu Liu
James Zhang
...
Weitao Lin
Zi Zhuang
Wenxin Ning
Yunhua Hu
Siqiao Xue
AI4TS
32
1
0
18 Jun 2024
A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models
Haopeng Zhang
Philip S. Yu
Jiawei Zhang
37
17
0
17 Jun 2024
A Survey on Human Preference Learning for Large Language Models
Ruili Jiang
Kehai Chen
Xuefeng Bai
Zhixuan He
Juntao Li
Muyun Yang
Tiejun Zhao
Liqiang Nie
Min Zhang
49
8
0
17 Jun 2024
Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation
Boxuan Lyu
Hidetaka Kamigaito
Kotaro Funakoshi
Manabu Okumura
40
0
0
17 Jun 2024
SPEAR: Receiver-to-Receiver Acoustic Neural Warping Field
Yuhang He
Shitong Xu
Jia-Xing Zhong
Sangyun Shin
Niki Trigoni
Andrew Markham
38
0
0
16 Jun 2024
Reinforced Decoder: Towards Training Recurrent Neural Networks for Time Series Forecasting
Qi Sima
Xinze Zhang
Yukun Bao
Siyue Yang
Liang Shen
AI4TS
53
1
0
14 Jun 2024
Investigating the translation capabilities of Large Language Models trained on parallel data only
Javier García Gilabert
Carlos Escolano
Aleix Sant Savall
Francesca de Luca Fornaciari
Audrey Mash
Xixian Liao
Maite Melero
LRM
42
2
0
13 Jun 2024
Inverse Probability of Treatment Weighting with Deep Sequence Models Enables Accurate treatment effect Estimation from Electronic Health Records
Junghwan Lee
Simin Ma
N. Serban
Shihao Yang
CML
21
0
0
13 Jun 2024
Transformer-based Model for ASR N-Best Rescoring and Rewriting
Iwen E. Kang
Christophe Van Gysel
Man-Hung Siu
39
2
0
12 Jun 2024
Semi-Supervised Spoken Language Glossification
Huijie Yao
Wengang Zhou
Hao Zhou
Houqiang Li
34
0
0
12 Jun 2024
CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems
Qianli Wang
Tatiana Anikina
Nils Feldhus
Simon Ostermann
Sebastian Möller
24
2
0
12 Jun 2024
Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A Survey
Hao Yang
Yanyan Zhao
Yang Wu
Shilong Wang
Tian Zheng
Hongbo Zhang
Zongyang Ma
Wanxiang Che
Bing Qin
47
9
0
12 Jun 2024
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding
Trang Le
Daniel Lazar
Suyoun Kim
Shan Jiang
Duc Le
Adithya Sagar
Aleksandr Livshits
Ahmed Aly
Akshat Shrivastava
48
0
0
12 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Junnan Dong
Feiran Huang
Xiao Huang
77
53
0
12 Jun 2024
Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models
Shoutao Guo
Shaolei Zhang
Zhengrui Ma
Min Zhang
Yang Feng
LLMAG
33
1
0
11 Jun 2024
What Can We Learn from State Space Models for Machine Learning on Graphs?
Yinan Huang
Siqi Miao
Pan Li
47
7
0
09 Jun 2024
Recent advancements in computational morphology : A comprehensive survey
Jatayu Baxi
Brijesh S. Bhatt
AI4CE
43
1
0
08 Jun 2024
Optimizing Time Series Forecasting Architectures: A Hierarchical Neural Architecture Search Approach
Difan Deng
Marius Lindauer
AI4TS
53
0
0
07 Jun 2024
Semantically Diverse Language Generation for Uncertainty Estimation in Language Models
L. Aichberger
Kajetan Schweighofer
Mykyta Ielanskyi
Sepp Hochreiter
HILM
30
10
0
06 Jun 2024
Proactive Detection of Physical Inter-rule Vulnerabilities in IoT Services Using a Deep Learning Approach
Bing Huang
Chen Chen
K. Lam
Fuqun Huang
AAML
22
1
0
06 Jun 2024
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays
Bo Xia
Yilun Kong
Yongzhe Chang
Bo Yuan
Zhiheng Li
Xueqian Wang
Bin Liang
OffRL
50
3
0
05 Jun 2024
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Jinge Wu
Abul Hasan
Honghan Wu
31
0
0
05 Jun 2024
Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms
Firas Trabelsi
David Vilar
Mara Finkelstein
Markus Freitag
37
6
0
05 Jun 2024
Short-term Inland Vessel Trajectory Prediction with Encoder-Decoder Models
Kathrin Donandt
Karim Böttger
Dirk Söffker
24
8
0
04 Jun 2024
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Ruslan Svirschevski
Avner May
Zhuoming Chen
Beidi Chen
Zhihao Jia
Max Ryabinin
39
12
0
04 Jun 2024
UniOQA: A Unified Framework for Knowledge Graph Question Answering with Large Language Models
Zhuoyang Li
Liran Deng
Hui Liu
Qiaoqiao Liu
Junzhao Du
RALM
40
4
0
04 Jun 2024
A Global Geometric Analysis of Maximal Coding Rate Reduction
Peng Wang
Huikang Liu
Druv Pai
Yaodong Yu
Zhihui Zhu
Q. Qu
Yi Ma
39
6
0
04 Jun 2024
Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Saierdaer Yusuyin
Te Ma
Hao Huang
Wenbo Zhao
Zhijian Ou
52
2
0
04 Jun 2024
Sequence-to-Sequence Multi-Modal Speech In-Painting
Mahsa Kadkhodaei Elyaderani
S. Shirani
14
1
0
03 Jun 2024
Robust Multi-Modal Speech In-Painting: A Sequence-to-Sequence Approach
Mahsa Kadkhodaei Elyaderani
Shahram Shirani
43
0
0
02 Jun 2024
YODAS: Youtube-Oriented Dataset for Audio and Speech
Xinjian Li
Shinnosuke Takamichi
Takaaki Saeki
William Chen
Sayaka Shiota
Shinji Watanabe
42
17
0
02 Jun 2024
Formality Style Transfer in Persian
P. Falakaflaki
M. Shamsfard
13
1
0
02 Jun 2024
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
Khaled Alomar
Halil Ibrahim Aysel
Xiaohao Cai
MedIm
ViT
43
7
0
02 Jun 2024
Amalgam: A Framework for Obfuscated Neural Network Training on the Cloud
Sifat Ut Taki
Spyridon Mastorakis
FedML
34
1
0
02 Jun 2024
Recurrent neural networks: vanishing and exploding gradients are not the end of the story
Nicolas Zucchet
Antonio Orvieto
ODL
AAML
45
9
0
31 May 2024
WaveCastNet: An AI-enabled Wavefield Forecasting Framework for Earthquake Early Warning
Dongwei Lyu
R. Nakata
Pu Ren
Michael W. Mahoney
A. Pitarka
Nori Nakata
N. Benjamin Erichson
41
2
0
30 May 2024
Deep Learning Approaches for Detecting Adversarial Cyberbullying and Hate Speech in Social Networks
S. Azumah
Nelly Elsayed
Zag ElSayed
Murat Ozer
Amanda La Guardia
51
1
0
30 May 2024
Previous
1
2
3
...
5
6
7
...
96
97
98
Next