Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1705.03122
Cited By
v1
v2
v3 (latest)
Convolutional Sequence to Sequence Learning
8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Convolutional Sequence to Sequence Learning"
50 / 1,343 papers shown
Traffic Prediction using Artificial Intelligence: Review of Recent Advances and Emerging Opportunities
Transportation Research Part C: Emerging Technologies (TRC), 2022
Maryam Shaygan
Collin Meese
Wanxin Li
Xiaoliang (George) Zhao
Mark M. Nejad
267
174
0
31 May 2023
Neural Machine Translation with Dynamic Graph Convolutional Decoder
Lei Li
Kai Fan
Ling Yang
Hongjian Li
Chun Yuan
152
5
0
28 May 2023
Randomized Positional Encodings Boost Length Generalization of Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Anian Ruoss
Grégoire Delétang
Tim Genewein
Jordi Grau-Moya
Róbert Csordás
Mehdi Abbana Bennani
Shane Legg
J. Veness
LLMAG
236
128
0
26 May 2023
Neural Machine Translation for Mathematical Formulae
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Felix Petersen
M. Schubotz
André Greiner-Petter
Bela Gipp
195
10
0
25 May 2023
Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Markus Freitag
Behrooz Ghorbani
Patrick Fernandes
210
57
0
17 May 2023
Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Songming Zhang
Yunlong Liang
Shuaibo Wang
Wenjuan Han
Jian Liu
Jinan Xu
Jinan Xu
317
14
0
14 May 2023
Tomography of Quantum States from Structured Measurements via quantum-aware transformer
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2023
Hailan Ma
Zhenhong Sun
Daoyi Dong
Chunlin Chen
H. Rabitz
407
10
0
09 May 2023
LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Peng Lu
Ahmad Rashid
I. Kobyzev
Mehdi Rezagholizadeh
Philippe Langlais
147
0
0
08 May 2023
Backdoor Learning on Sequence to Sequence Models
Lichang Chen
Minhao Cheng
Heng-Chiao Huang
SILM
208
19
0
03 May 2023
Technical Report: Impact of Position Bias on Language Models in Token Classification
Mehdi Ben Amor
Michael Granitzer
Jelena Mitrović
367
3
0
26 Apr 2023
Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies
Neural Information Processing Systems (NeurIPS), 2023
Wei Fang
Zhaofei Yu
Zhaokun Zhou
Ding Chen
Yanqing Chen
Zhengyu Ma
T. Masquelier
Yonghong Tian
343
70
0
25 Apr 2023
Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading
Shuwen Deng
D. R. Reich
Paul Prasse
Patrick Haller
Tobias Scheffer
Lena A. Jäger
216
25
0
21 Apr 2023
Reference-guided Controllable Inpainting of Neural Radiance Fields
IEEE International Conference on Computer Vision (ICCV), 2023
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Marcus A. Brubaker
J. Kelly
Alex Levinshtein
Konstantinos G. Derpanis
Igor Gilitschenski
344
43
0
19 Apr 2023
Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering
Rishov Paul
Md. Mohib Hossain
Mohammed Latif Siddiq
Masum Hasan
Anindya Iqbal
Joanna C. S. Santos
KELM
228
16
0
16 Apr 2023
A Comprehensive Evaluation of Neural SPARQL Query Generation from Natural Language Questions
IEEE Access (IEEE Access), 2023
Papa Abdou Karim Karou Diallo
Samuel Reyd
Payel Das
235
16
0
16 Apr 2023
TransDocs: Optical Character Recognition with word to word translation
Abhishek Bamotra
P. Uppala
85
3
0
15 Apr 2023
Masked Pre-Training of Transformers for Histology Image Analysis
Journal of Pathology Informatics (J Pathol Inform), 2023
Shuai Jiang
Liesbeth Hondelink
A. Suriawinata
Saeed Hassanpour
MedIm
135
23
0
14 Apr 2023
Best Practices for 2-Body Pose Forecasting
Muhammad Rameez Ur Rahman
Luca Scofano
Edoardo De Matteis
Alessandro Flaborea
Alessio Sampieri
Fabio Galasso
189
13
0
12 Apr 2023
Dynamic Graph Representation Learning with Neural Networks: A Survey
IEEE Access (IEEE Access), 2023
Leshanshui Yang
Sébastien Adam
Clément Chatelain
AI4TS
AI4CE
187
31
0
12 Apr 2023
Multi-Graph Convolution Network for Pose Forecasting
Hongwei Ren
Yuhong Shi
Kewei Liang
3DH
166
1
0
11 Apr 2023
HyperINR: A Fast and Predictive Hypernetwork for Implicit Neural Representations via Knowledge Distillation
Qi Wu
David Bauer
Yuyang Chen
Kwan-Liu Ma
213
21
0
09 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
167
61
0
08 Apr 2023
Semi-supervised Neural Machine Translation with Consistency Regularization for Low-Resource Languages
Viet H. Pham
Thang M. Pham
Giang Nguyen
Long H. B. Nguyen
D. Dinh
68
1
0
02 Apr 2023
Analysis and Comparison of Two-Level KFAC Methods for Training Deep Neural Networks
Abdoulaye Koroko
A. Anciaux-Sedrakian
I. B. Gharbia
Valérie Garès
M. Haddou
Quang-Huy Tran
251
0
0
31 Mar 2023
Backdoor Attacks with Input-unique Triggers in NLP
Xukun Zhou
Jiwei Li
Tianwei Zhang
Lingjuan Lyu
Muqiao Yang
Jun He
SILM
AAML
166
11
0
25 Mar 2023
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
Yihao Zhao
Xin Liu
Shufan Liu
Xiang Li
Yibo Zhu
Gang Huang
Xuanzhe Liu
Xin Jin
251
18
0
24 Mar 2023
Integrating Image Features with Convolutional Sequence-to-sequence Network for Multilingual Visual Question Answering
Journal of Computer Science and Cybernetics (JCSC), 2023
T. M. Thai
Son T. Luu
223
0
0
22 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
303
199
0
21 Mar 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
448
70
0
21 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
111
40
0
17 Mar 2023
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Yali Wang
Shiyang Feng
Jiaming Song
Jianbo Shi
3DPC
224
80
0
14 Mar 2023
Learning Transductions and Alignments with RNN Seq2seq Models
International Conference on Graphics and Interaction (GI), 2023
Zhengxiang Wang
286
0
0
13 Mar 2023
Convex Bounds on the Softmax Function with Applications to Robustness Verification
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Dennis L. Wei
Haoze Wu
Min Wu
Pin-Yu Chen
Clark W. Barrett
E. Farchi
UQCV
AAML
106
12
0
03 Mar 2023
Leveraging Large Text Corpora for End-to-End Speech Summarization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Kohei Matsuura
Takanori Ashihara
Takafumi Moriya
Tomohiro Tanaka
A. Ogawa
Marc Delcroix
Ryo Masumura
151
17
0
02 Mar 2023
Variance-reduced Clipping for Non-convex Optimization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Amirhossein Reisizadeh
Haochuan Li
Subhro Das
Ali Jadbabaie
358
34
0
02 Mar 2023
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data
International Conference on Learning Representations (ICLR), 2023
M. Crawshaw
Yajie Bao
Mingrui Liu
FedML
201
10
0
14 Feb 2023
Enhancing Multivariate Time Series Classifiers through Self-Attention and Relative Positioning Infusion
IEEE Access (IEEE Access), 2023
Mehryar Abbasi
Parvaneh Saeedi
AI4TS
227
9
0
13 Feb 2023
Protecting Language Generation Models via Invisible Watermarking
International Conference on Machine Learning (ICML), 2023
Xuandong Zhao
Yu-Xiang Wang
Lei Li
WaLM
372
109
0
06 Feb 2023
KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair
International Conference on Software Engineering (ICSE), 2023
Nan Jiang
Thibaud Lutellier
Xin Peng
Lin Tan
Dan Goldwasser
Xinming Zhang
290
52
0
03 Feb 2023
Learning the Dynamics of Sparsely Observed Interacting Systems
International Conference on Machine Learning (ICML), 2023
Linus Bleistein
Adeline Fermanian
A. Jannot
Agathe Guilloux
360
5
0
27 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
232
3
0
26 Jan 2023
PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning
Hawaii International Conference on System Sciences (HICSS), 2023
Thorsten Wittkopp
Dominik Scheinert
Philipp Wiesner
Alexander Acker
O. Kao
AI4TS
160
6
0
25 Jan 2023
Variation-Aware Semantic Image Synthesis
Image and Vision Computing (IVC), 2023
Mingle Xu
Jaehwan Lee
Sook Yoon
Hyongsuk Kim
D. Park
207
4
0
25 Jan 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Vilém Zouhar
Shehzaad Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
331
11
0
21 Jan 2023
HanoiT: Enhancing Context-aware Translation via Selective Context
International Conference on Database Systems for Advanced Applications (DASFAA), 2023
Jian Yang
Yuwei Yin
Shuming Ma
Liqun Yang
Hongcheng Guo
Haoyang Huang
Dongdong Zhang
Yutao Zeng
Zhoujun Li
Furu Wei
205
7
0
17 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
340
63
0
29 Dec 2022
A Survey of Deep Learning for Mathematical Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM
LRM
289
179
0
20 Dec 2022
EIT: Enhanced Interactive Transformer
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Tong Zheng
Bei Li
Huiwen Bao
Tong Xiao
Jingbo Zhu
289
3
0
20 Dec 2022
Graph Learning and Its Advancements on Large Language Models: A Holistic Survey
Shaopeng Wei
Yu Zhao
Xingyan Chen
Qing Li
Fuzhen Zhuang
Ji Liu
Fuji Ren
Gang Kou
AI4CE
422
6
0
17 Dec 2022
Spatial-temporal traffic modeling with a fusion graph reconstructed by tensor decomposition
Qin Li
Xu Yang
Yong Wang
Yuankai Wu
Deqiang He
180
20
0
12 Dec 2022
Previous
1
2
3
4
5
...
25
26
27
Next