Convolutional Sequence to Sequence Learning

8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
    AIMat
arXiv: 1705.03122

Papers citing "Convolutional Sequence to Sequence Learning"

50 / 1,343 papers shown
Traffic Prediction using Artificial Intelligence: Review of Recent Advances and Emerging Opportunities
Transportation Research Part C: Emerging Technologies (TRC), 2022
Maryam Shaygan
Collin Meese
Wanxin Li
Xiaoliang (George) Zhao
Mark M. Nejad
267
174
0
31 May 2023
Neural Machine Translation with Dynamic Graph Convolutional Decoder
Lei Li
Kai Fan
Ling Yang
Hongjian Li
Chun Yuan
152
5
0
28 May 2023
Randomized Positional Encodings Boost Length Generalization of Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Anian Ruoss
Grégoire Delétang
Tim Genewein
Jordi Grau-Moya
Róbert Csordás
Mehdi Abbana Bennani
Shane Legg
J. Veness
LLMAG
236
128
0
26 May 2023
Neural Machine Translation for Mathematical Formulae
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Felix Petersen
M. Schubotz
André Greiner-Petter
Bela Gipp
195
10
0
25 May 2023
Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Markus Freitag
Behrooz Ghorbani
Patrick Fernandes
210
57
0
17 May 2023
Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Songming Zhang
Yunlong Liang
Shuaibo Wang
Wenjuan Han
Jian Liu
Jinan Xu
317
14
0
14 May 2023
Tomography of Quantum States from Structured Measurements via quantum-aware transformer
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2023
Hailan Ma
Zhenhong Sun
Daoyi Dong
Chunlin Chen
H. Rabitz
407
10
0
09 May 2023
LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Peng Lu
Ahmad Rashid
I. Kobyzev
Mehdi Rezagholizadeh
Philippe Langlais
147
0
0
08 May 2023
Backdoor Learning on Sequence to Sequence Models
Lichang Chen
Minhao Cheng
Heng-Chiao Huang
SILM
208
19
0
03 May 2023
Technical Report: Impact of Position Bias on Language Models in Token Classification
Mehdi Ben Amor
Michael Granitzer
Jelena Mitrović
367
3
0
26 Apr 2023
Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies
Neural Information Processing Systems (NeurIPS), 2023
Wei Fang
Zhaofei Yu
Zhaokun Zhou
Ding Chen
Yanqing Chen
Zhengyu Ma
T. Masquelier
Yonghong Tian
343
70
0
25 Apr 2023
Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading
Shuwen Deng
D. R. Reich
Paul Prasse
Patrick Haller
Tobias Scheffer
Lena A. Jäger
216
25
0
21 Apr 2023
Reference-guided Controllable Inpainting of Neural Radiance Fields
IEEE International Conference on Computer Vision (ICCV), 2023
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Marcus A. Brubaker
J. Kelly
Alex Levinshtein
Konstantinos G. Derpanis
Igor Gilitschenski
344
43
0
19 Apr 2023
Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering
Rishov Paul
Md. Mohib Hossain
Mohammed Latif Siddiq
Masum Hasan
Anindya Iqbal
Joanna C. S. Santos
KELM
228
16
0
16 Apr 2023
A Comprehensive Evaluation of Neural SPARQL Query Generation from Natural Language Questions
IEEE Access, 2023
Papa Abdou Karim Karou Diallo
Samuel Reyd
Payel Das
235
16
0
16 Apr 2023
TransDocs: Optical Character Recognition with word to word translation
Abhishek Bamotra
P. Uppala
85
3
0
15 Apr 2023
Masked Pre-Training of Transformers for Histology Image Analysis
Journal of Pathology Informatics (J Pathol Inform), 2023
Shuai Jiang
Liesbeth Hondelink
A. Suriawinata
Saeed Hassanpour
MedIm
135
23
0
14 Apr 2023
Best Practices for 2-Body Pose Forecasting
Muhammad Rameez Ur Rahman
Luca Scofano
Edoardo De Matteis
Alessandro Flaborea
Alessio Sampieri
Fabio Galasso
189
13
0
12 Apr 2023
Dynamic Graph Representation Learning with Neural Networks: A Survey
IEEE Access, 2023
Leshanshui Yang
Sébastien Adam
Clément Chatelain
AI4TS, AI4CE
187
31
0
12 Apr 2023
Multi-Graph Convolution Network for Pose Forecasting
Hongwei Ren
Yuhong Shi
Kewei Liang
3DH
166
1
0
11 Apr 2023
HyperINR: A Fast and Predictive Hypernetwork for Implicit Neural Representations via Knowledge Distillation
Qi Wu
David Bauer
Yuyang Chen
Kwan-Liu Ma
213
21
0
09 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
167
61
0
08 Apr 2023
Semi-supervised Neural Machine Translation with Consistency Regularization for Low-Resource Languages
Viet H. Pham
Thang M. Pham
Giang Nguyen
Long H. B. Nguyen
D. Dinh
68
1
0
02 Apr 2023
Analysis and Comparison of Two-Level KFAC Methods for Training Deep Neural Networks
Abdoulaye Koroko
A. Anciaux-Sedrakian
I. B. Gharbia
Valérie Garès
M. Haddou
Quang-Huy Tran
251
0
0
31 Mar 2023
Backdoor Attacks with Input-unique Triggers in NLP
Xukun Zhou
Jiwei Li
Tianwei Zhang
Lingjuan Lyu
Muqiao Yang
Jun He
SILM, AAML
166
11
0
25 Mar 2023
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
Yihao Zhao
Xin Liu
Shufan Liu
Xiang Li
Yibo Zhu
Gang Huang
Xuanzhe Liu
Xin Jin
251
18
0
24 Mar 2023
Integrating Image Features with Convolutional Sequence-to-sequence Network for Multilingual Visual Question Answering
Journal of Computer Science and Cybernetics (JCSC), 2023
T. M. Thai
Son T. Luu
223
0
0
22 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
303
199
0
21 Mar 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
448
70
0
21 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
111
40
0
17 Mar 2023
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Yali Wang
Shiyang Feng
Jiaming Song
Jianbo Shi
3DPC
224
80
0
14 Mar 2023
Learning Transductions and Alignments with RNN Seq2seq Models
International Conference on Graphics and Interaction (GI), 2023
Zhengxiang Wang
286
0
0
13 Mar 2023
Convex Bounds on the Softmax Function with Applications to Robustness Verification
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Dennis L. Wei
Haoze Wu
Min Wu
Pin-Yu Chen
Clark W. Barrett
E. Farchi
UQCV, AAML
106
12
0
03 Mar 2023
Leveraging Large Text Corpora for End-to-End Speech Summarization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Kohei Matsuura
Takanori Ashihara
Takafumi Moriya
Tomohiro Tanaka
A. Ogawa
Marc Delcroix
Ryo Masumura
151
17
0
02 Mar 2023
Variance-reduced Clipping for Non-convex Optimization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Amirhossein Reisizadeh
Haochuan Li
Subhro Das
Ali Jadbabaie
358
34
0
02 Mar 2023
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data
International Conference on Learning Representations (ICLR), 2023
M. Crawshaw
Yajie Bao
Mingrui Liu
FedML
201
10
0
14 Feb 2023
Enhancing Multivariate Time Series Classifiers through Self-Attention and Relative Positioning Infusion
IEEE Access, 2023
Mehryar Abbasi
Parvaneh Saeedi
AI4TS
227
9
0
13 Feb 2023
Protecting Language Generation Models via Invisible Watermarking
International Conference on Machine Learning (ICML), 2023
Xuandong Zhao
Yu-Xiang Wang
Lei Li
WaLM
372
109
0
06 Feb 2023
KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair
International Conference on Software Engineering (ICSE), 2023
Nan Jiang
Thibaud Lutellier
Xin Peng
Lin Tan
Dan Goldwasser
Xinming Zhang
290
52
0
03 Feb 2023
Learning the Dynamics of Sparsely Observed Interacting Systems
International Conference on Machine Learning (ICML), 2023
Linus Bleistein
Adeline Fermanian
A. Jannot
Agathe Guilloux
360
5
0
27 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
232
3
0
26 Jan 2023
PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning
Hawaii International Conference on System Sciences (HICSS), 2023
Thorsten Wittkopp
Dominik Scheinert
Philipp Wiesner
Alexander Acker
O. Kao
AI4TS
160
6
0
25 Jan 2023
Variation-Aware Semantic Image Synthesis
Image and Vision Computing (IVC), 2023
Mingle Xu
Jaehwan Lee
Sook Yoon
Hyongsuk Kim
D. Park
207
4
0
25 Jan 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Vilém Zouhar
Shehzaad Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
331
11
0
21 Jan 2023
HanoiT: Enhancing Context-aware Translation via Selective Context
International Conference on Database Systems for Advanced Applications (DASFAA), 2023
Jian Yang
Yuwei Yin
Shuming Ma
Liqun Yang
Hongcheng Guo
Haoyang Huang
Dongdong Zhang
Yutao Zeng
Zhoujun Li
Furu Wei
205
7
0
17 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
340
63
0
29 Dec 2022
A Survey of Deep Learning for Mathematical Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM, LRM
289
179
0
20 Dec 2022
EIT: Enhanced Interactive Transformer
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Tong Zheng
Bei Li
Huiwen Bao
Tong Xiao
Jingbo Zhu
289
3
0
20 Dec 2022
Graph Learning and Its Advancements on Large Language Models: A Holistic Survey
Shaopeng Wei
Yu Zhao
Xingyan Chen
Qing Li
Fuzhen Zhuang
Ji Liu
Fuji Ren
Gang Kou
AI4CE
422
6
0
17 Dec 2022
Spatial-temporal traffic modeling with a fusion graph reconstructed by tensor decomposition
Qin Li
Xu Yang
Yong Wang
Yuankai Wu
Deqiang He
180
20
0
12 Dec 2022