1409.3215
Sequence to Sequence Learning with Neural Networks
Neural Information Processing Systems (NeurIPS), 2014
10 September 2014
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
Papers citing "Sequence to Sequence Learning with Neural Networks"
50 / 6,867 papers shown
Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation
Baban Gain
Dibyanayan Bandyopadhyay
Asif Ekbal
Trilok Nath Singh
LM&MA
386
8
0
02 Apr 2025
Conditional Temporal Neural Processes with Covariance Loss
International Conference on Machine Learning (ICML), 2025
Boseon Yoo
Jiwoo Lee
Janghoon Ju
Seijun Chung
Soyeon Kim
Jaesik Choi
283
18
0
01 Apr 2025
VNJPTranslate: A comprehensive pipeline for Vietnamese-Japanese translation
Hoang Hai Phan
Nguyen Duc Minh Vu
Nam Dang Phuong
217
0
0
01 Apr 2025
Artificial Intelligence and Deep Learning Algorithms for Epigenetic Sequence Analysis: A Review for Epigeneticists and AI Experts
Muhammad Tahir
Mahboobeh Norouzi
Shehroz S. Khan
James Davie
Soichiro Yamanaka
A. Ashraf
325
15
0
01 Apr 2025
A Theory of Machine Understanding via the Minimum Description Length Principle
Canlin Zhang
Xiuwen Liu
357
0
0
01 Apr 2025
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs
Computer Vision and Pattern Recognition (CVPR), 2025
Lucas Ventura
Antoine Yang
Cordelia Schmid
Gül Varol
270
4
0
31 Mar 2025
FRASE: Structured Representations for Generalizable SPARQL Query Generation
Papa Abdou Karim Karou Diallo
Payel Das
170
0
0
28 Mar 2025
Domain Specific Question to SQL Conversion with Embedded Data Balancing Technique
Jyothi
T. Satyanarayana Murthy
238
0
0
28 Mar 2025
Probabilistic Functional Neural Networks
Haixu Wang
Jiguo Cao
AI4TS
179
0
0
27 Mar 2025
Towards Generalizable Forgery Detection and Reasoning
Y. Gao
Dongliang Chang
Bingyao Yu
Haotian Qin
Muxi Diao
Lei Chen
Kongming Liang
Zhanyu Ma
360
2
0
27 Mar 2025
Autoregressive Language Models for Knowledge Base Population: A case study in the space mission domain
Andrés García-Silva
José Manuél Gómez-Pérez
202
0
0
24 Mar 2025
Temporal Encoding Strategies for Energy Time Series Prediction
Aayam Bansal
Keertan Balaji
Zeus Lalani
AI4TS
205
8
0
19 Mar 2025
Toward Large-Scale Distributed Quantum Long Short-Term Memory with Modular Quantum Computers
International Conference on Wireless Communications and Mobile Computing (IWCMC), 2025
Kuan-Cheng Chen
Samuel Yen-Chi Chen
Chen-Yu Liu
Kin K Leung
261
12
0
18 Mar 2025
Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency
Jiangxuan Long
Zhao Song
Chiwun Yang
AI4TS
939
2
0
18 Mar 2025
AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation
Findings (Findings), 2025
Wuwei Huang
Dexin Wang
Deyi Xiong
340
4
0
18 Mar 2025
DDPM-Polycube: A Denoising Diffusion Probabilistic Model for Polycube-Based Hexahedral Mesh Generation and Volumetric Spline Construction
Yuxuan Yu
Yuzhuo Fang
Hua Tong
J. Liu
Y. Zhang
215
1
0
16 Mar 2025
Bridging Language Models and Financial Analysis
Alejandro Lopez-Lira
Jihoon Kwon
Sangwoon Yoon
Jy-yong Sohn
Chanyeol Choi
AIFin
291
4
0
14 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
Ivan Kartáč
Mateusz Lango
Ondrej Dusek
ELM
364
5
0
14 Mar 2025
Deep Learning for Time Series Forecasting: A Survey
International Journal of Machine Learning and Cybernetics (IJMLC), 2025
X. Kong
Zhenghao Chen
Weiyao Liu
Kaili Ning
Lechao Zhang
Syauqie Muhammad Marier
Yichen Liu
Yuhao Chen
Xiwei Xu
AI4TS
AI4CE
337
34
0
13 Mar 2025
Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs
Chongjun Tu
Peng Ye
Dongzhan Zhou
Wenlong Zhang
Gang Yu
Tao Chen
Wanli Ouyang
288
7
0
13 Mar 2025
Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding
Shunqi Mao
Chaoyi Zhang
Weidong Cai
MLLM
1.1K
4
0
13 Mar 2025
Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning
Wenyi Lian
Patrick Micke
Natasa Sladoje
326
2
0
12 Mar 2025
A Deep-Learning Iterative Stacked Approach for Prediction of Reactive Dissolution in Porous Media
Marcos Cirne
Hannah Menke
A. Abdellatif
Julien Maes
Florian Doster
A. Elsheikh
AI4CE
193
0
0
11 Mar 2025
MinGRU-Based Encoder for Turbo Autoencoder Frameworks
Rick Fritschek
Rafael F. Schaefer
297
0
0
11 Mar 2025
LATMOS: Latent Automaton Task Model from Observation Sequences
Weixiao Zhan
Qiyue Dong
Eduardo Sebastián
Nikolay Atanasov
356
1
0
11 Mar 2025
Stick to Facts: Towards Fidelity-oriented Product Description Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Zhangming Chan
Preslav Nakov
Yongliang Wang
Jia-Nan Li
Qing Cui
Kun Gai
Dongyan Zhao
Rui Yan
324
25
0
11 Mar 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingfeng Luo
Tong Zheng
Yongyu Mu
Yangqiu Song
Qinghong Zhang
...
Ziqiang Xu
Peinan Feng
Xiaoqian Liu
Tong Xiao
Jingbo Zhu
AI4CE
1.1K
9
0
09 Mar 2025
Malware Detection at the Edge with Lightweight LLMs: A Performance Evaluation
ACM Transactions on Internet Technology (TOIT), 2025
Christian Rondanini
B. Carminati
E. Ferrari
Antonio Gaudiano
Ashish Kundu
247
6
0
06 Mar 2025
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Computer Vision and Pattern Recognition (CVPR), 2025
Rui Zhao
Weijia Mao
Mike Zheng Shou
305
4
0
05 Mar 2025
Deep Causal Behavioral Policy Learning: Applications to Healthcare
Jonas Knecht
Anna Zink
Jonathan Kolstad
Maya Petersen
CML
265
0
0
05 Mar 2025
BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Zhengyang Ji
Shang Gao
Li Liu
Yifan Jia
Yutao Yue
218
1
0
04 Mar 2025
Fast 3D point clouds retrieval for Large-scale 3D Place Recognition
Chahine-Nicolas Zede
Laurent Carrafa
Valérie Gouet-Brunet
3DPC
443
0
0
28 Feb 2025
Learning to Substitute Components for Compositional Generalization
Hao Sun
Gangwei Jiang
Chenwang Wu
Ying Wei
Defu Lian
Tong Xu
316
0
0
28 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Conference on Machine Translation (WMT), 2025
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
890
8
0
27 Feb 2025
Chemical knowledge-informed framework for privacy-aware retrosynthesis learning
Nature Communications (Nat Commun), 2025
Guikun Chen
Xu Zhang
Yue Yang
Yong Liu
Yi Yang
Wenguan Wang
293
0
0
26 Feb 2025
Introduction to Sequence Modeling with Transformers
Joni-Kristian Kämäräinen
177
2
0
26 Feb 2025
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models
Yuxuan Zhang
CLL
ALM
480
1
0
25 Feb 2025
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
Yangshijie Zhang
AAML
163
5
0
24 Feb 2025
Forecasting Local Ionospheric Parameters Using Transformers
D. J. Alford-Lago
C. Curtis
Alexander T. Ihler
Katherine A. Zawdie
Douglas P. Drob
202
0
0
24 Feb 2025
CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mst. Fahmida Sultana Naznin
Adnan Ibney Faruq
Mostafa Rifat Tazwar
Md Jobayer
Md. Mehedi Hasan Shawon
Md Rakibul Hasan
MedIm
244
0
0
21 Feb 2025
Smaller But Better: Unifying Layout Generation with Smaller Large Language Models
International Journal of Computer Vision (IJCV), 2025
Peirong Zhang
Jiaxin Zhang
Jiahuan Cao
Hongliang Li
Lianwen Jin
140
3
0
21 Feb 2025
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations
Yuan Chen
Abdul Khaliq
Khaled M. Furati
AI4CE
308
2
0
20 Feb 2025
Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Caihua Liu
Xu Li
Wenjing Xue
Wei Tang
Xia Feng
203
0
0
20 Feb 2025
Language Models Can Predict Their Own Behavior
Dhananjay Ashok
Jonathan May
AI4TS
ReLM
LRM
426
5
0
18 Feb 2025
Investigating Inference-time Scaling for Chain of Multi-modal Thought: A Preliminary Study
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yujie Lin
Ante Wang
Moye Chen
Jingyao Liu
Hao Liu
Jinsong Su
Xinyan Xiao
LRM
338
11
0
17 Feb 2025
A Robust Attack: Displacement Backdoor Attack
Yong Li
Han Gao
AAML
246
0
0
14 Feb 2025
Spatiotemporal Graph Neural Networks in short term load forecasting: Does adding Graph Structure in Consumption Data Improve Predictions?
Quoc Viet Nguyen
Joaquín Delgado Fernández
Sergio Potenciano Menci
AI4TS
256
1
0
14 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model
Guhao Feng
Yihan Geng
Jian Guan
Wei Wu
Liwei Wang
Di He
DiffM
374
1
0
13 Feb 2025
Enhancing LLM Character-Level Manipulation via Divide and Conquer
Zhen Xiong
Yujun Cai
Bryan Hooi
Nanyun Peng
Kai-Wei Chang
Zhecheng Li
386
0
0
12 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Zhihui Guo
Zhiming Liu
Qianli Ren
Yuexian Zou
437
9
0
10 Feb 2025