ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.0473
  4. Cited By
Neural Machine Translation by Jointly Learning to Align and Translate

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat
ArXivPDFHTML

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 5,946 papers shown
Title
Channel-Attentive Graph Neural Networks
Tuğrul Hasan Karabulut
İnci M. Baytaş
47
0
0
01 Mar 2025
Attend or Perish: Benchmarking Attention in Algorithmic Reasoning
Michal Spiegel
Michal Štefánik
Marek Kadlcík
Josef Kuchař
37
0
0
28 Feb 2025
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
Yifei Xia
Suhan Ling
Fangcheng Fu
Yijiao Wang
Huixia Li
Xuefeng Xiao
Bin Cui
VGen
65
2
0
28 Feb 2025
Learning to Substitute Components for Compositional Generalization
Learning to Substitute Components for Compositional Generalization
Zechao Li
Gangwei Jiang
Chenwang Wu
Ying Wei
Defu Lian
Enhong Chen
62
0
0
28 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
169
0
0
27 Feb 2025
Representing Signs as Signs: One-Shot ISLR to Facilitate Functional Sign Language Technologies
Representing Signs as Signs: One-Shot ISLR to Facilitate Functional Sign Language Technologies
Toon Vandendriessche
Mathieu De Coster
Annelies Lejon
J. Dambre
SLR
94
0
0
27 Feb 2025
Revisiting Kernel Attention with Correlated Gaussian Process Representation
Revisiting Kernel Attention with Correlated Gaussian Process Representation
Long Minh Bui
Tho Tran Huu
Duy-Tung Dinh
T. Nguyen
Trong Nghia Hoang
52
2
0
27 Feb 2025
Integrating Biological and Machine Intelligence: Attention Mechanisms in Brain-Computer Interfaces
Integrating Biological and Machine Intelligence: Attention Mechanisms in Brain-Computer Interfaces
J. Wang
Weishan Ye
Jialin He
Li Zhang
G. Huang
Zhuliang Yu
Zhen Liang
80
0
0
26 Feb 2025
Multiview graph dual-attention deep learning and contrastive learning for multi-criteria recommender systems
Multiview graph dual-attention deep learning and contrastive learning for multi-criteria recommender systems
Saman Forouzandeh
P. Krivitsky
Rohitash Chandra
60
0
0
26 Feb 2025
A HEART for the environment: Transformer-Based Spatiotemporal Modeling for Air Quality Prediction
A HEART for the environment: Transformer-Based Spatiotemporal Modeling for Air Quality Prediction
Norbert Bodendorfer
65
1
0
26 Feb 2025
Application of Attention Mechanism with Bidirectional Long Short-Term Memory (BiLSTM) and CNN for Human Conflict Detection using Computer Vision
Application of Attention Mechanism with Bidirectional Long Short-Term Memory (BiLSTM) and CNN for Human Conflict Detection using Computer Vision
Erick da Silva Farias
Eduardo Palhares Junior
62
0
0
25 Feb 2025
Self-Adjust Softmax
Self-Adjust Softmax
Chuanyang Zheng
Yihang Gao
Guoxuan Chen
Han Shi
Jing Xiong
Xiaozhe Ren
Chao Huang
Xin Jiang
Zhiyu Li
Yu Li
50
0
0
25 Feb 2025
Recurrent Neural Networks for Dynamic VWAP Execution: Adaptive Trading Strategies with Temporal Kolmogorov-Arnold Networks
Recurrent Neural Networks for Dynamic VWAP Execution: Adaptive Trading Strategies with Temporal Kolmogorov-Arnold Networks
Remi Genet
102
1
0
25 Feb 2025
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
Yangshijie Zhang
AAML
42
0
0
24 Feb 2025
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms
Feiyang Chen
Yu Cheng
Lei Wang
Yuqing Xia
Ziming Miao
...
Fan Yang
Jinbao Xue
Zhi Yang
M. Yang
H. Chen
81
1
0
24 Feb 2025
GraphFM: Graph Factorization Machines for Feature Interaction Modeling
GraphFM: Graph Factorization Machines for Feature Interaction Modeling
Shu Wu
Zekun Li
Yunyue Su
Zeyu Cui
Xiaoyu Zhang
Liang Wang
72
22
0
24 Feb 2025
Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models
Andrew DiGiugno
Ausif Mahmood
38
0
0
24 Feb 2025
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration
Xin Zhang
Liangxiu Han
Stephen White
Saad Hassan
Philip A Kalra
James Ritchie
Carl Diver
Jennie Shorley
82
1
0
24 Feb 2025
SR-LLM: Rethinking the Structured Representation in Large Language Model
SR-LLM: Rethinking the Structured Representation in Large Language Model
Jiahuan Zhang
Tianheng Wang
Hanqing Wu
Ziyi Huang
Yulong Wu
Dongbai Chen
Linfeng Song
Yue Zhang
Guozheng Rao
Kaicheng Yu
50
1
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM
3DV
64
2
0
21 Feb 2025
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images
Sheng-Yu Wang
Aaron Hertzmann
Alexei A. Efros
Jun-Yan Zhu
Richard Zhang
TDI
128
2
0
21 Feb 2025
Connecting the geometry and dynamics of many-body complex systems with message passing neural operators
N. Gabriel
N. Johnson
George Em Karniadakis
AI4CE
54
0
0
21 Feb 2025
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations
Yuan Chen
Abdul Khaliq
Khaled M. Furati
AI4CE
61
0
0
20 Feb 2025
Some Insights of Construction of Feature Graph to Learn Pairwise Feature Interactions with Graph Neural Networks
Some Insights of Construction of Feature Graph to Learn Pairwise Feature Interactions with Graph Neural Networks
Phaphontee Yamchote
Saw Nay Htet Win
Chainarong Amornbunchornvej
Thanapon Noraset
FAtt
79
0
0
20 Feb 2025
Hallucinations are inevitable but statistically negligible
Hallucinations are inevitable but statistically negligible
Atsushi Suzuki
Yulan He
Feng Tian
Zhongyuan Wang
HILM
49
0
0
15 Feb 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow
Behrooz Azarkhalili
Maxwell Libbrecht
39
0
0
14 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model
Theoretical Benefit and Limitation of Diffusion Language Model
Guhao Feng
Yihan Geng
Jian Guan
Wei Wu
Liwei Wang
Di He
DiffM
55
0
0
13 Feb 2025
A Deep Inverse-Mapping Model for a Flapping Robotic Wing
A Deep Inverse-Mapping Model for a Flapping Robotic Wing
Hadar Sharvit
Raz Karl
Tsevi Beatus
62
0
0
13 Feb 2025
Handwritten Text Recognition: A Survey
Handwritten Text Recognition: A Survey
Carlos Garrido-Munoz
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
106
0
0
12 Feb 2025
Enhanced Load Forecasting with GAT-LSTM: Leveraging Grid and Temporal Features
Enhanced Load Forecasting with GAT-LSTM: Leveraging Grid and Temporal Features
Ugochukwu Orji
Çiçek Güven
Dan Stowell
AI4TS
50
0
0
12 Feb 2025
Beyond Literal Token Overlap: Token Alignability for Multilinguality
Beyond Literal Token Overlap: Token Alignability for Multilinguality
Katharina Hämmerl
Tomasz Limisiewicz
Jindrich Libovický
Alexander Fraser
51
0
0
10 Feb 2025
A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions
Elisa Negrini
Yuxuan Liu
Liu Yang
Stanley Osher
Hayden Schaeffer
AI4CE
93
0
0
09 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
66
0
0
07 Feb 2025
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
Adam Stooke
Rohit Prabhavalkar
K. Sim
P. M. Mengibar
39
0
0
06 Feb 2025
A comparison of translation performance between DeepL and Supertext
A comparison of translation performance between DeepL and Supertext
Alex Flückiger
Chantal Amrhein
Tim Graf
Frédéric Odermatt
Martin Pömsl
Philippe Schläpfer
Florian Schottmann
Samuel Laubli
ELM
45
0
0
04 Feb 2025
Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation
Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation
George Whittle
Juliusz Ziomek
Jacob Rawling
Michael A. Osborne
97
2
0
04 Feb 2025
Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study
Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study
Anneketh Vij
Changhao Liu
Rahul Anil Nair
Theo Ho
Edward Shi
Ayan Bhowmick
53
1
0
04 Feb 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Rebecca Mobbs
Dimitrios Makris
Vasileios Argyriou
43
0
0
02 Feb 2025
Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures
Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures
Gabriel Lindenmaier
Sean Papay
Sebastian Padó
65
0
0
02 Feb 2025
PolarQuant: Leveraging Polar Transformation for Efficient Key Cache Quantization and Decoding Acceleration
PolarQuant: Leveraging Polar Transformation for Efficient Key Cache Quantization and Decoding Acceleration
Songhao Wu
Ang Lv
Xiao Feng
Wenjie Qu
Xun Zhang
Guojun Yin
Wei Lin
Rui Yan
MQ
57
0
0
01 Feb 2025
Efficient and Interpretable Neural Networks Using Complex Lehmer Transform
M. Ataei
Xiaogang Wang
36
0
0
28 Jan 2025
ZETA: Leveraging Z-order Curves for Efficient Top-k Attention
ZETA: Leveraging Z-order Curves for Efficient Top-k Attention
Qiuhao Zeng
Jerry Huang
Peng Lu
Gezheng Xu
Boxing Chen
Charles Ling
Boyu Wang
54
1
0
24 Jan 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
Duc Hau Nguyen
Duc Hau Nguyen
Pascale Sébillot
52
5
0
23 Jan 2025
Extend Adversarial Policy Against Neural Machine Translation via Unknown Token
Extend Adversarial Policy Against Neural Machine Translation via Unknown Token
Wei Zou
Shujian Huang
Jiajun Chen
AAML
73
0
0
21 Jan 2025
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation
Andreea Iana
Fabian David Schmidt
Goran Glavas
Heiko Paulheim
71
3
0
20 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
42
1
0
18 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
41
0
0
13 Jan 2025
TFLAG:Towards Practical APT Detection via Deviation-Aware Learning on Temporal Provenance Graph
TFLAG:Towards Practical APT Detection via Deviation-Aware Learning on Temporal Provenance Graph
Wenhan Jiang
Tingting Chai
Hongri Liu
Kai Wang
Hongke Zhang
41
0
0
13 Jan 2025
Iconicity in Large Language Models
Iconicity in Large Language Models
Anna Marklová
Jiří Milička
Leonid Ryvkin
Ľudmila Lacková Bennet
Libuše Kormaníková
40
0
0
10 Jan 2025
On Creating A Brain-To-Text Decoder
On Creating A Brain-To-Text Decoder
Zenon Lamprou
Yashar Moshfeghi
38
0
0
10 Jan 2025
Previous
123456...117118119
Next