Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 5,946 papers shown

Title
Channel-Attentive Graph Neural Networks Tuğrul Hasan Karabulut İnci M. Baytaş 47 0 0 01 Mar 2025
Attend or Perish: Benchmarking Attention in Algorithmic Reasoning Michal Spiegel Michal Štefánik Marek Kadlcík Josef Kuchař 37 0 0 28 Feb 2025
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation Yifei Xia Suhan Ling Fangcheng Fu Yijiao Wang Huixia Li Xuefeng Xiao Bin Cui VGen 65 2 0 28 Feb 2025
Learning to Substitute Components for Compositional Generalization Zechao Li Gangwei Jiang Chenwang Wu Ying Wei Defu Lian Enhong Chen 62 0 0 28 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation Shaharukh Khan Ayush Tarun Ali Faraz Palash Kamble Vivek Dahiya Praveen Kumar Pokala Ashish Kulkarni Chandra Khatri Abhinav Ravi Shubham Agarwal 169 0 0 27 Feb 2025
Representing Signs as Signs: One-Shot ISLR to Facilitate Functional Sign Language Technologies Toon Vandendriessche Mathieu De Coster Annelies Lejon J. Dambre SLR 94 0 0 27 Feb 2025
Revisiting Kernel Attention with Correlated Gaussian Process Representation Long Minh Bui Tho Tran Huu Duy-Tung Dinh T. Nguyen Trong Nghia Hoang 52 2 0 27 Feb 2025
Integrating Biological and Machine Intelligence: Attention Mechanisms in Brain-Computer Interfaces J. Wang Weishan Ye Jialin He Li Zhang G. Huang Zhuliang Yu Zhen Liang 80 0 0 26 Feb 2025
Multiview graph dual-attention deep learning and contrastive learning for multi-criteria recommender systems Saman Forouzandeh P. Krivitsky Rohitash Chandra 60 0 0 26 Feb 2025
A HEART for the environment: Transformer-Based Spatiotemporal Modeling for Air Quality Prediction Norbert Bodendorfer 65 1 0 26 Feb 2025
Application of Attention Mechanism with Bidirectional Long Short-Term Memory (BiLSTM) and CNN for Human Conflict Detection using Computer Vision Erick da Silva Farias Eduardo Palhares Junior 62 0 0 25 Feb 2025
Self-Adjust Softmax Chuanyang Zheng Yihang Gao Guoxuan Chen Han Shi Jing Xiong Xiaozhe Ren Chao Huang Xin Jiang Zhiyu Li Yu Li 50 0 0 25 Feb 2025
Recurrent Neural Networks for Dynamic VWAP Execution: Adaptive Trading Strategies with Temporal Kolmogorov-Arnold Networks Remi Genet 102 1 0 25 Feb 2025
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences Yangshijie Zhang AAML 42 0 0 24 Feb 2025
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms Feiyang Chen Yu Cheng Lei Wang Yuqing Xia Ziming Miao ... Fan Yang Jinbao Xue Zhi Yang M. Yang H. Chen 81 1 0 24 Feb 2025
GraphFM: Graph Factorization Machines for Feature Interaction Modeling Shu Wu Zekun Li Yunyue Su Zeyu Cui Xiaoyu Zhang Liang Wang 72 22 0 24 Feb 2025
Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models Andrew DiGiugno Ausif Mahmood 38 0 0 24 Feb 2025
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration Xin Zhang Liangxiu Han Stephen White Saad Hassan Philip A Kalra James Ritchie Carl Diver Jennie Shorley 82 1 0 24 Feb 2025
SR-LLM: Rethinking the Structured Representation in Large Language Model Jiahuan Zhang Tianheng Wang Hanqing Wu Ziyi Huang Yulong Wu Dongbai Chen Linfeng Song Yue Zhang Guozheng Rao Kaicheng Yu 50 1 0 21 Feb 2025
A Survey of Model Architectures in Information Retrieval Zhichao Xu Fengran Mo Zhiqi Huang Crystina Zhang Puxuan Yu Bei Wang Jimmy J. Lin Vivek Srikumar KELM 3DV 64 2 0 21 Feb 2025
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images Sheng-Yu Wang Aaron Hertzmann Alexei A. Efros Jun-Yan Zhu Richard Zhang TDI 128 2 0 21 Feb 2025
Connecting the geometry and dynamics of many-body complex systems with message passing neural operators N. Gabriel N. Johnson George Em Karniadakis AI4CE 54 0 0 21 Feb 2025
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations Yuan Chen Abdul Khaliq Khaled M. Furati AI4CE 61 0 0 20 Feb 2025
Some Insights of Construction of Feature Graph to Learn Pairwise Feature Interactions with Graph Neural Networks Phaphontee Yamchote Saw Nay Htet Win Chainarong Amornbunchornvej Thanapon Noraset FAtt 79 0 0 20 Feb 2025
Hallucinations are inevitable but statistically negligible Atsushi Suzuki Yulan He Feng Tian Zhongyuan Wang HILM 49 0 0 15 Feb 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow Behrooz Azarkhalili Maxwell Libbrecht 39 0 0 14 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model Guhao Feng Yihan Geng Jian Guan Wei Wu Liwei Wang Di He DiffM 55 0 0 13 Feb 2025
A Deep Inverse-Mapping Model for a Flapping Robotic Wing Hadar Sharvit Raz Karl Tsevi Beatus 62 0 0 13 Feb 2025
Handwritten Text Recognition: A Survey Carlos Garrido-Munoz Antonio Ríos-Vila Jorge Calvo-Zaragoza 106 0 0 12 Feb 2025
Enhanced Load Forecasting with GAT-LSTM: Leveraging Grid and Temporal Features Ugochukwu Orji Çiçek Güven Dan Stowell AI4TS 50 0 0 12 Feb 2025
Beyond Literal Token Overlap: Token Alignability for Multilinguality Katharina Hämmerl Tomasz Limisiewicz Jindrich Libovický Alexander Fraser 51 0 0 10 Feb 2025
A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions Elisa Negrini Yuxuan Liu Liu Yang Stanley Osher Hayden Schaeffer AI4CE 93 0 0 09 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution Alhossien Waly Bassant Tarek Ali Feteha Rewan Yehia Gasser Amr Walid Gomaa Ahmed M. Fares 66 0 0 07 Feb 2025
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers Adam Stooke Rohit Prabhavalkar K. Sim P. M. Mengibar 39 0 0 06 Feb 2025
A comparison of translation performance between DeepL and Supertext Alex Flückiger Chantal Amrhein Tim Graf Frédéric Odermatt Martin Pömsl Philippe Schläpfer Florian Schottmann Samuel Laubli ELM 45 0 0 04 Feb 2025
Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation George Whittle Juliusz Ziomek Jacob Rawling Michael A. Osborne 97 2 0 04 Feb 2025
Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study Anneketh Vij Changhao Liu Rahul Anil Nair Theo Ho Edward Shi Ayan Bhowmick 53 1 0 04 Feb 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities Rebecca Mobbs Dimitrios Makris Vasileios Argyriou 43 0 0 02 Feb 2025
Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures Gabriel Lindenmaier Sean Papay Sebastian Padó 65 0 0 02 Feb 2025
PolarQuant: Leveraging Polar Transformation for Efficient Key Cache Quantization and Decoding Acceleration Songhao Wu Ang Lv Xiao Feng Wenjie Qu Xun Zhang Guojun Yin Wei Lin Rui Yan MQ 57 0 0 01 Feb 2025
Efficient and Interpretable Neural Networks Using Complex Lehmer Transform M. Ataei Xiaogang Wang 36 0 0 28 Jan 2025
ZETA: Leveraging Z-order Curves for Efficient Top-k Attention Qiuhao Zeng Jerry Huang Peng Lu Gezheng Xu Boxing Chen Charles Ling Boyu Wang 54 1 0 24 Jan 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference Duc Hau Nguyen Duc Hau Nguyen Pascale Sébillot 52 5 0 23 Jan 2025
Extend Adversarial Policy Against Neural Machine Translation via Unknown Token Wei Zou Shujian Huang Jiajun Chen AAML 73 0 0 21 Jan 2025
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation Andreea Iana Fabian David Schmidt Goran Glavas Heiko Paulheim 71 3 0 20 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection Pengcheng Zhao Zhixian He Fuwei Zhang Shujin Lin Fan Zhou 42 1 0 18 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering Anupam Pandey Deepjyoti Bodo Arpan Phukan Asif Ekbal 41 0 0 13 Jan 2025
TFLAG:Towards Practical APT Detection via Deviation-Aware Learning on Temporal Provenance Graph Wenhan Jiang Tingting Chai Hongri Liu Kai Wang Hongke Zhang 41 0 0 13 Jan 2025
Iconicity in Large Language Models Anna Marklová Jiří Milička Leonid Ryvkin Ľudmila Lacková Bennet Libuše Kormaníková 40 0 0 10 Jan 2025
On Creating A Brain-To-Text Decoder Zenon Lamprou Yashar Moshfeghi 38 0 0 10 Jan 2025