Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 5,950 papers shown

Title
CULL-MT: Compression Using Language and Layer pruning for Machine Translation Pedram Rostami M. Dousti 39 0 0 10 Nov 2024
Trends, Challenges, and Future Directions in Deep Learning for Glaucoma: A Systematic Review Mahtab Faraji Homa Rashidisabet George R. Nahass R. Chan Thasarat S Vajaranant Darvin Yi 34 0 0 07 Nov 2024
Pruning Literals for Highly Efficient Explainability at Word Level Rohan Kumar Yadav Bimal Bhattarai Abhik Jana Lei Jiao Seid Muhie Yimam 32 0 0 07 Nov 2024
LASER: Attention with Exponential Transformation Sai Surya Duvvuri Inderjit Dhillon 43 1 0 05 Nov 2024
Grouped Discrete Representation for Object-Centric Learning Rongzhen Zhao V. Wang Arno Solin Joni Pajarinen BDL OCL 26 1 0 04 Nov 2024
BiT-MamSleep: Bidirectional Temporal Mamba for EEG Sleep Staging Xinliang Zhou Yuzhe Han Zhenpeng Chen Chenyu Liu Yi Ding Ziyu Jia Yang Liu Mamba 39 1 0 03 Nov 2024
Differentiable architecture search with multi-dimensional attention for spiking neural networks Yilei Man Linhai Xie Shushan Qiao Yumei Zhou Delong Shang 45 1 0 01 Nov 2024
Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson's Disease Stage Prediction Guan-Hua Huang Wan-Chen Lai Tai-Been Chen Chien-Chin Hsu Huei-Yung Chen Yi-Chen Wu Li-Ren Yeh MedIm 39 2 0 31 Oct 2024
A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction Qidong Yang Weicheng Zhu Joseph Keslin L. Zanna Tim G. J. Rudner Carlos Fernandez-Granda BDL UQCV AI4TS 46 0 0 30 Oct 2024
Emergence of meta-stable clustering in mean-field transformer models Giuseppe Bruno Federico Pasqualotto Andrea Agazzi 45 6 0 30 Oct 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks Michael T. Matthews Michael Beukman Chris Xiaoxuan Lu Jakob Foerster OffRL AI4CE 36 2 0 30 Oct 2024
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents Ankan Mullick Sombit Bose Abhilash Nandy G. Chaitanya Pawan Goyal 24 0 0 29 Oct 2024
Scalable Message Passing Neural Networks: No Need for Attention in Large Graph Representation Learning Haitz Sáez de Ocáriz Borde Artem Lukoianov Anastasis Kratsios Michael M. Bronstein Xiaowen Dong GNN 43 1 0 29 Oct 2024
Efficient Machine Translation with a BiLSTM-Attention Approach Yuxu Wu Yiren Xing 22 0 0 29 Oct 2024
Atrial Fibrillation Detection System via Acoustic Sensing for Mobile Phones Xuanyu Liu Jiao Li Haoxian Liu Zongqi Yang Yi Huang Jin Zhang 11 0 0 28 Oct 2024
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation Wenbo Zhang Yiming Cui Kaiyan Zhang Yifa Wang Qingfu Zhu Lingzhi Li Ting Liu 63 8 0 28 Oct 2024
Visualizing attention zones in machine reading comprehension models Yiming Cui Wenbo Zhang Ting Liu 18 0 0 28 Oct 2024
Referring Human Pose and Mask Estimation in the Wild Bo Miao Mingtao Feng Zijie Wu Mohammed Bennamoun Yongsheng Gao Ajmal Mian 26 0 0 27 Oct 2024
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization Zhecheng Li Yijiao Wang Bryan Hooi Yujun Cai Naifan Cheung Nanyun Peng Kai-Wei Chang 44 1 0 26 Oct 2024
Provable optimal transport with transformers: The essence of depth and prompt engineering Hadi Daneshmand OT 42 0 0 25 Oct 2024
Explainable News Summarization -- Analysis and mitigation of Disagreement Problem Seema Aswani Sujala D. Shetty 36 0 0 24 Oct 2024
Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition Zi-Rui Wang 26 0 0 24 Oct 2024
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey Xinyu Wang Wenbo Zhang Sarah Rajtmajer 37 1 0 24 Oct 2024
Melody Construction for Persian lyrics using LSTM recurrent neural networks Farshad Jafari Farzad Didehvar Amin Gheibi 14 0 0 23 Oct 2024
Dynamic graph neural networks for enhanced volatility prediction in financial markets Pulikandala Nithish Kumar Nneka Umeorah Alex Alochukwu 25 0 0 22 Oct 2024
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination Jerry Huang Prasanna Parthasarathi Mehdi Rezagholizadeh Boxing Chen Sarath Chandar 53 0 0 22 Oct 2024
Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic Jason Chan Robert Gaizauskas Zhixue Zhao ELM AAML LRM 35 0 0 21 Oct 2024
A Fusion-Driven Approach of Attention-Based CNN-BiLSTM for Protein Family Classification -- ProFamNet Bahar Ali Anwar Shah Malik Niaz Musadaq Mansoord Sami Ullah Muhammad Adnan 3DV 22 0 0 21 Oct 2024
Deep Graph Attention Networks Jun Kato Airi Mita Keita Gobara Akihiro Inokuchi GNN 24 0 0 21 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation Victor Junqiu Wei Weicheng Wang Di Jiang Conghui Tan Rongzhong Lian MoMe 35 0 0 21 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens Zhepeng Cen Yao Liu Siliang Zeng Pratik Chaudhar Huzefa Rangwala George Karypis Rasool Fakoor SyDa AIFin 34 3 0 18 Oct 2024
Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs Rui Pu Chaozhuo Li Rui Ha Zejian Chen Litian Zhang Ziqiang Liu Lirong Qiu Xi Zhang AAML 34 2 0 18 Oct 2024
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering Nghia Hieu Nguyen Tho Thanh Quan Ngan Luu-Thuy Nguyen 31 0 0 18 Oct 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning Ilya Kaufman Omri Azencot AI4TS 31 2 0 17 Oct 2024
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good? Idris Abdulmumin B. Galadanci G. Aliyu Shamsuddeen Hassan Muhammad 37 1 0 17 Oct 2024
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games Pranav Rajbhandari Prithviraj Dasgupta D. Sofge 21 0 0 17 Oct 2024
Reducing the Transformer Architecture to a Minimum Bernhard Bermeitinger T. Hrycej Massimo Pavone Julianus Kath Siegfried Handschuh 19 0 0 17 Oct 2024
DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone Hongfan Gao Wangmeng Shen Xiangfei Qiu Ronghui Xu Jilin Hu Bin Yang 30 5 0 17 Oct 2024
Recurrent Neural Goodness-of-Fit Test for Time Series Aoran Zhang Wenbin Zhou Liyan Xie Shixiang Zhu 40 1 0 17 Oct 2024
Artificial Kuramoto Oscillatory Neurons Takeru Miyato Sindy Lowe Andreas Geiger Max Welling AI4CE 77 6 0 17 Oct 2024
Unifying Economic and Language Models for Enhanced Sentiment Analysis of the Oil Market Himmet Kaplan R. Mundani Heiko Rölke A. Weichselbraun Martin Tschudy 14 0 0 16 Oct 2024
How much do contextualized representations encode long-range context? Simeng Sun Cheng-Ping Hsieh 46 0 0 16 Oct 2024
Network Representation Learning for Biophysical Neural Network Analysis Youngmok Ha Yongjoo Kim Hyun Jae Jang Seungyeon Lee Eunji Pak 28 0 0 15 Oct 2024
Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of Operations Seongho Kim Jihyun Moon Juntaek Oh Insu Choi Joon-Sung Yang 23 0 0 15 Oct 2024
Quadratic Gating Functions in Mixture of Experts: A Statistical Insight Pedram Akbarian Huy Le Nguyen Xing Han Nhat Ho MoE 42 0 0 15 Oct 2024
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture Sajad Movahedi Antonio Orvieto Seyed-Mohsen Moosavi-Dezfooli AI4CE AAML 172 0 0 15 Oct 2024
ChakmaNMT: A Low-resource Machine Translation On Chakma Language Aunabil Chakma Aditya Chakma Soham Khisa Chumui Tripura Masum Hasan Rifat Shahriyar 21 0 0 14 Oct 2024
A Framework to Enable Algorithmic Design Choice Exploration in DNNs Timothy L. Cronin IV Sanmukh Kuppannagari 45 0 0 10 Oct 2024
Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow Cyrile Delestre Yoann Sola 24 0 0 10 Oct 2024
When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context Enrique Noriega-Atala Robert Vacareanu Salena Torres Ashton A. Pyarelal Clayton T. Morrison Mihai Surdeanu 34 0 0 10 Oct 2024