Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 5,950 papers shown
Title
CULL-MT: Compression Using Language and Layer pruning for Machine Translation
Pedram Rostami
M. Dousti
39
0
0
10 Nov 2024
Trends, Challenges, and Future Directions in Deep Learning for Glaucoma: A Systematic Review
Mahtab Faraji
Homa Rashidisabet
George R. Nahass
R. Chan
Thasarat S Vajaranant
Darvin Yi
34
0
0
07 Nov 2024
Pruning Literals for Highly Efficient Explainability at Word Level
Rohan Kumar Yadav
Bimal Bhattarai
Abhik Jana
Lei Jiao
Seid Muhie Yimam
32
0
0
07 Nov 2024
LASER: Attention with Exponential Transformation
Sai Surya Duvvuri
Inderjit Dhillon
43
1
0
05 Nov 2024
Grouped Discrete Representation for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
BDL
OCL
26
1
0
04 Nov 2024
BiT-MamSleep: Bidirectional Temporal Mamba for EEG Sleep Staging
Xinliang Zhou
Yuzhe Han
Zhenpeng Chen
Chenyu Liu
Yi Ding
Ziyu Jia
Yang Liu
Mamba
39
1
0
03 Nov 2024
Differentiable architecture search with multi-dimensional attention for spiking neural networks
Yilei Man
Linhai Xie
Shushan Qiao
Yumei Zhou
Delong Shang
45
1
0
01 Nov 2024
Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson's Disease Stage Prediction
Guan-Hua Huang
Wan-Chen Lai
Tai-Been Chen
Chien-Chin Hsu
Huei-Yung Chen
Yi-Chen Wu
Li-Ren Yeh
MedIm
39
2
0
31 Oct 2024
A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction
Qidong Yang
Weicheng Zhu
Joseph Keslin
L. Zanna
Tim G. J. Rudner
Carlos Fernandez-Granda
BDL
UQCV
AI4TS
46
0
0
30 Oct 2024
Emergence of meta-stable clustering in mean-field transformer models
Giuseppe Bruno
Federico Pasqualotto
Andrea Agazzi
45
6
0
30 Oct 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRL
AI4CE
36
2
0
30 Oct 2024
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents
Ankan Mullick
Sombit Bose
Abhilash Nandy
G. Chaitanya
Pawan Goyal
24
0
0
29 Oct 2024
Scalable Message Passing Neural Networks: No Need for Attention in Large Graph Representation Learning
Haitz Sáez de Ocáriz Borde
Artem Lukoianov
Anastasis Kratsios
Michael M. Bronstein
Xiaowen Dong
GNN
43
1
0
29 Oct 2024
Efficient Machine Translation with a BiLSTM-Attention Approach
Yuxu Wu
Yiren Xing
22
0
0
29 Oct 2024
Atrial Fibrillation Detection System via Acoustic Sensing for Mobile Phones
Xuanyu Liu
Jiao Li
Haoxian Liu
Zongqi Yang
Yi Huang
Jin Zhang
11
0
0
28 Oct 2024
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation
Wenbo Zhang
Yiming Cui
Kaiyan Zhang
Yifa Wang
Qingfu Zhu
Lingzhi Li
Ting Liu
63
8
0
28 Oct 2024
Visualizing attention zones in machine reading comprehension models
Yiming Cui
Wenbo Zhang
Ting Liu
18
0
0
28 Oct 2024
Referring Human Pose and Mask Estimation in the Wild
Bo Miao
Mingtao Feng
Zijie Wu
Mohammed Bennamoun
Yongsheng Gao
Ajmal Mian
26
0
0
27 Oct 2024
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Naifan Cheung
Nanyun Peng
Kai-Wei Chang
44
1
0
26 Oct 2024
Provable optimal transport with transformers: The essence of depth and prompt engineering
Hadi Daneshmand
OT
42
0
0
25 Oct 2024
Explainable News Summarization -- Analysis and mitigation of Disagreement Problem
Seema Aswani
Sujala D. Shetty
36
0
0
24 Oct 2024
Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition
Zi-Rui Wang
26
0
0
24 Oct 2024
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey
Xinyu Wang
Wenbo Zhang
Sarah Rajtmajer
37
1
0
24 Oct 2024
Melody Construction for Persian lyrics using LSTM recurrent neural networks
Farshad Jafari
Farzad Didehvar
Amin Gheibi
14
0
0
23 Oct 2024
Dynamic graph neural networks for enhanced volatility prediction in financial markets
Pulikandala Nithish Kumar
Nneka Umeorah
Alex Alochukwu
25
0
0
22 Oct 2024
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Boxing Chen
Sarath Chandar
53
0
0
22 Oct 2024
Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic
Jason Chan
Robert Gaizauskas
Zhixue Zhao
ELM
AAML
LRM
35
0
0
21 Oct 2024
A Fusion-Driven Approach of Attention-Based CNN-BiLSTM for Protein Family Classification -- ProFamNet
Bahar Ali
Anwar Shah
Malik Niaz
Musadaq Mansoord
Sami Ullah
Muhammad Adnan
3DV
22
0
0
21 Oct 2024
Deep Graph Attention Networks
Jun Kato
Airi Mita
Keita Gobara
Akihiro Inokuchi
GNN
24
0
0
21 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Di Jiang
Conghui Tan
Rongzhong Lian
MoMe
35
0
0
21 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhar
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDa
AIFin
34
3
0
18 Oct 2024
Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Rui Pu
Chaozhuo Li
Rui Ha
Zejian Chen
Litian Zhang
Ziqiang Liu
Lirong Qiu
Xi Zhang
AAML
34
2
0
18 Oct 2024
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
Nghia Hieu Nguyen
Tho Thanh Quan
Ngan Luu-Thuy Nguyen
31
0
0
18 Oct 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning
Ilya Kaufman
Omri Azencot
AI4TS
31
2
0
17 Oct 2024
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good?
Idris Abdulmumin
B. Galadanci
G. Aliyu
Shamsuddeen Hassan Muhammad
37
1
0
17 Oct 2024
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games
Pranav Rajbhandari
Prithviraj Dasgupta
D. Sofge
21
0
0
17 Oct 2024
Reducing the Transformer Architecture to a Minimum
Bernhard Bermeitinger
T. Hrycej
Massimo Pavone
Julianus Kath
Siegfried Handschuh
19
0
0
17 Oct 2024
DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone
Hongfan Gao
Wangmeng Shen
Xiangfei Qiu
Ronghui Xu
Jilin Hu
Bin Yang
30
5
0
17 Oct 2024
Recurrent Neural Goodness-of-Fit Test for Time Series
Aoran Zhang
Wenbin Zhou
Liyan Xie
Shixiang Zhu
40
1
0
17 Oct 2024
Artificial Kuramoto Oscillatory Neurons
Takeru Miyato
Sindy Lowe
Andreas Geiger
Max Welling
AI4CE
77
6
0
17 Oct 2024
Unifying Economic and Language Models for Enhanced Sentiment Analysis of the Oil Market
Himmet Kaplan
R. Mundani
Heiko Rölke
A. Weichselbraun
Martin Tschudy
14
0
0
16 Oct 2024
How much do contextualized representations encode long-range context?
Simeng Sun
Cheng-Ping Hsieh
46
0
0
16 Oct 2024
Network Representation Learning for Biophysical Neural Network Analysis
Youngmok Ha
Yongjoo Kim
Hyun Jae Jang
Seungyeon Lee
Eunji Pak
28
0
0
15 Oct 2024
Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of Operations
Seongho Kim
Jihyun Moon
Juntaek Oh
Insu Choi
Joon-Sung Yang
23
0
0
15 Oct 2024
Quadratic Gating Functions in Mixture of Experts: A Statistical Insight
Pedram Akbarian
Huy Le Nguyen
Xing Han
Nhat Ho
MoE
42
0
0
15 Oct 2024
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture
Sajad Movahedi
Antonio Orvieto
Seyed-Mohsen Moosavi-Dezfooli
AI4CE
AAML
172
0
0
15 Oct 2024
ChakmaNMT: A Low-resource Machine Translation On Chakma Language
Aunabil Chakma
Aditya Chakma
Soham Khisa
Chumui Tripura
Masum Hasan
Rifat Shahriyar
21
0
0
14 Oct 2024
A Framework to Enable Algorithmic Design Choice Exploration in DNNs
Timothy L. Cronin IV
Sanmukh Kuppannagari
45
0
0
10 Oct 2024
Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow
Cyrile Delestre
Yoann Sola
24
0
0
10 Oct 2024
When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context
Enrique Noriega-Atala
Robert Vacareanu
Salena Torres Ashton
A. Pyarelal
Clayton T. Morrison
Mihai Surdeanu
34
0
0
10 Oct 2024