arXiv:1409.0473
Neural Machine Translation by Jointly Learning to Align and Translate
1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"
50 / 5,950 papers shown
Title
CULL-MT: Compression Using Language and Layer pruning for Machine Translation
Pedram Rostami
M. Dousti
39
0
0
10 Nov 2024
Trends, Challenges, and Future Directions in Deep Learning for Glaucoma: A Systematic Review
Mahtab Faraji
Homa Rashidisabet
George R. Nahass
R. Chan
Thasarat S Vajaranant
Darvin Yi
34
0
0
07 Nov 2024
Pruning Literals for Highly Efficient Explainability at Word Level
Rohan Kumar Yadav
Bimal Bhattarai
Abhik Jana
Lei Jiao
Seid Muhie Yimam
32
0
0
07 Nov 2024
LASER: Attention with Exponential Transformation
Sai Surya Duvvuri
Inderjit Dhillon
43
1
0
05 Nov 2024
Grouped Discrete Representation for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
BDL
OCL
26
1
0
04 Nov 2024
BiT-MamSleep: Bidirectional Temporal Mamba for EEG Sleep Staging
Xinliang Zhou
Yuzhe Han
Zhenpeng Chen
Chenyu Liu
Yi Ding
Ziyu Jia
Yang Liu
Mamba
39
1
0
03 Nov 2024
Differentiable architecture search with multi-dimensional attention for spiking neural networks
Yilei Man
Linhai Xie
Shushan Qiao
Yumei Zhou
Delong Shang
45
1
0
01 Nov 2024
Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson's Disease Stage Prediction
Guan-Hua Huang
Wan-Chen Lai
Tai-Been Chen
Chien-Chin Hsu
Huei-Yung Chen
Yi-Chen Wu
Li-Ren Yeh
MedIm
39
2
0
31 Oct 2024
A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction
Qidong Yang
Weicheng Zhu
Joseph Keslin
L. Zanna
Tim G. J. Rudner
Carlos Fernandez-Granda
BDL
UQCV
AI4TS
46
0
0
30 Oct 2024
Emergence of meta-stable clustering in mean-field transformer models
Giuseppe Bruno
Federico Pasqualotto
Andrea Agazzi
45
6
0
30 Oct 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRL
AI4CE
36
2
0
30 Oct 2024
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents
Ankan Mullick
Sombit Bose
Abhilash Nandy
G. Chaitanya
Pawan Goyal
24
0
0
29 Oct 2024
Scalable Message Passing Neural Networks: No Need for Attention in Large Graph Representation Learning
Haitz Sáez de Ocáriz Borde
Artem Lukoianov
Anastasis Kratsios
Michael M. Bronstein
Xiaowen Dong
GNN
43
1
0
29 Oct 2024
Efficient Machine Translation with a BiLSTM-Attention Approach
Yuxu Wu
Yiren Xing
22
0
0
29 Oct 2024
Atrial Fibrillation Detection System via Acoustic Sensing for Mobile Phones
Xuanyu Liu
Jiao Li
Haoxian Liu
Zongqi Yang
Yi Huang
Jin Zhang
11
0
0
28 Oct 2024
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation
Wenbo Zhang
Yiming Cui
Kaiyan Zhang
Yifa Wang
Qingfu Zhu
Lingzhi Li
Ting Liu
63
8
0
28 Oct 2024
Visualizing attention zones in machine reading comprehension models
Yiming Cui
Wenbo Zhang
Ting Liu
18
0
0
28 Oct 2024
Referring Human Pose and Mask Estimation in the Wild
Bo Miao
Mingtao Feng
Zijie Wu
Mohammed Bennamoun
Yongsheng Gao
Ajmal Mian
26
0
0
27 Oct 2024
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Naifan Cheung
Nanyun Peng
Kai-Wei Chang
44
1
0
26 Oct 2024
Provable optimal transport with transformers: The essence of depth and prompt engineering
Hadi Daneshmand
OT
42
0
0
25 Oct 2024
Explainable News Summarization -- Analysis and mitigation of Disagreement Problem
Seema Aswani
Sujala D. Shetty
36
0
0
24 Oct 2024
Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition
Zi-Rui Wang
26
0
0
24 Oct 2024
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey
Xinyu Wang
Wenbo Zhang
Sarah Rajtmajer
37
1
0
24 Oct 2024
Melody Construction for Persian lyrics using LSTM recurrent neural networks
Farshad Jafari
Farzad Didehvar
Amin Gheibi
14
0
0
23 Oct 2024
Dynamic graph neural networks for enhanced volatility prediction in financial markets
Pulikandala Nithish Kumar
Nneka Umeorah
Alex Alochukwu
25
0
0
22 Oct 2024
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Boxing Chen
Sarath Chandar
53
0
0
22 Oct 2024
Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic
Jason Chan
Robert Gaizauskas
Zhixue Zhao
ELM
AAML
LRM
35
0
0
21 Oct 2024
A Fusion-Driven Approach of Attention-Based CNN-BiLSTM for Protein Family Classification -- ProFamNet
Bahar Ali
Anwar Shah
Malik Niaz
Musadaq Mansoord
Sami Ullah
Muhammad Adnan
3DV
22
0
0
21 Oct 2024
Deep Graph Attention Networks
Jun Kato
Airi Mita
Keita Gobara
Akihiro Inokuchi
GNN
24
0
0
21 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Di Jiang
Conghui Tan
Rongzhong Lian
MoMe
35
0
0
21 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhari
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDa
AIFin
34
3
0
18 Oct 2024
Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Rui Pu
Chaozhuo Li
Rui Ha
Zejian Chen
Litian Zhang
Ziqiang Liu
Lirong Qiu
Xi Zhang
AAML
34
2
0
18 Oct 2024
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
Nghia Hieu Nguyen
Tho Thanh Quan
Ngan Luu-Thuy Nguyen
31
0
0
18 Oct 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning
Ilya Kaufman
Omri Azencot
AI4TS
31
2
0
17 Oct 2024
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good?
Idris Abdulmumin
B. Galadanci
G. Aliyu
Shamsuddeen Hassan Muhammad
37
1
0
17 Oct 2024
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games
Pranav Rajbhandari
Prithviraj Dasgupta
D. Sofge
21
0
0
17 Oct 2024
Reducing the Transformer Architecture to a Minimum
Bernhard Bermeitinger
T. Hrycej
Massimo Pavone
Julianus Kath
Siegfried Handschuh
19
0
0
17 Oct 2024
DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone
Hongfan Gao
Wangmeng Shen
Xiangfei Qiu
Ronghui Xu
Jilin Hu
Bin Yang
30
5
0
17 Oct 2024
Recurrent Neural Goodness-of-Fit Test for Time Series
Aoran Zhang
Wenbin Zhou
Liyan Xie
Shixiang Zhu
40
1
0
17 Oct 2024
Artificial Kuramoto Oscillatory Neurons
Takeru Miyato
Sindy Lowe
Andreas Geiger
Max Welling
AI4CE
77
6
0
17 Oct 2024
Unifying Economic and Language Models for Enhanced Sentiment Analysis of the Oil Market
Himmet Kaplan
R. Mundani
Heiko Rölke
A. Weichselbraun
Martin Tschudy
14
0
0
16 Oct 2024
How much do contextualized representations encode long-range context?
Simeng Sun
Cheng-Ping Hsieh
46
0
0
16 Oct 2024
Network Representation Learning for Biophysical Neural Network Analysis
Youngmok Ha
Yongjoo Kim
Hyun Jae Jang
Seungyeon Lee
Eunji Pak
28
0
0
15 Oct 2024
Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of Operations
Seongho Kim
Jihyun Moon
Juntaek Oh
Insu Choi
Joon-Sung Yang
23
0
0
15 Oct 2024
Quadratic Gating Functions in Mixture of Experts: A Statistical Insight
Pedram Akbarian
Huy Le Nguyen
Xing Han
Nhat Ho
MoE
42
0
0
15 Oct 2024
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture
Sajad Movahedi
Antonio Orvieto
Seyed-Mohsen Moosavi-Dezfooli
AI4CE
AAML
172
0
0
15 Oct 2024
ChakmaNMT: A Low-resource Machine Translation On Chakma Language
Aunabil Chakma
Aditya Chakma
Soham Khisa
Chumui Tripura
Masum Hasan
Rifat Shahriyar
21
0
0
14 Oct 2024
A Framework to Enable Algorithmic Design Choice Exploration in DNNs
Timothy L. Cronin IV
Sanmukh Kuppannagari
45
0
0
10 Oct 2024
Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow
Cyrile Delestre
Yoann Sola
24
0
0
10 Oct 2024
When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context
Enrique Noriega-Atala
Robert Vacareanu
Salena Torres Ashton
A. Pyarelal
Clayton T. Morrison
Mihai Surdeanu
34
0
0
10 Oct 2024