Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1409.0473
Cited By
Neural Machine Translation by Jointly Learning to Align and Translate
1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Machine Translation by Jointly Learning to Align and Translate"
50 / 5,946 papers shown
Title
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
58
4
0
08 Jan 2025
Koopman Learning with Episodic Memory
William T. Redman
Dean Huang
M. Fonoberova
Igor Mezić
44
0
0
08 Jan 2025
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
40
0
0
04 Jan 2025
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers
Markus J. Buehler
AI4CE
35
1
0
04 Jan 2025
Diversity Optimization for Travelling Salesman Problem via Deep Reinforcement Learning
Qi Li
Zhiguang Cao
Yining Ma
Yaoxin Wu
Yue-jiao Gong
53
0
0
03 Jan 2025
Kolmogorov GAM Networks are all you need!
Sarah Polson
Vadim Sokolov
34
0
0
03 Jan 2025
Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention
Zhenyu Guo
Wenguang Chen
46
0
0
01 Jan 2025
Deep Kalman Filters Can Filter
Blanka Hovart
Anastasis Kratsios
Yannick Limmer
Xuwei Yang
53
1
0
31 Dec 2024
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
88
4
0
31 Dec 2024
Symbolic Disentangled Representations for Images
Alexandr Korchemnyi
A. Kovalev
Aleksandr I. Panov
OCL
51
0
0
31 Dec 2024
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
55
4
0
31 Dec 2024
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out
Rıza Özçelik
F. Grisoni
48
0
0
24 Dec 2024
Reconsidering SMT Over NMT for Closely Related Languages: A Case Study of Persian-Hindi Pair
Waisullah Yousofi
Pushpak Bhattacharyya
84
0
0
22 Dec 2024
Sensitive Image Classification by Vision Transformers
Hanxian He
Campbell Wilson
Thanh Thi Nguyen
Janis Dalins
ViT
89
0
0
21 Dec 2024
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation
Gautier Evennou
Antoine Chaffin
Vivien Chappelier
Ewa Kijak
DiffM
79
0
0
20 Dec 2024
Mention Attention for Pronoun Translation
Gongbo Tang
Christian Hardmeier
102
0
0
19 Dec 2024
On the Use of Deep Learning Models for Semantic Clone Detection
Subroto Nag Pinku
Debajyoti Mondal
C. Roy
77
3
0
19 Dec 2024
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance
Sukrit Leelaluk
Cheng Tang
Valdemar Švábenský
Atsushi Shimada
72
1
0
19 Dec 2024
Language verY Rare for All
Ibrahim Merad
Amos Wolf
Ziad Mazzawi
Yannick Léo
77
0
0
18 Dec 2024
Development of an End-to-end Machine Learning System with Application to In-app Purchases
Dionysios Varelas
Elena Bonan
Lewis Anderson
Anders Englesson
Christoffer Åhrling
Adrian Chmielewski-Anders
OffRL
76
0
0
16 Dec 2024
A comprehensive GeoAI review: Progress, Challenges and Outlooks
Anasse Boutayeb
Iyad Lahsen-cherif
Ahmed El Khadimi
89
0
0
16 Dec 2024
Learning Latent Spaces for Domain Generalization in Time Series Forecasting
Songgaojun Deng
Maarten de Rijke
CML
AI4TS
OOD
BDL
73
0
0
15 Dec 2024
The Superalignment of Superhuman Intelligence with Large Language Models
Minlie Huang
Yingkang Wang
Shiyao Cui
Pei Ke
J. Tang
115
1
0
15 Dec 2024
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
176
0
0
14 Dec 2024
Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain
Benno Uthayasooriyar
A. Ly
Franck Vermet
Caio Corro
69
0
0
12 Dec 2024
A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Zhepeng Wang
Runxue Bao
Yawen Wu
Guodong Liu
Lei Yang
Liang Zhan
Feng Zheng
Weiwen Jiang
Yanfu Zhang
76
0
0
09 Dec 2024
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Marcin Przewiȩźlikowski
Randall Balestriero
Wojciech Jasiński
Marek 'Smieja
Bartosz Zieliñski
69
0
0
04 Dec 2024
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi
Huaxin Pei
Liang Feng
Yi Zhang
D. Yao
70
0
0
30 Nov 2024
Does Self-Attention Need Separate Weights in Transformers?
Md. Kowsher
Nusrat Jahan Prottasha
Chun-Nam Yu
O. Garibay
Niloofar Yousefi
226
0
0
30 Nov 2024
Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification
Junbo Jacob Lian
67
0
0
29 Nov 2024
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
Syed Mohammed Mostaque Billah
Ateya Ahmed Subarna
Sudipta Nandi Sarna
Ahmad Shawkat Wasit
Anika Fariha
Asif Sushmit
Arig Yousuf Sadeque
57
0
0
29 Nov 2024
An Extensive Evaluation of Factual Consistency in Large Language Models for Data-to-Text Generation
Joy Mahapatra
Utpal Garain
HILM
ALM
69
1
0
28 Nov 2024
Neural Networks Use Distance Metrics
Alan Oursland
59
0
0
26 Nov 2024
Unsupervised Event Outlier Detection in Continuous Time
Somjit Nath
Yik Chau Lui
Siqi Liu
AI4TS
69
0
0
25 Nov 2024
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
Jungeun Kim
Hyeongwoo Jeon
Jongseong Bae
Ha Young Kim
SLR
85
0
0
25 Nov 2024
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering
Mostafa Varzaneh
Pooja Voladoddi
Tanmay Bakshi
Uma Gunturi
72
0
0
22 Nov 2024
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh
César Descalzo
Ljiljana Dolamic
P. Frossard
AAML
77
0
0
19 Nov 2024
Forecasting Application Counts in Talent Acquisition Platforms: Harnessing Multimodal Signals using LMs
Md. Ahsanul Kabir
Kareem E. Abdelfatah
Shushan He
M. Korayem
Mohammad Al Hasan
AI4TS
70
0
0
19 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
Jianfei Chen
Ziqiang Liu
Haoran Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
35
6
0
18 Nov 2024
An exploration of the effect of quantisation on energy consumption and inference time of StarCoder2
Pepijn de Reus
Ana Oprescu
Jelle Zuidema
MQ
85
1
0
15 Nov 2024
On the Shortcut Learning in Multilingual Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Jen-tse Huang
Zhaopeng Tu
Michael R. Lyu
161
1
0
15 Nov 2024
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Tim Elsner
Paula Usinger
Julius Nehring-Wirxel
Gregor Kobsik
Victor Czech
Yanjiang He
I. Lim
Leif Kobbelt
37
0
0
15 Nov 2024
Neural Operators Can Play Dynamic Stackelberg Games
Guillermo Alvarez
Ibrahim Ekren
Anastasis Kratsios
Xuwei Yang
35
0
0
14 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
48
16
0
14 Nov 2024
More Expressive Attention with Negative Weights
Ang Lv
Ruobing Xie
Shuaipeng Li
Jiayi Liao
Xingchen Sun
Zhanhui Kang
Di Wang
Rui Yan
42
0
0
11 Nov 2024
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
Alex Havrilla
Wenjing Liao
39
8
0
11 Nov 2024
CULL-MT: Compression Using Language and Layer pruning for Machine Translation
Pedram Rostami
M. Dousti
39
0
0
10 Nov 2024
Trends, Challenges, and Future Directions in Deep Learning for Glaucoma: A Systematic Review
Mahtab Faraji
Homa Rashidisabet
George R. Nahass
R. Chan
Thasarat S Vajaranant
Darvin Yi
34
0
0
07 Nov 2024
Pruning Literals for Highly Efficient Explainability at Word Level
Rohan Kumar Yadav
Bimal Bhattarai
Abhik Jana
Lei Jiao
Seid Muhie Yimam
32
0
0
07 Nov 2024
LASER: Attention with Exponential Transformation
Sai Surya Duvvuri
Inderjit Dhillon
43
1
0
05 Nov 2024
Previous
1
2
3
4
5
...
117
118
119
Next