ResearchTrend.AI
Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 5,946 papers shown
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
58
4
0
08 Jan 2025
Koopman Learning with Episodic Memory
William T. Redman
Dean Huang
M. Fonoberova
Igor Mezić
44
0
0
08 Jan 2025
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
40
0
0
04 Jan 2025
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers
Markus J. Buehler
AI4CE
35
1
0
04 Jan 2025
Diversity Optimization for Travelling Salesman Problem via Deep Reinforcement Learning
Qi Li
Zhiguang Cao
Yining Ma
Yaoxin Wu
Yue-jiao Gong
53
0
0
03 Jan 2025
Kolmogorov GAM Networks are all you need!
Sarah Polson
Vadim Sokolov
34
0
0
03 Jan 2025
Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention
Zhenyu Guo
Wenguang Chen
46
0
0
01 Jan 2025
Deep Kalman Filters Can Filter
Blanka Horvath
Anastasis Kratsios
Yannick Limmer
Xuwei Yang
53
1
0
31 Dec 2024
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
88
4
0
31 Dec 2024
Symbolic Disentangled Representations for Images
Alexandr Korchemnyi
A. Kovalev
Aleksandr I. Panov
OCL
51
0
0
31 Dec 2024
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
55
4
0
31 Dec 2024
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out
Rıza Özçelik
F. Grisoni
48
0
0
24 Dec 2024
Reconsidering SMT Over NMT for Closely Related Languages: A Case Study of Persian-Hindi Pair
Waisullah Yousofi
Pushpak Bhattacharyya
84
0
0
22 Dec 2024
Sensitive Image Classification by Vision Transformers
Hanxian He
Campbell Wilson
Thanh Thi Nguyen
Janis Dalins
ViT
89
0
0
21 Dec 2024
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation
Gautier Evennou
Antoine Chaffin
Vivien Chappelier
Ewa Kijak
DiffM
79
0
0
20 Dec 2024
Mention Attention for Pronoun Translation
Gongbo Tang
Christian Hardmeier
102
0
0
19 Dec 2024
On the Use of Deep Learning Models for Semantic Clone Detection
Subroto Nag Pinku
Debajyoti Mondal
C. Roy
77
3
0
19 Dec 2024
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance
Sukrit Leelaluk
Cheng Tang
Valdemar Švábenský
Atsushi Shimada
72
1
0
19 Dec 2024
Language verY Rare for All
Ibrahim Merad
Amos Wolf
Ziad Mazzawi
Yannick Léo
77
0
0
18 Dec 2024
Development of an End-to-end Machine Learning System with Application to In-app Purchases
Dionysios Varelas
Elena Bonan
Lewis Anderson
Anders Englesson
Christoffer Åhrling
Adrian Chmielewski-Anders
OffRL
76
0
0
16 Dec 2024
A comprehensive GeoAI review: Progress, Challenges and Outlooks
Anasse Boutayeb
Iyad Lahsen-cherif
Ahmed El Khadimi
89
0
0
16 Dec 2024
Learning Latent Spaces for Domain Generalization in Time Series Forecasting
Songgaojun Deng
Maarten de Rijke
CML
AI4TS
OOD
BDL
73
0
0
15 Dec 2024
The Superalignment of Superhuman Intelligence with Large Language Models
Minlie Huang
Yingkang Wang
Shiyao Cui
Pei Ke
J. Tang
115
1
0
15 Dec 2024
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
176
0
0
14 Dec 2024
Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain
Benno Uthayasooriyar
A. Ly
Franck Vermet
Caio Corro
69
0
0
12 Dec 2024
A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Zhepeng Wang
Runxue Bao
Yawen Wu
Guodong Liu
Lei Yang
Liang Zhan
Feng Zheng
Weiwen Jiang
Yanfu Zhang
76
0
0
09 Dec 2024
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Marcin Przewięźlikowski
Randall Balestriero
Wojciech Jasiński
Marek Śmieja
Bartosz Zieliński
69
0
0
04 Dec 2024
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi
Huaxin Pei
Liang Feng
Yi Zhang
D. Yao
70
0
0
30 Nov 2024
Does Self-Attention Need Separate Weights in Transformers?
Md. Kowsher
Nusrat Jahan Prottasha
Chun-Nam Yu
O. Garibay
Niloofar Yousefi
226
0
0
30 Nov 2024
Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification
Junbo Jacob Lian
67
0
0
29 Nov 2024
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
Syed Mohammed Mostaque Billah
Ateya Ahmed Subarna
Sudipta Nandi Sarna
Ahmad Shawkat Wasit
Anika Fariha
Asif Sushmit
Arig Yousuf Sadeque
57
0
0
29 Nov 2024
An Extensive Evaluation of Factual Consistency in Large Language Models for Data-to-Text Generation
Joy Mahapatra
Utpal Garain
HILM
ALM
69
1
0
28 Nov 2024
Neural Networks Use Distance Metrics
Alan Oursland
59
0
0
26 Nov 2024
Unsupervised Event Outlier Detection in Continuous Time
Somjit Nath
Yik Chau Lui
Siqi Liu
AI4TS
69
0
0
25 Nov 2024
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
Jungeun Kim
Hyeongwoo Jeon
Jongseong Bae
Ha Young Kim
SLR
85
0
0
25 Nov 2024
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering
Mostafa Varzaneh
Pooja Voladoddi
Tanmay Bakshi
Uma Gunturi
72
0
0
22 Nov 2024
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh
César Descalzo
Ljiljana Dolamic
P. Frossard
AAML
77
0
0
19 Nov 2024
Forecasting Application Counts in Talent Acquisition Platforms: Harnessing Multimodal Signals using LMs
Md. Ahsanul Kabir
Kareem E. Abdelfatah
Shushan He
M. Korayem
Mohammad Al Hasan
AI4TS
70
0
0
19 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
Jianfei Chen
Ziqiang Liu
Haoran Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
35
6
0
18 Nov 2024
An exploration of the effect of quantisation on energy consumption and inference time of StarCoder2
Pepijn de Reus
Ana Oprescu
Jelle Zuidema
MQ
85
1
0
15 Nov 2024
On the Shortcut Learning in Multilingual Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Jen-tse Huang
Zhaopeng Tu
Michael R. Lyu
161
1
0
15 Nov 2024
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Tim Elsner
Paula Usinger
Julius Nehring-Wirxel
Gregor Kobsik
Victor Czech
Yanjiang He
I. Lim
Leif Kobbelt
37
0
0
15 Nov 2024
Neural Operators Can Play Dynamic Stackelberg Games
Guillermo Alvarez
Ibrahim Ekren
Anastasis Kratsios
Xuwei Yang
35
0
0
14 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
48
16
0
14 Nov 2024
More Expressive Attention with Negative Weights
Ang Lv
Ruobing Xie
Shuaipeng Li
Jiayi Liao
Xingchen Sun
Zhanhui Kang
Di Wang
Rui Yan
42
0
0
11 Nov 2024
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
Alex Havrilla
Wenjing Liao
39
8
0
11 Nov 2024
CULL-MT: Compression Using Language and Layer pruning for Machine Translation
Pedram Rostami
M. Dousti
39
0
0
10 Nov 2024
Trends, Challenges, and Future Directions in Deep Learning for Glaucoma: A Systematic Review
Mahtab Faraji
Homa Rashidisabet
George R. Nahass
R. Chan
Thasarat S Vajaranant
Darvin Yi
34
0
0
07 Nov 2024
Pruning Literals for Highly Efficient Explainability at Word Level
Rohan Kumar Yadav
Bimal Bhattarai
Abhik Jana
Lei Jiao
Seid Muhie Yimam
32
0
0
07 Nov 2024
LASER: Attention with Exponential Transformation
Sai Surya Duvvuri
Inderjit Dhillon
43
1
0
05 Nov 2024