ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.0473
  4. Cited By
Neural Machine Translation by Jointly Learning to Align and Translate

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat
ArXivPDFHTML

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 6,341 papers shown
Title
Divide et Impera: Multi-Transformer Architectures for Complex NLP-Tasks
Divide et Impera: Multi-Transformer Architectures for Complex NLP-Tasks
Solveig Helland
Elena Gavagnin
Alexandre de Spindler
35
2
0
25 Oct 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email
  Response Prediction
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
Yuqing Wang
Prashanth Vijayaraghavan
Ehsan Degan
22
4
0
25 Oct 2023
SpikingJelly: An open-source machine learning infrastructure platform
  for spike-based intelligence
SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence
Wei Fang
Yanqing Chen
Jianhao Ding
Zhaofei Yu
T. Masquelier
Ding Chen
Liwei Huang
Huihui Zhou
Guoqi Li
Yonghong Tian
36
206
0
25 Oct 2023
On the Interplay between Fairness and Explainability
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
27
4
0
25 Oct 2023
Can Virtual Reality Protect Users from Keystroke Inference Attacks?
Can Virtual Reality Protect Users from Keystroke Inference Attacks?
Zhuolin Yang
Zain Sarwar
Iris Hwang
Ronik Bhaskar
Ben Y. Zhao
Haitao Zheng
21
8
0
24 Oct 2023
A Language Model with Limited Memory Capacity Captures Interference in
  Human Sentence Processing
A Language Model with Limited Memory Capacity Captures Interference in Human Sentence Processing
William Timkey
Tal Linzen
24
15
0
24 Oct 2023
DACOOP-A: Decentralized Adaptive Cooperative Pursuit via Attention
DACOOP-A: Decentralized Adaptive Cooperative Pursuit via Attention
Zheng Zhang
Dengyu Zhang
Qingrui Zhang
Wei Pan
Tianjiang Hu
33
4
0
24 Oct 2023
Expression Syntax Information Bottleneck for Math Word Problems
Expression Syntax Information Bottleneck for Math Word Problems
Jing Xiong
Chengming Li
Min Yang
Xiping Hu
Bin Hu
35
5
0
24 Oct 2023
tagE: Enabling an Embodied Agent to Understand Human Instructions
tagE: Enabling an Embodied Agent to Understand Human Instructions
Chayan Sarkar
Avik Mitra
Pradip Pramanick
Tapas Nayak
LM&Ro
51
1
0
24 Oct 2023
Detecting Intentional AIS Shutdown in Open Sea Maritime Surveillance
  Using Self-Supervised Deep Learning
Detecting Intentional AIS Shutdown in Open Sea Maritime Surveillance Using Self-Supervised Deep Learning
Pierre Bernabé
Arnaud Gotlieb
B. Legeard
D. Marijan
F. Sem-Jacobsen
Helge Spieker
24
15
0
24 Oct 2023
Meta learning with language models: Challenges and opportunities in the
  classification of imbalanced text
Meta learning with language models: Challenges and opportunities in the classification of imbalanced text
Apostol T. Vassilev
Honglan Jin
Munawar Hasan
21
0
0
23 Oct 2023
Adaptive Policy with Wait-$k$ Model for Simultaneous Translation
Adaptive Policy with Wait-kkk Model for Simultaneous Translation
Libo Zhao
Kai Fan
Wei Luo
Jing Wu
Shushu Wang
Ziqian Zeng
Zhongqiang Huang
61
9
0
23 Oct 2023
Generative Pre-trained Transformer for Vietnamese Community-based
  COVID-19 Question Answering
Generative Pre-trained Transformer for Vietnamese Community-based COVID-19 Question Answering
T. M. Vo
Khiem Vinh Tran
22
1
0
23 Oct 2023
Boosting Unsupervised Machine Translation with Pseudo-Parallel Data
Boosting Unsupervised Machine Translation with Pseudo-Parallel Data
Ivana Kvapilíková
Ondrej Bojar
MoE
30
0
0
22 Oct 2023
Code-Switching with Word Senses for Pretraining in Neural Machine
  Translation
Code-Switching with Word Senses for Pretraining in Neural Machine Translation
Vivek Iyer
Edoardo Barba
Alexandra Birch
Jeff Z. Pan
Roberto Navigli
20
3
0
21 Oct 2023
Plausibility Processing in Transformer Language Models: Focusing on the
  Role of Attention Heads in GPT
Plausibility Processing in Transformer Language Models: Focusing on the Role of Attention Heads in GPT
Soo Hyun Ryu
19
0
0
20 Oct 2023
On Synthetic Data for Back Translation
On Synthetic Data for Back Translation
Jiahao Xu
Yubin Ruan
Wei Bi
Guoping Huang
Shuming Shi
Lihui Chen
Lemao Liu
38
12
0
20 Oct 2023
Bridging Information-Theoretic and Geometric Compression in Language
  Models
Bridging Information-Theoretic and Geometric Compression in Language Models
Emily Cheng
Corentin Kervadec
Marco Baroni
36
17
0
20 Oct 2023
Semi-supervised multimodal coreference resolution in image narrations
Semi-supervised multimodal coreference resolution in image narrations
A. Goel
Basura Fernando
Frank Keller
Hakan Bilen
52
4
0
20 Oct 2023
Unveiling Energy Efficiency in Deep Learning: Measurement, Prediction,
  and Scoring across Edge Devices
Unveiling Energy Efficiency in Deep Learning: Measurement, Prediction, and Scoring across Edge Devices
Xiaolong Tu
Anik Mallik
Dawei Chen
Kyungtae Han
Onur Altintas
Haoxin Wang
Jiang Xie
27
12
0
19 Oct 2023
Character-level Chinese Backpack Language Models
Character-level Chinese Backpack Language Models
Hao Sun
John Hewitt
29
0
0
19 Oct 2023
Neurosymbolic Grounding for Compositional World Models
Neurosymbolic Grounding for Compositional World Models
Atharva Sehgal
Arya Grayeli
Jennifer J. Sun
Swarat Chaudhuri
45
5
0
19 Oct 2023
American Option Pricing using Self-Attention GRU and Shapley Value
  Interpretation
American Option Pricing using Self-Attention GRU and Shapley Value Interpretation
Yanhui Shen
38
0
0
19 Oct 2023
The Shifted and The Overlooked: A Task-oriented Investigation of
  User-GPT Interactions
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions
Siru Ouyang
Shuohang Wang
Yang Liu
Ming Zhong
Yizhu Jiao
Dan Iter
Reid Pryzant
Chenguang Zhu
Heng Ji
Jiawei Han
42
26
0
19 Oct 2023
Learning to Optimise Climate Sensor Placement using a Transformer
Learning to Optimise Climate Sensor Placement using a Transformer
Chen Wang
Victoria Huang
Gang Chen
Hui Ma
Bryce Chen
Jochen Schmidt
27
0
0
18 Oct 2023
knn-seq: Efficient, Extensible kNN-MT Framework
knn-seq: Efficient, Extensible kNN-MT Framework
Hiroyuki Deguchi
Hayate Hirano
T. Hoshino
Yuto Nishida
Justin Vasselli
Taro Watanabe
17
1
0
18 Oct 2023
Document-Level Language Models for Machine Translation
Document-Level Language Models for Machine Translation
Frithjof Petrick
Christian Herold
Pavel Petrushkov
Shahram Khadivi
Hermann Ney
32
9
0
18 Oct 2023
Direct Neural Machine Translation with Task-level Mixture of Experts
  models
Direct Neural Machine Translation with Task-level Mixture of Experts models
Isidora Chara Tourni
Subhajit Naskar
MoE
21
0
0
18 Oct 2023
Harnessing Dataset Cartography for Improved Compositional Generalization
  in Transformers
Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers
Osman Batur .Ince
Tanin Zeraati
Semih Yagcioglu
Yadollah Yaghoobzadeh
Erkut Erdem
Aykut Erdem
26
1
0
18 Oct 2023
Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for
  Long Sequences
Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long Sequences
Yanming Kang
Giang Tran
H. Sterck
23
3
0
18 Oct 2023
Brain decoding: toward real-time reconstruction of visual perception
Brain decoding: toward real-time reconstruction of visual perception
Yohann Benchetrit
Hubert J. Banville
Jean-Rémi King
41
46
0
18 Oct 2023
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language
  Models to Generalize to Novel Interpretations
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Arkil Patel
S. Bhattamishra
Siva Reddy
Dzmitry Bahdanau
39
5
0
18 Oct 2023
Enhancing Neural Machine Translation with Semantic Units
Enhancing Neural Machine Translation with Semantic Units
Langlin Huang
Shuhao Gu
Zhuocheng Zhang
Yang Feng
49
4
0
17 Oct 2023
Revealing the Unwritten: Visual Investigation of Beam Search Trees to
  Address Language Model Prompting Challenges
Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges
Thilo Spinner
Rebecca Kehlbeck
Rita Sevastjanova
Tobias Stähle
Daniel A. Keim
Oliver Deussen
Andreas Spitz
Mennatallah El-Assady
34
2
0
17 Oct 2023
MST-GAT: A Multimodal Spatial-Temporal Graph Attention Network for Time
  Series Anomaly Detection
MST-GAT: A Multimodal Spatial-Temporal Graph Attention Network for Time Series Anomaly Detection
Chaoyue Ding
Shiliang Sun
Jing Zhao
AI4TS
43
140
0
17 Oct 2023
IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing
  Interactive Machine Translation Systems
IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems
Xu Huang
Zhirui Zhang
Ruize Gao
Yichao Du
Lemao Liu
Gouping Huang
Shuming Shi
Jiajun Chen
Shujian Huang
VLM
26
0
0
17 Oct 2023
LPFormer: An Adaptive Graph Transformer for Link Prediction
LPFormer: An Adaptive Graph Transformer for Link Prediction
Harry Shomer
Yao Ma
Haitao Mao
Juanhui Li
Bo Wu
Jiliang Tang
38
7
0
17 Oct 2023
Enhanced Transformer Architecture for Natural Language Processing
Enhanced Transformer Architecture for Natural Language Processing
Woohyeon Moon
Taeyoung Kim
Bumgeun Park
Dongsoo Har
30
0
0
17 Oct 2023
Approximating Two-Layer Feedforward Networks for Efficient Transformers
Approximating Two-Layer Feedforward Networks for Efficient Transformers
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
MoE
27
18
0
16 Oct 2023
Motion2Language, unsupervised learning of synchronized semantic motion
  segmentation
Motion2Language, unsupervised learning of synchronized semantic motion segmentation
Karim Radouane
Andon Tchechmedjiev
Julien Lagarde
Sylvie Ranwez
19
4
0
16 Oct 2023
Interpreting and Controlling Vision Foundation Models via Text
  Explanations
Interpreting and Controlling Vision Foundation Models via Text Explanations
Haozhe Chen
Junfeng Yang
Carl Vondrick
Chengzhi Mao
27
2
0
16 Oct 2023
Exploiting User Comments for Early Detection of Fake News Prior to
  Users' Commenting
Exploiting User Comments for Early Detection of Fake News Prior to Users' Commenting
Qiong Nan
Qiang Sheng
Juan Cao
Yongchun Zhu
Danding Wang
Guang Yang
Jintao Li
Kai Shu
52
8
0
16 Oct 2023
Repetition In Repetition Out: Towards Understanding Neural Text
  Degeneration from the Data Perspective
Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
Huayang Li
Tian Lan
Z. Fu
Deng Cai
Lemao Liu
Nigel Collier
Taro Watanabe
Yixuan Su
42
15
0
16 Oct 2023
An Empirical Study of Self-supervised Learning with Wasserstein Distance
An Empirical Study of Self-supervised Learning with Wasserstein Distance
Makoto Yamada
Yuki Takezawa
Guillaume Houry
Kira Michaela Dusterwald
Deborah Sulem
Han Zhao
Yao-Hung Hubert Tsai
40
1
0
16 Oct 2023
A Survey of Graph and Attention Based Hyperspectral Image Classification
  Methods for Remote Sensing Data
A Survey of Graph and Attention Based Hyperspectral Image Classification Methods for Remote Sensing Data
Aryan Vats
Manan Suri
28
2
0
16 Oct 2023
Lexical Entrainment for Conversational Systems
Lexical Entrainment for Conversational Systems
Zhengxiang Shi
Procheta Sen
Aldo Lipani
44
1
0
14 Oct 2023
Machine Learning for Urban Air Quality Analytics: A Survey
Machine Learning for Urban Air Quality Analytics: A Survey
Jindong Han
Weijiao Zhang
Hao Liu
Hui Xiong
AI4CE
80
12
0
14 Oct 2023
Attentive Multi-Layer Perceptron for Non-autoregressive Generation
Attentive Multi-Layer Perceptron for Non-autoregressive Generation
Shuyang Jiang
Jinchao Zhang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
54
0
0
14 Oct 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
Linyang Li
Ke Ren
Yunfan Shao
Pengyu Wang
Xipeng Qiu
31
4
0
13 Oct 2023
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing
  Image-text Retrieval
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval
Qing Ma
Jiancheng Pan
Cong Bai
34
17
0
12 Oct 2023
Previous
123...171819...125126127
Next