ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.0473
  4. Cited By
Neural Machine Translation by Jointly Learning to Align and Translate

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat
ArXivPDFHTML

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 6,328 papers shown
Title
Transformers and Cortical Waves: Encoders for Pulling In Context Across
  Time
Transformers and Cortical Waves: Encoders for Pulling In Context Across Time
L. Muller
P. Churchland
T. Sejnowski
29
6
0
25 Jan 2024
Explicitly Representing Syntax Improves Sentence-to-layout Prediction of
  Unexpected Situations
Explicitly Representing Syntax Improves Sentence-to-layout Prediction of Unexpected Situations
Wolf Nuyts
Ruben Cartuyvels
Marie-Francine Moens
57
1
0
25 Jan 2024
A comparative study of zero-shot inference with large language models
  and supervised modeling in breast cancer pathology classification
A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification
Madhumita Sushil
T. Zack
Divneet Mandair
Zhiwei Zheng
Ahmed Wali
Yan-Ning Yu
Yuwei Quan
A. Butte
41
6
0
25 Jan 2024
Towards Trustable Language Models: Investigating Information Quality of
  Large Language Models
Towards Trustable Language Models: Investigating Information Quality of Large Language Models
Rick Rejeleene
Xiaowei Xu
John R. Talburt
HILM
34
2
0
23 Jan 2024
Deep Learning Based Simulators for the Phosphorus Removal Process
  Control in Wastewater Treatment via Deep Reinforcement Learning Algorithms
Deep Learning Based Simulators for the Phosphorus Removal Process Control in Wastewater Treatment via Deep Reinforcement Learning Algorithms
Esmaeel Mohammadi
Mikkel Stokholm-Bjerregaard
A. A. Hansen
Per Halkjaer Nielsen
D. O. Arroyo
Petar Durdevic
AI4CE
16
12
0
23 Jan 2024
LLMCheckup: Conversational Examination of Large Language Models via
  Interpretability Tools and Self-Explanations
LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations
Qianli Wang
Tatiana Anikina
Nils Feldhus
Josef van Genabith
Leonhard Hennig
Sebastian Möller
ELM
LRM
20
8
0
23 Jan 2024
Boosting Unknown-number Speaker Separation with Transformer
  Decoder-based Attractor
Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor
Younglo Lee
Shukjae Choi
Byeonghak Kim
Zhong-Qiu Wang
Shinji Watanabe
MoE
18
9
0
23 Jan 2024
SEDAC: A CVAE-Based Data Augmentation Method for Security Bug Report
  Identification
SEDAC: A CVAE-Based Data Augmentation Method for Security Bug Report Identification
Y. Liao
T. Zhang
11
0
0
22 Jan 2024
Attention on Personalized Clinical Decision Support System: Federated
  Learning Approach
Attention on Personalized Clinical Decision Support System: Federated Learning Approach
Chu Myaet Thwal
K. Thar
Ye Lin Tun
Choong Seon Hong
24
22
0
22 Jan 2024
Colorectal Polyp Segmentation in the Deep Learning Era: A Comprehensive
  Survey
Colorectal Polyp Segmentation in the Deep Learning Era: A Comprehensive Survey
Zhenyu Wu
Fengmao Lv
Chenglizhao Chen
Aimin Hao
Shuo Li
ELM
36
10
0
22 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
35
3
0
21 Jan 2024
M3BUNet: Mobile Mean Max UNet for Pancreas Segmentation on CT-Scans
M3BUNet: Mobile Mean Max UNet for Pancreas Segmentation on CT-Scans
Juwita Juwita
Ghulam Mubashar Hassan
Naveed Akhtar
Amitava Datta
34
1
0
18 Jan 2024
MatSciRE: Leveraging Pointer Networks to Automate Entity and Relation
  Extraction for Material Science Knowledge-base Construction
MatSciRE: Leveraging Pointer Networks to Automate Entity and Relation Extraction for Material Science Knowledge-base Construction
Ankan Mullick
Akash Ghosh
G. Chaitanya
Samir Ghui
Tapas Nayak
Seung-Cheol Lee
S. Bhattacharjee
Pawan Goyal
18
10
0
18 Jan 2024
HCVP: Leveraging Hierarchical Contrastive Visual Prompt for Domain
  Generalization
HCVP: Leveraging Hierarchical Contrastive Visual Prompt for Domain Generalization
Guanglin Zhou
Zhongyi Han
Shiming Chen
Erdun Gao
Liming Zhu
Tongliang Liu
Lina Yao
Kun Zhang
37
3
0
18 Jan 2024
Efficient generative adversarial networks using linear
  additive-attention Transformers
Efficient generative adversarial networks using linear additive-attention Transformers
Emilio Morales-Juarez
Gibran Fuentes Pineda
42
3
0
17 Jan 2024
BERTologyNavigator: Advanced Question Answering with BERT-based
  Semantics
BERTologyNavigator: Advanced Question Answering with BERT-based Semantics
Shreya Rajpal
Ricardo Usbeck
32
1
0
17 Jan 2024
Inductive Models for Artificial Intelligence Systems are Insufficient
  without Good Explanations
Inductive Models for Artificial Intelligence Systems are Insufficient without Good Explanations
Udesh Habaraduwa
16
0
0
17 Jan 2024
VeriBug: An Attention-based Framework for Bug-Localization in Hardware
  Designs
VeriBug: An Attention-based Framework for Bug-Localization in Hardware Designs
Giuseppe Stracquadanio
Sourav Medya
Stefano Quer
Debjit Pal
6
2
0
17 Jan 2024
Lost in the Source Language: How Large Language Models Evaluate the
  Quality of Machine Translation
Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation
Xu Huang
Zhirui Zhang
Xiang Geng
Yichao Du
Jiajun Chen
Shujian Huang
53
7
0
12 Jan 2024
Improving the Detection of Small Oriented Objects in Aerial Images
Improving the Detection of Small Oriented Objects in Aerial Images
Chandler Timm C. Doloriel
R. Cajote
ObjD
36
9
0
12 Jan 2024
An approach for mistranslation removal from popular dataset for Indic MT
  Task
An approach for mistranslation removal from popular dataset for Indic MT Task
Sudhansu Bala Das
Leo Raphael Rodrigues
Tapas Kumar Mishra
Bidyut Kr. Patra
22
1
0
12 Jan 2024
YOLO-Former: YOLO Shakes Hand With ViT
YOLO-Former: YOLO Shakes Hand With ViT
J. Khoramdel
A. Moori
Y. Borhani
A. Ghanbarzadeh
Esmaeil Najafi
ViT
24
2
0
11 Jan 2024
Tuning LLMs with Contrastive Alignment Instructions for Machine
  Translation in Unseen, Low-resource Languages
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages
Zhuoyuan Mao
Yen Yu
ALM
21
2
0
11 Jan 2024
Attention versus Contrastive Learning of Tabular Data -- A Data-centric
  Benchmarking
Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking
S. B. Rabbani
Ivan V. Medri
Manar D. Samad
28
8
0
08 Jan 2024
Universal Time-Series Representation Learning: A Survey
Universal Time-Series Representation Learning: A Survey
Patara Trirat
Yooju Shin
Junhyeok Kang
Youngeun Nam
Jihye Na
Minyoung Bae
Joeun Kim
Byunghyun Kim
Jae-Gil Lee
AI4TS
75
15
0
08 Jan 2024
Enhancing Context Through Contrast
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
31
0
0
06 Jan 2024
SPFormer: Enhancing Vision Transformer with Superpixel Representation
SPFormer: Enhancing Vision Transformer with Superpixel Representation
Jieru Mei
Liang-Chieh Chen
Alan Yuille
Cihang Xie
ViT
MDE
21
4
0
05 Jan 2024
MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron
  Captioning
MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron Captioning
Alfirsa Damasyifa Fauzulhaq
Wahyu Parwitayasa
Joseph A. Sugihdharma
M. F. Ridhani
N. Yudistira
34
0
0
05 Jan 2024
A unified multichannel far-field speech recognition system: combining
  neural beamforming with attention based end-to-end model
A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model
Dongdi Zhao
Jianbo Ma
Lu Lu
Jinke Li
Xuan Ji
Lei Zhu
Fuming Fang
Ming Liu
Feijun Jiang
32
1
0
05 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Yiran Song
Qianyu Zhou
Hefei Ling
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
43
14
0
04 Jan 2024
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic
  Token Prediction
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction
Minchan Kim
Myeonghun Jeong
Byoung Jin Choi
Semin Kim
Joun Yeop Lee
Nam Soo Kim
AI4TS
25
4
0
03 Jan 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
47
21
0
30 Dec 2023
Contrastive learning-based agent modeling for deep reinforcement
  learning
Contrastive learning-based agent modeling for deep reinforcement learning
Wenhao Ma
Yu-Cheng Chang
Jie Yang
Yu-Kai Wang
Chin-Teng Lin
OffRL
32
0
0
30 Dec 2023
Hierarchical Aggregations for High-Dimensional Multiplex Graph Embedding
Hierarchical Aggregations for High-Dimensional Multiplex Graph Embedding
K. Abdous
Nairouz Mrabah
Mohamed Bouguessa
25
4
0
28 Dec 2023
Attention-Enhanced Reservoir Computing
Attention-Enhanced Reservoir Computing
Felix Köster
Kazutaka Kanno
Jun Ohkubo
Atsushi Uchida
13
2
0
27 Dec 2023
A Prompt Learning Framework for Source Code Summarization
A Prompt Learning Framework for Source Code Summarization
Weisong Sun
Chunrong Fang
Yudu You
Yuchen Chen
Yi Liu
...
Quanjun Zhang
Hanwei Qian
Wei Zhao
Yang Liu
Zhenyu Chen
LLMAG
50
13
0
26 Dec 2023
Heterogeneous Encoders Scaling In The Transformer For Neural Machine
  Translation
Heterogeneous Encoders Scaling In The Transformer For Neural Machine Translation
J. Hu
Roberto Cavicchioli
Giulia Berardinelli
Alessandro Capotondi
44
2
0
26 Dec 2023
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head
  Translation
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation
Xize Cheng
Rongjie Huang
Linjun Li
Tao Jin
Zehan Wang
Aoxiong Yin
Minglei Li
Xinyu Duan
Changpeng Yang
Zhou Zhao
41
2
0
23 Dec 2023
Deep Learning for Efficient GWAS Feature Selection
Deep Learning for Efficient GWAS Feature Selection
Kexuan Li
29
0
0
22 Dec 2023
How Smooth Is Attention?
How Smooth Is Attention?
Valérie Castin
Pierre Ablin
Gabriel Peyré
AAML
40
9
0
22 Dec 2023
C2FAR: Coarse-to-Fine Autoregressive Networks for Precise Probabilistic
  Forecasting
C2FAR: Coarse-to-Fine Autoregressive Networks for Precise Probabilistic Forecasting
Shane Bergsma
Timothy J. Zeyl
J. R. Anaraki
Lei Guo
BDL
AI4TS
31
10
0
22 Dec 2023
3D Pose Estimation of Two Interacting Hands from a Monocular Event
  Camera
3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera
Christen Millerdurai
D. Luvizon
Viktor Rudnev
André Jonas
Jiayi Wang
Christian Theobalt
Vladislav Golyanik
40
10
0
21 Dec 2023
Real-time Neural Network Inference on Extremely Weak Devices: Agile
  Offloading with Explainable AI
Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI
Kai Huang
Wei Gao
27
35
0
21 Dec 2023
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic
  Tensor Selection
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection
Kai Huang
Boyuan Yang
Wei Gao
32
19
0
21 Dec 2023
HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model
  for online comments
HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments
Neeraj Kumar Singh
Koyel Ghosh
Joy Mahapatra
Utpal Garain
Apurbalal Senapati
27
0
0
20 Dec 2023
Cross-Modal Reasoning with Event Correlation for Video Question
  Answering
Cross-Modal Reasoning with Event Correlation for Video Question Answering
Chengxiang Yin
Zhengping Che
Kun Wu
Zhiyuan Xu
Qinru Qiu
Jian Tang
35
0
0
20 Dec 2023
Auto311: A Confidence-guided Automated System for Non-emergency Calls
Auto311: A Confidence-guided Automated System for Non-emergency Calls
Zirong Chen
Xutong Sun
Yuanhe Li
Meiyi Ma
31
1
0
19 Dec 2023
Predicting Line-Level Defects by Capturing Code Contexts with
  Hierarchical Transformers
Predicting Line-Level Defects by Capturing Code Contexts with Hierarchical Transformers
Parvez Mahbub
Mohammad Masudur Rahman
18
3
0
19 Dec 2023
Bridging Logic and Learning: A Neural-Symbolic Approach for Enhanced
  Reasoning in Neural Models (ASPER)
Bridging Logic and Learning: A Neural-Symbolic Approach for Enhanced Reasoning in Neural Models (ASPER)
Fadi Al Machot
22
2
0
18 Dec 2023
APE-then-QE: Correcting then Filtering Pseudo Parallel Corpora for MT
  Training Data Creation
APE-then-QE: Correcting then Filtering Pseudo Parallel Corpora for MT Training Data Creation
Akshay Batheja
S. Deoghare
Diptesh Kanojia
Pushpak Bhattacharyya
22
0
0
18 Dec 2023
Previous
123...141516...125126127
Next