Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat
arXiv:1409.0473

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 6,024 papers shown
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan
Shuo Zhang
Zerui Wang
Lijuan Jiang
Wenwen Qu
...
Dahua Lin
Yonggang Wen
Xin Jin
Tianwei Zhang
Peng Sun
73
8
0
29 Jul 2024
Sentiment Analysis of Lithuanian Online Reviews Using Large Language Models
Brigita Vileikytė
M. Lukoševičius
Lukas Stankevicius
20
1
0
29 Jul 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
77
1
0
29 Jul 2024
Deep Learning Based Crime Prediction Models: Experiments and Analysis
Rittik Basak Utsha
Muhtasim Noor Alif
Yeasir Rayhan
T. Hashem
Mohammad Eunus Ali
37
0
0
27 Jul 2024
Rethinking Attention Module Design for Point Cloud Analysis
Chengzhi Wu
Kaige Wang
Zeyun Zhong
Hao Fu
Junwei Zheng
Jiaming Zhang
Julius Pfrommer
Jürgen Beyerer
3DPC
51
1
0
27 Jul 2024
Towards the Terminator Economy: Assessing Job Exposure to AI through LLMs
Emilio Colombo
Fabio Mercorio
Mario Mezzanzanica
Antonio Serino
36
1
0
27 Jul 2024
The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs
Aleix Sant
Carlos Escolano
Audrey Mash
Francesca de Luca Fornaciari
Maite Melero
36
4
0
26 Jul 2024
Coupling Speech Encoders with Downstream Text Models
Ciprian Chelba
J. Schalkwyk
AuLLM
45
0
0
24 Jul 2024
NarrationDep: Narratives on Social Media For Automatic Depression Detection
Hamad Zogan
Imran Razzak
Shoaib Jameel
Guandong Xu
29
0
0
24 Jul 2024
SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Longtao Jiang
Min Wang
Zecheng Li
Yao Fang
Wen-gang Zhou
Houqiang Li
SLR
39
2
0
23 Jul 2024
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training
Cheng Luo
Jiawei Zhao
Zhuoming Chen
Beidi Chen
A. Anandkumar
37
3
0
22 Jul 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
39
4
0
21 Jul 2024
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
D. Hagos
Rick Battle
Danda B. Rawat
LM&MA
OffRL
40
23
0
20 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
42
8
0
19 Jul 2024
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition
Gagan Bhatia
El Moatez Billah Nagoudi
Fakhraddin Alwajih
Muhammad Abdul-Mageed
34
3
0
18 Jul 2024
Dynamic Sentiment Analysis with Local Large Language Models using Majority Voting: A Study on Factors Affecting Restaurant Evaluation
Junichiro Niimi
35
3
0
18 Jul 2024
LLMs-in-the-loop Part-1: Expert Small AI Models for Bio-Medical Text Translation
Bunyamin Keles
Murat Gunay
Serdar I. Caglar
LM&MA
34
2
0
16 Jul 2024
Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation
Olga Zatsarynna
Emad Bahrami
Yazan Abu Farha
Gianpiero Francesca
Juergen Gall
43
1
0
16 Jul 2024
Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data
Tim Elsner
Paula Usinger
Victor Czech
Gregor Kobsik
Yanjiang He
I. Lim
Leif Kobbelt
44
1
0
16 Jul 2024
Augmented Neural Fine-Tuning for Efficient Backdoor Purification
Nazmul Karim
Abdullah Al Arafat
Umar Khalid
Zhishan Guo
Nazanin Rahnavard
AAML
40
0
0
14 Jul 2024
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers
Sukjun Hwang
Aakash Lahoti
Tri Dao
Albert Gu
Mamba
62
12
0
13 Jul 2024
Self-training Language Models for Arithmetic Reasoning
Marek Kadlčík
Michal Štefánik
KELM
ReLM
OffRL
LRM
35
1
0
11 Jul 2024
Predicting Heart Failure with Attention Learning Techniques Utilizing Cardiovascular Data
Ershadul Haque
Manoranjan Paul
Faranak Tohidi
21
0
0
11 Jul 2024
Foundation Model Engineering: Engineering Foundation Models Just as Engineering Software
Dezhi Ran
Mengzhou Wu
Wei Yang
Tao Xie
AI4CE
39
1
0
11 Jul 2024
How Well Can a Long Sequence Model Model Long Sequences? Comparing Architechtural Inductive Biases on Long-Context Abilities
Jerry Huang
57
7
0
11 Jul 2024
Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram
Ming-Liang Zhang
Zhong-Zhi Li
Fei Yin
Liang Lin
Cheng-Lin Liu
LRM
24
6
0
10 Jul 2024
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
Yung-Sung Chuang
Linlu Qiu
Cheng-Yu Hsieh
Ranjay Krishna
Yoon Kim
James R. Glass
HILM
18
35
0
09 Jul 2024
TeVAE: A Variational Autoencoder Approach for Discrete Online Anomaly Detection in Variable-state Multivariate Time-series Data
Lucas Correia
Jan-Christoph Goos
Philipp Klein
Thomas Bäck
Anna V. Kononova
24
0
0
09 Jul 2024
Source Code Summarization in the Era of Large Language Models
Weisong Sun
Yun Miao
Yuekang Li
Hongyu Zhang
Chunrong Fang
Yi Liu
Gelei Deng
Yang Liu
Zhenyu Chen
ELM
55
14
0
09 Jul 2024
AutoTask: Task Aware Multi-Faceted Single Model for Multi-Task Ads Relevance
Shouchang Guo
Sonam Damani
Keng-hao Chang
37
0
0
09 Jul 2024
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study
Aniruddha Roy
Pretam Ray
Ayush Maheshwari
Sudeshna Sarkar
Pawan Goyal
34
1
0
09 Jul 2024
Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Kai Shen
Lingfei Wu
Siliang Tang
Fangli Xu
Bo Long
Yueting Zhuang
Jian Pei
35
0
0
06 Jul 2024
Looking into Black Box Code Language Models
Muhammad Umair Haider
Umar Farooq
A. B. Siddique
Mark Marron
39
2
0
05 Jul 2024
Vision Mamba for Classification of Breast Ultrasound Images
Ali Nasiri-Sarvi
Mahdi S. Hosseini
Hassan Rivaz
44
5
0
04 Jul 2024
On the Anatomy of Attention
Nikhil Khatri
Tuomas Laakkonen
Jonathon Liu
Vincent Wang-Maścianica
3DV
52
1
0
02 Jul 2024
SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model
Lingyue Fu
Hao Guan
Kounianhua Du
Jianghao Lin
Wei Xia
Weinan Zhang
Ruiming Tang
Yasheng Wang
Yong Yu
AI4Ed
KELM
RALM
42
5
0
01 Jul 2024
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Nicy Scaria
Silvester John Joseph Kennedy
Deepak N. Subramani
MU
19
2
0
01 Jul 2024
ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization
Chunrong Fang
Weisong Sun
Yuchen Chen
Xiao Chen
Zhao Wei
Quanjun Zhang
Yudu You
Bin Luo
Yang Liu
Zhenyu Chen
AI4TS
48
12
0
01 Jul 2024
Invariant Correlation of Representation with Label: Enhancing Domain Generalization in Noisy Environments
Gaojie Jin
Ronghui Mu
Xinping Yi
Xiaowei Huang
Lijun Zhang
67
0
0
01 Jul 2024
A Comparative Study of Quality Evaluation Methods for Text Summarization
Huyen Nguyen
Haihua Chen
Lavanya Pobbathi
Junhua Ding
ELM
43
5
0
30 Jun 2024
Virtual Context: Enhancing Jailbreak Attacks with Special Token Injection
Yuqi Zhou
Lin Lu
Hanchi Sun
Pan Zhou
Lichao Sun
39
9
0
28 Jun 2024
Computational Politeness in Natural Language Processing: A Survey
Priyanshu Priya
Mauajama Firdaus
Asif Ekbal
42
10
0
28 Jun 2024
Hack Me If You Can: Aggregating AutoEncoders for Countering Persistent Access Threats Within Highly Imbalanced Data
Sidahmed Benabderrahmane
Ngoc Hoang
Petko Valtchev
James Cheney
Talal Rahwan
19
3
0
27 Jun 2024
Estimating Long-term Heterogeneous Dose-response Curve: Generalization Bound Leveraging Optimal Transport Weights
Zeqin Yang
Weilin Chen
Ruichu Cai
Yuguang Yan
Zhifeng Hao
Zhipeng Yu
Zhichao Zou
Jixing Xu
Zhen Peng
Jiecheng Guo
64
3
0
27 Jun 2024
On the Role of Visual Grounding in VQA
Daniel Reich
Tanja Schultz
21
1
0
26 Jun 2024
Sequential Disentanglement by Extracting Static Information From A Single Sequence Element
Nimrod Berman
Ilan Naiman
Idan Arbiv
Gal Fadlon
Omri Azencot
CoGe
42
4
0
26 Jun 2024
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval
Weitong Cai
Jiabo Huang
Shaogang Gong
Hailin Jin
Yang Liu
44
0
0
25 Jun 2024
Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning
Tianfu Wang
Li Shen
Qilin Fan
Tong Xu
Tongliang Liu
Hui Xiong
28
4
0
25 Jun 2024
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization
Cheng-Yu Hsieh
Yung-Sung Chuang
Chun-Liang Li
Zifeng Wang
Long T. Le
...
James R. Glass
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
48
31
0
23 Jun 2024
Latent diffusion models for parameterization and data assimilation of facies-based geomodels
Guido Di Federico
L. Durlofsky
DiffM
AI4CE
33
2
0
21 Jun 2024