Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.10181
Cited By
Exploiting Deep Representations for Neural Machine Translation
24 October 2018
Zi-Yi Dou
Zhaopeng Tu
Xing Wang
Shuming Shi
Tong Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploiting Deep Representations for Neural Machine Translation"
24 / 24 papers shown
Title
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Yingfeng Luo
Tong Zheng
Yongyu Mu
Yangqiu Song
Qinghong Zhang
...
Ziqiang Xu
Peinan Feng
Xiaoqian Liu
Tong Xiao
Jingbo Zhu
AI4CE
266
0
0
09 Mar 2025
The Curse of Depth in Large Language Models
Wenfang Sun
Xinyuan Song
Pengxiang Li
Lu Yin
Yefeng Zheng
Shiwei Liu
77
5
0
09 Feb 2025
Exploring the traditional NMT model and Large Language Model for chat translation
Jinlong Yang
Hengchao Shang
Daimeng Wei
Jiaxin Guo
Zongyao Li
...
Yuhao Xie
Yuanchang Luo
Jiawei Zheng
Bin Wei
Hao Yang
32
0
0
24 Sep 2024
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
Anna Langedijk
Hosein Mohebbi
Gabriele Sarti
Willem H. Zuidema
Jaap Jumelet
34
10
0
05 Oct 2023
Layer-wise Representation Fusion for Compositional Generalization
Yafang Zheng
Lei Lin
Shantao Liu
Binling Wang
Zhaohong Lai
Wenhao Rao
Biao Fu
Yidong Chen
Xiaodon Shi
AI4CE
50
2
0
20 Jul 2023
Learning to Compose Representations of Different Encoder Layers towards Improving Compositional Generalization
Lei Lin
Shuangtao Li
Yafang Zheng
Biao Fu
Shantao Liu
Yidong Chen
Xiaodon Shi
CoGe
32
3
0
20 May 2023
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
38
369
0
03 Nov 2021
Residual Tree Aggregation of Layers for Neural Machine Translation
Guoliang Li
Yiyang Li
45
0
0
19 Jul 2021
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML
Jiaquan Ye
Xianbiao Qi
Yelin He
Yihao Chen
Dengyi Gu
Peng Gao
Rong Xiao
LMTD
34
48
0
05 May 2021
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
54
170
0
13 Dec 2020
On the Sub-Layer Functionalities of Transformer Decoder
Yilin Yang
Longyue Wang
Shuming Shi
Prasad Tadepalli
Stefan Lee
Zhaopeng Tu
29
27
0
06 Oct 2020
Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis
Fengyu Yang
Shan Yang
Qinghua Wu
Yujun Wang
Lei Xie
39
5
0
03 Aug 2020
Rewiring the Transformer with Depth-Wise LSTMs
Hongfei Xu
Yang Song
Qiuhui Liu
Josef van Genabith
Deyi Xiong
49
6
0
13 Jul 2020
Multiscale Collaborative Deep Models for Neural Machine Translation
Xiangpeng Wei
Heng Yu
Yue Hu
Yue Zhang
Rongxiang Weng
Weihua Luo
27
28
0
29 Apr 2020
Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered Neurons
Jie Hao
Xing Wang
Shuming Shi
Jinfeng Zhang
Zhaopeng Tu
29
12
0
04 Sep 2019
Convolutional Self-Attention Networks
Baosong Yang
Longyue Wang
Derek F. Wong
Lidia S. Chao
Zhaopeng Tu
24
124
0
05 Apr 2019
Modeling Recurrence for Transformer
Jie Hao
Xing Wang
Baosong Yang
Longyue Wang
Jinfeng Zhang
Zhaopeng Tu
45
85
0
05 Apr 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
42
19
0
18 Mar 2019
Context-Aware Self-Attention Networks
Baosong Yang
Jian Li
Derek F. Wong
Lidia S. Chao
Xing Wang
Zhaopeng Tu
39
113
0
15 Feb 2019
Multi-Head Attention with Disagreement Regularization
Jian Li
Zhaopeng Tu
Baosong Yang
Michael R. Lyu
Tong Zhang
27
145
0
24 Oct 2018
Tied Multitask Learning for Neural Speech Translation
Antonios Anastasopoulos
David Chiang
102
172
0
19 Feb 2018
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhehuai Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,750
0
26 Sep 2016
Neural versus Phrase-Based Machine Translation Quality: a Case Study
L. Bentivogli
Arianna Bisazza
Mauro Cettolo
Marcello Federico
191
328
0
16 Aug 2016
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
220
7,930
0
17 Aug 2015
1