Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1810.10181
Cited By
Exploiting Deep Representations for Neural Machine Translation
24 October 2018
Zi-Yi Dou
Zhaopeng Tu
Xing Wang
Shuming Shi
Tong Zhang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploiting Deep Representations for Neural Machine Translation"
50 / 53 papers shown
Title
Auto-Compressing Networks
Vaggelis Dorovatas
Georgios Paraskevopoulos
Alexandros Potamianos
352
2
0
11 Jun 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingfeng Luo
Tong Zheng
Yongyu Mu
Yangqiu Song
Qinghong Zhang
...
Ziqiang Xu
Peinan Feng
Xiaoqian Liu
Tong Xiao
Jingbo Zhu
AI4CE
1.0K
9
0
09 Mar 2025
The Curse of Depth in Large Language Models
Wenfang Sun
Xinyuan Song
Pengxiang Li
Lu Yin
Yefeng Zheng
Shiwei Liu
360
20
0
09 Feb 2025
Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN
International Conference on Learning Representations (ICLR), 2024
Pengxiang Li
Lu Yin
Shiwei Liu
248
11
0
18 Dec 2024
Exploring the traditional NMT model and Large Language Model for chat translation
Conference on Machine Translation (WMT), 2024
Jinlong Yang
Hengchao Shang
Daimeng Wei
Jiaxin Guo
Zongyao Li
...
Yuhao Xie
Yuanchang Luo
Jiawei Zheng
Bin Wei
Hao Yang
153
0
0
24 Sep 2024
Integrating Pre-trained Language Model into Neural Machine Translation
Soon-Jae Hwang
Chang-Sung Jeong
280
2
0
30 Oct 2023
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
Anna Langedijk
Hosein Mohebbi
Gabriele Sarti
Willem H. Zuidema
Jaap Jumelet
194
15
0
05 Oct 2023
Layer-wise Representation Fusion for Compositional Generalization
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yafang Zheng
Lei Lin
Shantao Liu
Binling Wang
Zhaohong Lai
Wenhao Rao
Biao Fu
Yidong Chen
Xiaodon Shi
AI4CE
325
4
0
20 Jul 2023
Learning to Compose Representations of Different Encoder Layers towards Improving Compositional Generalization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lei Lin
Shuangtao Li
Yafang Zheng
Biao Fu
Shantao Liu
Yidong Chen
Xiaodon Shi
CoGe
258
3
0
20 May 2023
Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Tomer Wullach
Shlomo E. Chazan
145
1
0
27 Dec 2022
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jian Yang
Yuwei Yin
Liqun Yang
Shuming Ma
Haoyang Huang
Dongdong Zhang
Furu Wei
Zhoujun Li
AI4CE
194
22
0
29 Jul 2022
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xiao Xu
Chenfei Wu
Shachar Rosenman
Vasudev Lal
Wanxiang Che
Nan Duan
204
90
0
17 Jun 2022
B2T Connection: Serving Stability and Performance in Deep Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Sho Takase
Shun Kiyono
Sosuke Kobayashi
Jun Suzuki
286
15
0
01 Jun 2022
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Computer Vision and Pattern Recognition (CVPR), 2021
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
233
426
0
03 Nov 2021
Recurrent multiple shared layers in Depth for Neural Machine Translation
Guoliang Li
Yiyang Li
MoE
110
2
0
23 Aug 2021
Residual Tree Aggregation of Layers for Neural Machine Translation
Guoliang Li
Yiyang Li
112
0
0
19 Jul 2021
Rethinking Skip Connection with Layer Normalization in Transformers and ResNets
International Conference on Computational Linguistics (COLING), 2020
Fenglin Liu
Xuancheng Ren
Zhiyuan Zhang
Xu Sun
Yuexian Zou
AI4CE
140
78
0
15 May 2021
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML
Jiaquan Ye
Xianbiao Qi
Yelin He
Yihao Chen
Dengyi Gu
Peng Gao
Rong Xiao
LMTD
154
58
0
05 May 2021
Text Compression-aided Transformer Encoding
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Z. Li
Zhuosheng Zhang
Hai Zhao
Rui Wang
Kehai Chen
Masao Utiyama
Eiichiro Sumita
AI4CE
118
47
0
11 Feb 2021
Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Machel Reid
Edison Marrese-Taylor
Y. Matsuo
MoE
320
57
0
01 Jan 2021
Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning
International Conference on Learning Representations (ICLR), 2020
Xuebo Liu
Longyue Wang
Yang Li
Liang Ding
Lidia S. Chao
Zhaopeng Tu
AI4CE
171
37
0
29 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
AAAI Conference on Artificial Intelligence (AAAI), 2020
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
180
199
0
13 Dec 2020
Layer-Wise Multi-View Learning for Neural Machine Translation
Qiang Wang
Changliang Li
Yue Zhang
Tong Xiao
Jingbo Zhu
66
4
0
03 Nov 2020
On the Sub-Layer Functionalities of Transformer Decoder
Findings (Findings), 2020
Yilin Yang
Longyue Wang
Shuming Shi
Prasad Tadepalli
Stefan Lee
Zhaopeng Tu
214
28
0
06 Oct 2020
On the Sparsity of Neural Machine Translation Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Yong Wang
Longyue Wang
Victor O.K. Li
Zhaopeng Tu
MoE
128
11
0
06 Oct 2020
Graph-to-Sequence Neural Machine Translation
Sufeng Duan
Hai Zhao
Rui Wang
78
1
0
16 Sep 2020
Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis
Interspeech (Interspeech), 2020
Fengyu Yang
Shan Yang
Qinghua Wu
Yujun Wang
Lei Xie
109
6
0
03 Aug 2020
Rewiring the Transformer with Depth-Wise LSTMs
International Conference on Language Resources and Evaluation (LREC), 2020
Hongfei Xu
Yang Song
Qiuhui Liu
Josef van Genabith
Deyi Xiong
198
7
0
13 Jul 2020
Learning Source Phrase Representations for Neural Machine Translation
Hongfei Xu
Josef van Genabith
Deyi Xiong
Qiuhui Liu
Jingyi Zhang
79
21
0
25 Jun 2020
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding
Fenglin Liu
Xuancheng Ren
Guangxiang Zhao
Chenyu You
Xuewei Ma
Xian Wu
Xu Sun
410
2
0
16 May 2020
Multiscale Collaborative Deep Models for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Xiangpeng Wei
Heng Yu
Yue Hu
Yue Zhang
Rongxiang Weng
Weihua Luo
203
29
0
29 Apr 2020
GRET: Global Representation Enhanced Transformer
AAAI Conference on Artificial Intelligence (AAAI), 2020
Rongxiang Weng
Hao-Ran Wei
Shujian Huang
Heng Yu
Lidong Bing
Weihua Luo
Jiajun Chen
156
9
0
24 Feb 2020
Balancing Cost and Benefit with Tied-Multi Transformers
Workshop on Neural Generation and Translation (WNGT), 2020
Mary Dabre
Raphaël Rubino
Atsushi Fujita
108
6
0
20 Feb 2020
Explicit Sentence Compression for Neural Machine Translation
AAAI Conference on Artificial Intelligence (AAAI), 2019
Z. Li
Rui Wang
Kehai Chen
Masao Utiyama
Eiichiro Sumita
Zhuosheng Zhang
Hai Zhao
145
31
0
27 Dec 2019
Acquiring Knowledge from Pre-trained Model to Neural Machine Translation
AAAI Conference on Artificial Intelligence (AAAI), 2019
Rongxiang Weng
Heng Yu
Shujian Huang
Shanbo Cheng
Weihua Luo
166
70
0
04 Dec 2019
Learning to Reuse Translations: Guiding Neural Machine Translation with Examples
European Conference on Artificial Intelligence (ECAI), 2019
Qian Cao
Shaohui Kuang
Deyi Xiong
175
8
0
25 Nov 2019
Neuron Interaction Based Representation Composition for Neural Machine Translation
AAAI Conference on Artificial Intelligence (AAAI), 2019
Jian Li
Xing Wang
Baosong Yang
Shuming Shi
Michael R. Lyu
Zhaopeng Tu
125
18
0
22 Nov 2019
Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered Neurons
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Jie Hao
Xing Wang
Shuming Shi
Jinfeng Zhang
Zhaopeng Tu
160
12
0
04 Sep 2019
Self-Attention with Structural Position Representations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Xing Wang
Zhaopeng Tu
Longyue Wang
Shuming Shi
MILM
162
75
0
01 Sep 2019
Multi-Layer Softmaxing during Training Neural Machine Translation for Flexible Decoding with Fewer Layers
Mary Dabre
Atsushi Fujita
AI4CE
94
0
0
27 Aug 2019
Improving Neural Machine Translation with Pre-trained Representation
Rongxiang Weng
Heng Yu
Shujian Huang
Weihua Luo
Jiajun Chen
155
6
0
21 Aug 2019
UdS Submission for the WMT 19 Automatic Post-Editing Task
Conference on Machine Translation (WMT), 2019
Hongfei Xu
Qiuhui Liu
Josef van Genabith
93
4
0
09 Aug 2019
Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts
Conference on Machine Translation (WMT), 2019
Denis Emelin
Ivan Titov
Rico Sennrich
122
10
0
28 Jun 2019
Learning Deep Transformer Models for Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Qiang Wang
Bei Li
Tong Xiao
Jingbo Zhu
Changliang Li
Yang Li
Lidia S. Chao
198
732
0
05 Jun 2019
Exploiting Sentential Context for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Xing Wang
Zhaopeng Tu
Longyue Wang
Shuming Shi
106
22
0
04 Jun 2019
Convolutional Self-Attention Networks
Baosong Yang
Longyue Wang
Yang Li
Lidia S. Chao
Zhaopeng Tu
185
130
0
05 Apr 2019
Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Jian Li
Baosong Yang
Zi-Yi Dou
Xing Wang
Michael R. Lyu
Zhaopeng Tu
197
48
0
05 Apr 2019
Modeling Recurrence for Transformer
Jie Hao
Xing Wang
Baosong Yang
Longyue Wang
Jinfeng Zhang
Zhaopeng Tu
240
87
0
05 Apr 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
184
19
0
18 Mar 2019
Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement
Zi-Yi Dou
Zhaopeng Tu
Xing Wang
Longyue Wang
Shuming Shi
Tong Zhang
AI4CE
136
59
0
15 Feb 2019
1
2
Next