Exploiting Deep Representations for Neural Machine Translation

24 October 2018

Tong Zhang

Papers citing "Exploiting Deep Representations for Neural Machine Translation"

50 / 53 papers shown

Title
Auto-Compressing Networks Vaggelis Dorovatas Georgios Paraskevopoulos Alexandros Potamianos 352 2 0 11 Jun 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Yingfeng Luo Tong Zheng Yongyu Mu Yangqiu Song Qinghong Zhang ... Ziqiang Xu Peinan Feng Xiaoqian Liu Tong Xiao Jingbo Zhu AI4CE 1.0K 9 0 09 Mar 2025
The Curse of Depth in Large Language Models Wenfang Sun Xinyuan Song Pengxiang Li Lu Yin Yefeng Zheng Shiwei Liu 360 20 0 09 Feb 2025
Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LNInternational Conference on Learning Representations (ICLR), 2024 Pengxiang Li Lu Yin Shiwei Liu 248 11 0 18 Dec 2024
Exploring the traditional NMT model and Large Language Model for chat translationConference on Machine Translation (WMT), 2024 Jinlong Yang Hengchao Shang Daimeng Wei Jiaxin Guo Zongyao Li ... Yuhao Xie Yuanchang Luo Jiawei Zheng Bin Wei Hao Yang 153 0 0 24 Sep 2024
Integrating Pre-trained Language Model into Neural Machine Translation Soon-Jae Hwang Chang-Sung Jeong 280 2 0 30 Oct 2023
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers Anna Langedijk Hosein Mohebbi Gabriele Sarti Willem H. Zuidema Jaap Jumelet 194 15 0 05 Oct 2023
Layer-wise Representation Fusion for Compositional GeneralizationAAAI Conference on Artificial Intelligence (AAAI), 2023 Yafang Zheng Lei Lin Shantao Liu Binling Wang Zhaohong Lai Wenhao Rao Biao Fu Yidong Chen Xiaodon Shi AI4CE 325 4 0 20 Jul 2023
Learning to Compose Representations of Different Encoder Layers towards Improving Compositional GeneralizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Lei Lin Shuangtao Li Yafang Zheng Biao Fu Shantao Liu Yidong Chen Xiaodon Shi CoGe 258 3 0 20 May 2023
Don't Be So Sure! Boosting ASR Decoding via Confidence RelaxationAAAI Conference on Artificial Intelligence (AAAI), 2022 Tomer Wullach Shlomo E. Chazan 145 1 0 27 Dec 2022
GTrans: Grouping and Fusing Transformer Layers for Neural Machine TranslationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022 Jian Yang Yuwei Yin Liqun Yang Shuming Ma Haoyang Huang Dongdong Zhang Furu Wei Zhoujun Li AI4CE 194 22 0 29 Jul 2022
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation LearningAAAI Conference on Artificial Intelligence (AAAI), 2022 Xiao Xu Chenfei Wu Shachar Rosenman Vasudev Lal Wanxiang Che Nan Duan 204 90 0 17 Jun 2022
B2T Connection: Serving Stability and Performance in Deep TransformersAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Sho Takase Shun Kiyono Sosuke Kobayashi Jun Suzuki 286 15 0 01 Jun 2022
An Empirical Study of Training End-to-End Vision-and-Language TransformersComputer Vision and Pattern Recognition (CVPR), 2021 Zi-Yi Dou Yichong Xu Zhe Gan Jianfeng Wang Shuohang Wang ... Pengchuan Zhang Lu Yuan Nanyun Peng Zicheng Liu Michael Zeng VLM 233 426 0 03 Nov 2021
Recurrent multiple shared layers in Depth for Neural Machine Translation Guoliang Li Yiyang Li MoE 110 2 0 23 Aug 2021
Residual Tree Aggregation of Layers for Neural Machine Translation Guoliang Li Yiyang Li 112 0 0 19 Jul 2021
Rethinking Skip Connection with Layer Normalization in Transformers and ResNetsInternational Conference on Computational Linguistics (COLING), 2020 Fenglin Liu Xuancheng Ren Zhiyuan Zhang Xu Sun Yuexian Zou AI4CE 140 78 0 15 May 2021
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML Jiaquan Ye Xianbiao Qi Yelin He Yihao Chen Dengyi Gu Peng Gao Rong Xiao LMTD 154 58 0 05 May 2021
Text Compression-aided Transformer EncodingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 Z. Li Zhuosheng Zhang Hai Zhao Rui Wang Kehai Chen Masao Utiyama Eiichiro Sumita AI4CE 118 47 0 11 Feb 2021
Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative TransformersConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 Machel Reid Edison Marrese-Taylor Y. Matsuo MoE 320 57 0 01 Jan 2021
Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence LearningInternational Conference on Learning Representations (ICLR), 2020 Xuebo Liu Longyue Wang Yang Li Liang Ding Lidia S. Chao Zhaopeng Tu AI4CE 171 37 0 29 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer NetworkAAAI Conference on Artificial Intelligence (AAAI), 2020 Jiayi Ji Yunpeng Luo Xiaoshuai Sun Fuhai Chen Gen Luo Yongjian Wu Yue Gao Rongrong Ji ViT 180 199 0 13 Dec 2020
Layer-Wise Multi-View Learning for Neural Machine Translation Qiang Wang Changliang Li Yue Zhang Tong Xiao Jingbo Zhu 66 4 0 03 Nov 2020
On the Sub-Layer Functionalities of Transformer DecoderFindings (Findings), 2020 Yilin Yang Longyue Wang Shuming Shi Prasad Tadepalli Stefan Lee Zhaopeng Tu 214 28 0 06 Oct 2020
On the Sparsity of Neural Machine Translation ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 Yong Wang Longyue Wang Victor O.K. Li Zhaopeng Tu MoE 128 11 0 06 Oct 2020
Graph-to-Sequence Neural Machine Translation Sufeng Duan Hai Zhao Rui Wang 78 1 0 16 Sep 2020
Exploiting Deep Sentential Context for Expressive End-to-End Speech SynthesisInterspeech (Interspeech), 2020 Fengyu Yang Shan Yang Qinghua Wu Yujun Wang Lei Xie 109 6 0 03 Aug 2020
Rewiring the Transformer with Depth-Wise LSTMsInternational Conference on Language Resources and Evaluation (LREC), 2020 Hongfei Xu Yang Song Qiuhui Liu Josef van Genabith Deyi Xiong 198 7 0 13 Jul 2020
Learning Source Phrase Representations for Neural Machine Translation Hongfei Xu Josef van Genabith Deyi Xiong Qiuhui Liu Jingyi Zhang 79 21 0 25 Jun 2020
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding Fenglin Liu Xuancheng Ren Guangxiang Zhao Chenyu You Xuewei Ma Xian Wu Xu Sun 410 2 0 16 May 2020
Multiscale Collaborative Deep Models for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020 Xiangpeng Wei Heng Yu Yue Hu Yue Zhang Rongxiang Weng Weihua Luo 203 29 0 29 Apr 2020
GRET: Global Representation Enhanced TransformerAAAI Conference on Artificial Intelligence (AAAI), 2020 Rongxiang Weng Hao-Ran Wei Shujian Huang Heng Yu Lidong Bing Weihua Luo Jiajun Chen 156 9 0 24 Feb 2020
Balancing Cost and Benefit with Tied-Multi TransformersWorkshop on Neural Generation and Translation (WNGT), 2020 Mary Dabre Raphaël Rubino Atsushi Fujita 108 6 0 20 Feb 2020
Explicit Sentence Compression for Neural Machine TranslationAAAI Conference on Artificial Intelligence (AAAI), 2019 Z. Li Rui Wang Kehai Chen Masao Utiyama Eiichiro Sumita Zhuosheng Zhang Hai Zhao 145 31 0 27 Dec 2019
Acquiring Knowledge from Pre-trained Model to Neural Machine TranslationAAAI Conference on Artificial Intelligence (AAAI), 2019 Rongxiang Weng Heng Yu Shujian Huang Shanbo Cheng Weihua Luo 166 70 0 04 Dec 2019
Learning to Reuse Translations: Guiding Neural Machine Translation with ExamplesEuropean Conference on Artificial Intelligence (ECAI), 2019 Qian Cao Shaohui Kuang Deyi Xiong 175 8 0 25 Nov 2019
Neuron Interaction Based Representation Composition for Neural Machine TranslationAAAI Conference on Artificial Intelligence (AAAI), 2019 Jian Li Xing Wang Baosong Yang Shuming Shi Michael R. Lyu Zhaopeng Tu 125 18 0 22 Nov 2019
Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered NeuronsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019 Jie Hao Xing Wang Shuming Shi Jinfeng Zhang Zhaopeng Tu 160 12 0 04 Sep 2019
Self-Attention with Structural Position RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019 Xing Wang Zhaopeng Tu Longyue Wang Shuming Shi MILM 162 75 0 01 Sep 2019
Multi-Layer Softmaxing during Training Neural Machine Translation for Flexible Decoding with Fewer Layers Mary Dabre Atsushi Fujita AI4CE 94 0 0 27 Aug 2019
Improving Neural Machine Translation with Pre-trained Representation Rongxiang Weng Heng Yu Shujian Huang Weihua Luo Jiajun Chen 155 6 0 21 Aug 2019
UdS Submission for the WMT 19 Automatic Post-Editing TaskConference on Machine Translation (WMT), 2019 Hongfei Xu Qiuhui Liu Josef van Genabith 93 4 0 09 Aug 2019
Widening the Representation Bottleneck in Neural Machine Translation with Lexical ShortcutsConference on Machine Translation (WMT), 2019 Denis Emelin Ivan Titov Rico Sennrich 122 10 0 28 Jun 2019
Learning Deep Transformer Models for Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019 Qiang Wang Bei Li Tong Xiao Jingbo Zhu Changliang Li Yang Li Lidia S. Chao 198 732 0 05 Jun 2019
Exploiting Sentential Context for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019 Xing Wang Zhaopeng Tu Longyue Wang Shuming Shi 106 22 0 04 Jun 2019
Convolutional Self-Attention Networks Baosong Yang Longyue Wang Yang Li Lidia S. Chao Zhaopeng Tu 185 130 0 05 Apr 2019
Information Aggregation for Multi-Head Attention with Routing-by-Agreement Jian Li Baosong Yang Zi-Yi Dou Xing Wang Michael R. Lyu Zhaopeng Tu 197 48 0 05 Apr 2019
Modeling Recurrence for Transformer Jie Hao Xing Wang Baosong Yang Longyue Wang Jinfeng Zhang Zhaopeng Tu 240 87 0 05 Apr 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants Hongfei Xu Qiuhui Liu 184 19 0 18 Mar 2019
Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement Zi-Yi Dou Zhaopeng Tu Xing Wang Longyue Wang Shuming Shi Tong Zhang AI4CE 136 59 0 15 Feb 2019