Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.07816
Cited By
Mixture Models for Diverse Machine Translation: Tricks of the Trade
20 February 2019
T. Shen
Myle Ott
Michael Auli
MarcÁurelio Ranzato
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mixture Models for Diverse Machine Translation: Tricks of the Trade"
34 / 34 papers shown
Title
RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models
Fujun Zhang
Xiangdong Su
31
0
0
13 May 2025
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
119
22
0
20 May 2024
Mixture of partially linear experts
Yeongsan Hwang
Byungtae Seo
Sangkon Oh
16
0
0
05 May 2024
Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
Tianhui Zhang
Bei Peng
Danushka Bollegala
LRM
27
7
0
25 Apr 2024
MAD Speech: Measures of Acoustic Diversity of Speech
Matthieu Futeral
A. Agostinelli
Marco Tagliasacchi
Neil Zeghidour
Eugene Kharitonov
51
1
0
16 Apr 2024
AmbigNLG: Addressing Task Ambiguity in Instruction for NLG
Ayana Niwa
Hayate Iso
36
4
0
27 Feb 2024
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Yuu Jinnai
Ukyo Honda
Tetsuro Morimura
Peinan Zhang
31
6
0
10 Jan 2024
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning
Amey P. Pasarkar
Adji Bousso Dieng
27
11
0
19 Oct 2023
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer
Boan Liu
Liang Ding
Li Shen
Keqin Peng
Yu Cao
Dazhao Cheng
Dacheng Tao
MoE
36
7
0
15 Oct 2023
Teacher-Student Architecture for Knowledge Distillation: A Survey
Chengming Hu
Xuan Li
Danyang Liu
Haolun Wu
Xi Chen
Ju Wang
Xue Liu
21
16
0
08 Aug 2023
Natural Language Generation for Advertising: A Survey
Soichiro Murakami
Sho Hoshino
Peinan Zhang
14
10
0
22 Jun 2023
Adversarial Clean Label Backdoor Attacks and Defenses on Text Classification Systems
Ashim Gupta
Amrith Krishna
AAML
22
16
0
31 May 2023
Multipath agents for modular multitask ML systems
Andrea Gesmundo
28
1
0
06 Feb 2023
Best-
k
k
k
Search Algorithm for Neural Text Generation
Jiacheng Xu
Caiming Xiong
Silvio Savarese
Yingbo Zhou
35
5
0
22 Nov 2022
DeepGen: Diverse Search Ad Generation and Real-Time Customization
Konstantin Golobokov
Junyi Chai
Victor Ye Dong
Mandy Gu
Bingyu Chi
Jie Cao
Yulan Yan
Yi Liu
23
8
0
06 Aug 2022
Learning to Diversify for Product Question Generation
Haggai Roitman
Uriel Singer
Yotam Eshel
A. Nus
E. Kiperwasser
16
1
0
06 Jul 2022
Exploring Diversity in Back Translation for Low-Resource Machine Translation
Laurie Burchell
Alexandra Birch
Kenneth Heafield
29
15
0
01 Jun 2022
A Well-Composed Text is Half Done! Composition Sampling for Diverse Conditional Generation
Shashi Narayan
Gonccalo Simoes
Yao-Min Zhao
Joshua Maynez
Dipanjan Das
Michael Collins
Mirella Lapata
29
30
0
28 Mar 2022
Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts
W. Yu
Chenguang Zhu
Lianhui Qin
Zhihan Zhang
Tong Zhao
Meng Jiang
LRM
22
31
0
14 Mar 2022
EAG: Extract and Generate Multi-way Aligned Corpus for Complete Multi-lingual Neural Machine Translation
Yulin Xu
Zhen Yang
Fandong Meng
JieZhou
25
3
0
04 Mar 2022
WeTS: A Benchmark for Translation Suggestion
Zhen Yang
Fandong Meng
Yingxue Zhang
Ernan Li
Jie Zhou
VLM
19
11
0
11 Oct 2021
Taming Sparsely Activated Transformer with Stochastic Experts
Simiao Zuo
Xiaodong Liu
Jian Jiao
Young Jin Kim
Hany Hassan
Ruofei Zhang
T. Zhao
Jianfeng Gao
MoE
39
108
0
08 Oct 2021
Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy
Shaolei Zhang
Yang Feng
MoE
20
39
0
11 Sep 2021
Mixup Decoding for Diverse Machine Translation
Jicheng Li
Pengzhi Gao
Xuanfu Wu
Yang Feng
Zhongjun He
Hua-Hong Wu
Haifeng Wang
30
14
0
08 Sep 2021
Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives
Ben Saunders
Necati Cihan Camgöz
Richard Bowden
SLR
27
50
0
23 Jul 2021
multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning
Swarnadeep Saha
Prateek Yadav
Joey Tianyi Zhou
ReLM
LRM
16
26
0
02 Jun 2021
Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection
D. Beddiar
Md Saroar Jahan
Mourad Oussalah
19
82
0
25 May 2021
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
Shailaja Keyur Sampat
Akshay Kumar
Yezhou Yang
Chitta Baral
21
26
0
13 Apr 2021
Attention Forcing for Machine Translation
Qingyun Dou
Yiting Lu
Potsawee Manakul
Xixin Wu
Mark J. F. Gales
23
7
0
02 Apr 2021
Graph Classification by Mixture of Diverse Experts
Fenyu Hu
Liping Wang
Shu Wu
Liang Wang
Tieniu Tan
36
10
0
29 Mar 2021
Supporting Clustering with Contrastive Learning
Dejiao Zhang
Feng Nan
Xiaokai Wei
Shang-Wen Li
Henghui Zhu
Kathleen McKeown
Ramesh Nallapati
Andrew O. Arnold
Bing Xiang
SSL
38
195
0
24 Mar 2021
Few-shot Sequence Learning with Transformers
Lajanugen Logeswaran
Ann Lee
Myle Ott
Honglak Lee
MarcÁurelio Ranzato
Arthur Szlam
ViT
31
12
0
17 Dec 2020
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
M. Krikun
Noam M. Shazeer
Z. Chen
MoE
20
1,106
0
30 Jun 2020
Learning to Make Generalizable and Diverse Predictions for Retrosynthesis
Benson Chen
T. Shen
Tommi Jaakkola
Regina Barzilay
16
46
0
21 Oct 2019
1