1508.07909
Cited By
Neural Machine Translation of Rare Words with Subword Units
31 August 2015
Rico Sennrich
Barry Haddow
Alexandra Birch
Papers citing
"Neural Machine Translation of Rare Words with Subword Units"
50 / 3,808 papers shown
Title
InferDPT: Privacy-Preserving Inference for Black-box Large Language Model
Meng Tong
Kejiang Chen
Jie Zhang
Yuang Qi
Weiming Zhang
Neng H. Yu
Tianwei Zhang
Zhikun Zhang
SILM
52
2
0
18 Oct 2023
InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation
Renzhi Wang
Jing Li
Piji Li
DiffM
37
2
0
18 Oct 2023
Recasting Continual Learning as Sequence Modeling
Soochan Lee
Jaehyeon Son
Gunhee Kim
CLL
25
9
0
18 Oct 2023
MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction
Jingheng Ye
Hai-Tao Zheng
Yangning Li
Hai-Tao Zheng
18
1
0
18 Oct 2023
Learn Your Tokens: Word-Pooled Tokenization for Language Modeling
Avijit Thawani
Saurabh Ghanekar
Xiaoyuan Zhu
Jay Pujara
43
4
0
17 Oct 2023
Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression
Adam Block
Dylan J. Foster
Akshay Krishnamurthy
Max Simchowitz
Cyril Zhang
38
4
0
17 Oct 2023
Enhancing Neural Machine Translation with Semantic Units
Langlin Huang
Shuhao Gu
Zhuocheng Zhang
Yang Feng
49
4
0
17 Oct 2023
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation
Jaap Jumelet
Michael Hanna
Marianne de Heer Kloots
Anna Langedijk
Charlotte Pouw
Oskar van der Wal
34
3
0
17 Oct 2023
Leveraging Diverse Semantic-based Audio Pretrained Models for Singing Voice Conversion
Xueyao Zhang
Yicheng Gu
Haopeng Chen
Zihao Fang
Lexiao Zou
Junan Zhang
Liumeng Xue
Jinchao Zhang
Jie Zhou
Zhizheng Wu
DiffM
43
1
0
17 Oct 2023
Approximating Two-Layer Feedforward Networks for Efficient Transformers
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
MoE
27
18
0
16 Oct 2023
Optimized Tokenization for Transcribed Error Correction
Tomer Wullach
Shlomo E. Chazan
34
0
0
16 Oct 2023
Rethinking Relation Classification with Graph Meaning Representations
Li Zhou
Wenyu Chen
DingYi Zeng
Hong Qu
Daniel Hershcovich
AI4CE
30
0
0
15 Oct 2023
Large Language Model-Aware In-Context Learning for Code Generation
Jia Li
Ge Li
Chongyang Tao
Jia Li
Huangzhao Zhang
Fang Liu
Zhi Jin
56
30
0
15 Oct 2023
Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling
Tiberiu Boros
Stefan Daniel Dumitrescu
Ionut Mironica
Radu Chivereanu
GAN
24
1
0
14 Oct 2023
Tokenizer Choice For LLM Training: Negligible or Crucial?
Mehdi Ali
Michael Fromm
Klaudia Thellmann
Richard Rutmann
Max Lübbering
...
Malte Ostendorff
Samuel Weinbach
R. Sifa
Stefan Kesselheim
Nicolas Flores-Herr
28
48
0
12 Oct 2023
On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition
Nick Rossenbach
Benedikt Hilmes
Ralf Schluter
26
3
0
12 Oct 2023
Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention
Huiyin Xue
Nikolaos Aletras
50
0
0
11 Oct 2023
An Empirical Study of Instruction-tuning Large Language Models in Chinese
Q. Si
Tong Wang
Zheng Lin
Xu Zhang
Yanan Cao
Weiping Wang
ALM
74
16
0
11 Oct 2023
No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation
Dennis Fucci
Marco Gaido
Matteo Negri
Mauro Cettolo
L. Bentivogli
36
5
0
10 Oct 2023
Humans and language models diverge when predicting repeating text
Aditya R. Vaidya
Javier S. Turek
Alexander G. Huth
25
6
0
10 Oct 2023
Estimating Numbers without Regression
Avijit Thawani
Jay Pujara
Ashwin Kalyan
26
1
0
09 Oct 2023
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond
Siyang Liu
Naihao Deng
Sahand Sabour
Yilin Jia
Minlie Huang
Rada Mihalcea
40
18
0
09 Oct 2023
Lightweight In-Context Tuning for Multimodal Unified Models
Yixin Chen
Shuai Zhang
Boran Han
Jiaya Jia
24
2
0
08 Oct 2023
LLM4VV: Developing LLM-Driven Testsuite for Compiler Validation
Christian Munley
Aaron Jarmusch
Sunita Chandrasekaran
27
16
0
08 Oct 2023
VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models
Ziyi Yin
Muchao Ye
Tianrong Zhang
Tianyu Du
Jinguo Zhu
Han Liu
Jinghui Chen
Ting Wang
Fenglong Ma
AAML
VLM
CoGe
41
36
0
07 Oct 2023
Module-wise Adaptive Distillation for Multimodality Foundation Models
Chen Liang
Jiahui Yu
Ming-Hsuan Yang
Matthew A. Brown
Huayu Chen
Tuo Zhao
Boqing Gong
Tianyi Zhou
19
10
0
06 Oct 2023
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets
Dominique Beaini
Shenyang Huang
Joao Alex Cunha
Zhiyi Li
Gabriela Moisescu-Pareja
...
Thérence Bois
Andrew Fitzgibbon
Błażej Banaszewski
Chad Martin
Dominic Masters
AI4CE
41
20
0
06 Oct 2023
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models
Boyu Zhang
Hongyang Yang
Tianyu Zhou
Muhammad Ali Babar
Xiao-Yang Liu
AIFin
58
105
0
06 Oct 2023
TPDR: A Novel Two-Step Transformer-based Product and Class Description Match and Retrieval Method
Washington Cunha
Celso França
Leonardo Rocha
M. A. Gonçalves
29
3
0
05 Oct 2023
Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation
François Remy
Pieter Delobelle
Bettina Berendt
Kris Demuynck
Thomas Demeester
36
3
0
05 Oct 2023
Continual Contrastive Spoken Language Understanding
Umberto Cappellazzo
Enrico Fini
Muqiao Yang
Daniele Falavigna
Alessio Brutti
Bhiksha Raj
CLL
36
1
0
04 Oct 2023
Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns
Brian DuSell
David Chiang
33
12
0
03 Oct 2023
Enhancing Representation Generalization in Authorship Identification
Haining Wang
16
0
0
30 Sep 2023
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
41
15
0
28 Sep 2023
LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Guodong Ma
Wenxuan Wang
Yuke Li
Yuting Yang
Binbin Du
Haoran Fu
31
5
0
28 Sep 2023
Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Dongji Gao
Hainan Xu
Desh Raj
Leibny Paola García Perera
Daniel Povey
Sanjeev Khudanpur
38
4
0
26 Sep 2023
Introducing DictaLM -- A Large Generative Language Model for Modern Hebrew
Shaltiel Shmidman
Avi Shmidman
Amir DN Cohen
Moshe Koppel
33
0
0
25 Sep 2023
M$^3$CS: Multi-Target Masked Point Modeling with Learnable Codebook and Siamese Decoders
Qibo Qiu
Honghui Yang
Wenxiao Wang
Shun Zhang
Haiming Gao
Haochao Ying
Wei Hua
Xiaofei He
3DPC
46
0
0
23 Sep 2023
Hindi to English: Transformer-Based Neural Machine Translation
Kavit Gangar
Hardik Ruparel
Shreyas Lele
25
5
0
23 Sep 2023
Memory-augmented conformer for improved end-to-end long-form ASR
Carlos Carvalho
A. Abad
RALM
37
1
0
22 Sep 2023
Importance of Smoothness Induced by Optimizers in FL4ASR: Towards Understanding Federated Learning for End-to-End ASR
Sheikh Shams Azam
Tatiana Likhomanenko
Martin Pelikan
Jan Honza Silovsky
40
6
0
22 Sep 2023
Audience-specific Explanations for Machine Translation
Renhan Lou
Jan Niehues
LRM
28
2
0
22 Sep 2023
Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts
Emad A. Alghamdi
Jezia Zakraoui
Fares A. Abanmy
39
1
0
22 Sep 2023
JCoLA: Japanese Corpus of Linguistic Acceptability
Taiga Someya
Yushi Sugimoto
Yohei Oseki
37
5
0
22 Sep 2023
Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation
Bar Iluz
Tomasz Limisiewicz
Gabriel Stanovsky
David Mareček
40
3
0
21 Sep 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
Kan Wu
Houwen Peng
Zhenghong Zhou
Bin Xiao
Mengchen Liu
...
Xi Chen
Xinggang Wang
Hongyang Chao
Han Hu
VLM
OODD
29
54
0
21 Sep 2023
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model
Nolan Dey
Daria Soboleva
Faisal Al-Khateeb
Bowen Yang
Ribhu Pathria
...
Robert Myers
Jacob Robert Steeves
Natalia Vassilieva
Marvin Tom
Joel Hestness
MoE
41
15
0
20 Sep 2023
SignBank+: Preparing a Multilingual Sign Language Dataset for Machine Translation Using Large Language Models
Amit Moryossef
Zifan Jiang
SLR
23
0
0
20 Sep 2023
The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Aleksandar Stanić
Dylan R. Ashley
Oleg Serikov
Louis Kirsch
Francesco Faccio
Jürgen Schmidhuber
Thomas Hofmann
Imanol Schlag
MoE
50
9
0
20 Sep 2023
Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning
Chen Jiang
Hong Liu
Xuzheng Yu
Qing Wang
Yuan Cheng
...
Zhongyi Liu
Qingpei Guo
Wei Chu
Ming-Hsuan Yang
Yuan Qi
41
10
0
20 Sep 2023