1508.07909
Neural Machine Translation of Rare Words with Subword Units
31 August 2015
Rico Sennrich
Barry Haddow
Alexandra Birch
Papers citing
"Neural Machine Translation of Rare Words with Subword Units"
50 / 3,808 papers shown
Title
Pretraining Vision-Language Model for Difference Visual Question Answering in Longitudinal Chest X-rays
Yeongjae Cho
Taehee Kim
Heejun Shin
Sungzoon Cho
Dongmyung Shin
15
2
0
14 Feb 2024
Pixel Sentence Representation Learning
Chenghao Xiao
Zhuoxu Huang
Danlu Chen
G. Hudson
Yizhi Li
Haoran Duan
Chenghua Lin
Jie Fu
Jungong Han
Noura Al Moubayed
SSL
22
2
0
13 Feb 2024
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages
Johnathan Mercer
47
0
0
12 Feb 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Shivalika Singh
Freddie Vargus
Daniel D'souza
Börje F. Karlsson
Abinaya Mahendiran
...
Max Bartolo
Julia Kreutzer
Ahmet Üstün
Marzieh Fadaee
Sara Hooker
125
119
0
09 Feb 2024
Promoting Target Data in Context-aware Neural Machine Translation
Harritxu Gete
Thierry Etchegoyhen
34
1
0
09 Feb 2024
Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation
Luca Beurer-Kellner
Marc Fischer
Martin Vechev
44
38
0
07 Feb 2024
IllusionX: An LLM-powered mixed reality personal companion
Ramez Yousri
Zeyad Essam
Yehia Kareem
Youstina Sherief
Sherry Gamil
Soha Safwat
29
3
0
04 Feb 2024
Frequency Explains the Inverse Correlation of Large Language Models' Size, Training Data Amount, and Surprisal's Fit to Reading Times
Byung-Doh Oh
Shisen Yue
William Schuler
58
16
0
03 Feb 2024
Revisiting the Markov Property for Machine Translation
Cunxiao Du
Hao Zhou
Zhaopeng Tu
Jing Jiang
43
1
0
03 Feb 2024
A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages
Md Mahfuz Ibn Alam
Sina Ahmadi
Antonios Anastasopoulos
65
0
0
02 Feb 2024
Sequence Shortening for Context-Aware Machine Translation
Paweł Mąka
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
22
2
0
02 Feb 2024
Streaming Sequence Transduction through Dynamic Compression
Weiting Tan
Yunmo Chen
Tongfei Chen
Guanghui Qin
Haoran Xu
Heidi C. Zhang
Benjamin Van Durme
Philipp Koehn
29
2
0
02 Feb 2024
Getting the most out of your tokenizer for pre-training and domain adaptation
Gautier Dagan
Gabriele Synnaeve
Baptiste Rozière
39
20
0
01 Feb 2024
Dense Reward for Free in Reinforcement Learning from Human Feedback
Alex J. Chan
Hao Sun
Samuel Holt
M. Schaar
26
32
0
01 Feb 2024
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
Céline Hudelot
Pierre Colombo
63
32
0
01 Feb 2024
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models
Xiao Shao
Weifu Jiang
Fei Zuo
Mengqing Liu
LLMAG
39
7
0
31 Jan 2024
Arrows of Time for Large Language Models
Vassilis Papadopoulos
Jérémie Wenger
Clément Hongler
29
9
0
30 Jan 2024
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning
Bang-ju Yang
Yong Dai
Xuxin Cheng
Yaowei Li
Asif Raza
Yuexian Zou
VLM
47
4
0
30 Jan 2024
ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text
Thanh-Nhi Nguyen
Thanh-Phong Le
Kiet Van Nguyen
30
2
0
29 Jan 2024
Non-Fluent Synthetic Target-Language Data Improve Neural Machine Translation
Víctor M. Sánchez-Cartagena
Miquel Espla-Gomis
J. A. Pérez-Ortiz
F. Sánchez-Martínez
35
4
0
29 Jan 2024
Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation
Víctor M. Sánchez-Cartagena
J. A. Pérez-Ortiz
F. Sánchez-Martínez
23
5
0
29 Jan 2024
Stolen Subwords: Importance of Vocabularies for Machine Translation Model Stealing
Vilém Zouhar
AAML
40
0
0
29 Jan 2024
Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition
Ahnaf Mozib Samin
25
0
0
28 Jan 2024
Baichuan2-Sum: Instruction Finetune Baichuan2-7B Model for Dialogue Summarization
Jianfei Xiao
Yancan Chen
Yimin Ou
Hanyi Yu
Kai Shu
Yiyong Xiao
ALM
31
11
0
27 Jan 2024
Importance-Aware Data Augmentation for Document-Level Neural Machine Translation
Ming-Ru Wu
Yufei Wang
George F. Foster
Lizhen Qu
Gholamreza Haffari
43
7
0
27 Jan 2024
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Daya Guo
Qihao Zhu
Dejian Yang
Zhenda Xie
Kai Dong
...
Yu-Huan Wu
Y. K. Li
Fuli Luo
Yingfei Xiong
W. Liang
ELM
62
695
0
25 Jan 2024
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech
Chenpeng Du
Yiwei Guo
Hankun Wang
Yifan Yang
Zhikang Niu
Shuai Wang
Hui Zhang
Xie Chen
Kai Yu
VLM
37
25
0
25 Jan 2024
MambaByte: Token-free Selective State Space Model
Junxiong Wang
Tushaar Gangavarapu
Jing Nathan Yan
Alexander M. Rush
Mamba
44
37
0
24 Jan 2024
SEDAC: A CVAE-Based Data Augmentation Method for Security Bug Report Identification
Y. Liao
T. Zhang
11
0
0
22 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
40
3
0
21 Jan 2024
Instructional Fingerprinting of Large Language Models
Lyne Tchapmi
Fei Wang
Mingyu Derek Ma
Pang Wei Koh
Chaowei Xiao
Muhao Chen
WaLM
22
29
0
21 Jan 2024
Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text
Mazal Bethany
Brandon Wherry
Emet Bethany
Nishant Vishwamitra
Anthony Rios
Peyman Najafirad
DeLMO
38
4
0
17 Jan 2024
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models
Jianhui Pang
Fanghua Ye
Longyue Wang
Dian Yu
Derek F. Wong
Shuming Shi
Zhaopeng Tu
ALM
46
6
0
16 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Aman Chadha
Amitava Das
42
28
0
15 Jan 2024
PersianMind: A Cross-Lingual Persian-English Large Language Model
Pedram Rostami
Ali Salemi
M. Dousti
CLL
LRM
37
5
0
12 Jan 2024
An approach for mistranslation removal from popular dataset for Indic MT Task
Sudhansu Bala Das
Leo Raphael Rodrigues
Tapas Kumar Mishra
Bidyut Kr. Patra
22
1
0
12 Jan 2024
Distilling Vision-Language Models on Millions of Videos
Yue Zhao
Long Zhao
Xingyi Zhou
Jialin Wu
Chun-Te Chu
...
Hartwig Adam
Ting Liu
Boqing Gong
Philipp Krahenbuhl
Liangzhe Yuan
VLM
41
13
0
11 Jan 2024
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Damai Dai
Chengqi Deng
Chenggang Zhao
R. X. Xu
Huazuo Gao
...
Panpan Huang
Fuli Luo
Chong Ruan
Zhifang Sui
W. Liang
MoE
46
252
0
11 Jan 2024
Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification?
Changye Li
Weizhe Xu
Trevor Cohen
Serguei V. S. Pakhomov
38
7
0
10 Jan 2024
RoBERTurk: Adjusting RoBERTa for Turkish
Nuri Tas
27
1
0
07 Jan 2024
CharPoet: A Chinese Classical Poetry Generation System Based on Token-free LLM
Chengyue Yu
Lei Zang
Jiaotuan Wang
Chenyi Zhuang
Jinjie Gu
34
3
0
07 Jan 2024
PIXAR: Auto-Regressive Language Modeling in Pixel Space
Yintao Tai
Xiyang Liao
Alessandro Suglia
Antonio Vergari
MLLM
26
7
0
06 Jan 2024
CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition
Junfeng Hou
Peiyao Wang
Jincheng Zhang
Meng-Da Yang
Minwei Feng
Jingcheng Yin
29
1
0
04 Jan 2024
Cheetah: Natural Language Generation for 517 African Languages
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
29
4
0
02 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
50
6
0
29 Dec 2023
Language Model as an Annotator: Unsupervised Context-aware Quality Phrase Generation
Zhihao Zhang
Yuan Zuo
Chenghua Lin
Junjie Wu
29
5
0
28 Dec 2023
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLM
MLLM
40
147
0
28 Dec 2023
Spike No More: Stabilizing the Pre-training of Large Language Models
Sho Takase
Shun Kiyono
Sosuke Kobayashi
Jun Suzuki
20
14
0
28 Dec 2023
Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based Person Re-Identification
Yajing Zhai
Yawen Zeng
Zhiyong Huang
Zheng Qin
Xin Jin
Dandan Cao
28
12
0
28 Dec 2023
Algebraic Positional Encodings
Konstantinos Kogkalidis
Jean-Philippe Bernardy
Vikas K. Garg
24
1
0
26 Dec 2023