Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1508.07909
Cited By
Neural Machine Translation of Rare Words with Subword Units
31 August 2015
Rico Sennrich
Barry Haddow
Alexandra Birch
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Machine Translation of Rare Words with Subword Units"
50 / 3,808 papers shown
Title
Extend Adversarial Policy Against Neural Machine Translation via Unknown Token
Wei Zou
Shujian Huang
Jiajun Chen
AAML
75
0
0
21 Jan 2025
Banzhaf Power in Hierarchical Voting Games
John Randolph
Denizalp Goktas
Amy Greenwald
36
0
0
12 Jan 2025
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer
Vladimir Bataev
Subhankar Ghosh
Vitaly Lavrukhin
Jason Chun Lok Li
AI4TS
46
0
0
10 Jan 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
154
205
0
10 Jan 2025
Dialectal and Low-Resource Machine Translation for Aromanian
Alexandru-Iulius Jerpelea
Alina-Ştefania Rădoi
Sergiu Nisioi
33
1
0
08 Jan 2025
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Yueqin Yin
Shentao Yang
Yujia Xie
Ziyi Yang
Yuting Sun
Hany Awadalla
Weizhu Chen
Mingyuan Zhou
52
0
0
07 Jan 2025
Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition
Rui Liu
Hongyu Yuan
Hong Li
43
0
0
03 Jan 2025
Enhancing Visual Representation for Text-based Person Searching
Wei Shen
Ming Fang
Yuxia Wang
Jiafeng Xiao
Diping Li
H. Chen
Ling Xu
Wenbo Zhang
41
1
0
31 Dec 2024
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
Yang Han
Ziping Wan
Lu Chen
Kai Yu
Xin Chen
LM&MA
35
1
0
31 Dec 2024
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
Batu Guan
Yao Wan
Zhangqian Bi
Zheng Wang
Hongyu Zhang
Yulei Sui
Pan Zhou
45
8
0
31 Dec 2024
Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation
Haotian Qian
YD Chen
Shengtao Lou
Fahad Shahbaz Khan
Xiaogang Jin
Deng-Ping Fan
DiffM
50
4
0
26 Dec 2024
Domain adapted machine translation: What does catastrophic forgetting forget and why?
Danielle Saunders
Steve DeNeefe
AI4CE
31
0
0
23 Dec 2024
Reconsidering SMT Over NMT for Closely Related Languages: A Case Study of Persian-Hindi Pair
Waisullah Yousofi
Pushpak Bhattacharyya
84
0
0
22 Dec 2024
L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
J. Zhang
Zhengxue Cheng
Yan Zhao
Shihao Wang
Dajiang Zhou
Guo Lu
Li-Na Song
81
1
0
21 Dec 2024
Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
Lifeng Qiao
Peng Ye
Yuchen Ren
Weiqiang Bai
Chaoqi Liang
Xinzhu Ma
Nanqing Dong
W. Ouyang
86
2
0
18 Dec 2024
SEE: Sememe Entanglement Encoding for Transformer-bases Models Compression
Jing Zhang
Shuzhen Sun
Peng Zhang
Guangxing Cao
Hui Gao
Xindian Ma
Nan Xu
Yuexian Hou
65
0
0
15 Dec 2024
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
Changan Chen
Juze Zhang
S. K. Lakshmikanth
Yusu Fang
Ruizhi Shao
Gordon Wetzstein
L. Fei-Fei
Ehsan Adeli
VGen
82
3
0
13 Dec 2024
Efficient Continual Pre-training of LLMs for Low-resource Languages
Arijit Nag
Soumen Chakrabarti
Animesh Mukherjee
Niloy Ganguly
82
0
0
13 Dec 2024
Multi-Head Encoding for Extreme Label Classification
Daojun Liang
Haixia Zhang
Dongfeng Yuan
Minggao Zhang
75
0
0
13 Dec 2024
MVD: A Multi-Lingual Software Vulnerability Detection Framework
Boyu Zhang
T. H. Le
M. Babar
77
0
0
09 Dec 2024
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Michael Y. Hu
Aaron Mueller
Candace Ross
Adina Williams
Tal Linzen
Chengxu Zhuang
Ryan Cotterell
Leshem Choshen
Alex Warstadt
Ethan Gotlieb Wilcox
99
7
0
06 Dec 2024
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning
Amnon Bleich
A. Linnemann
B. Diem
Tim Conrad
MedIm
70
2
0
05 Dec 2024
From Language Models over Tokens to Language Models over Characters
Tim Vieira
Ben LeBrun
Mario Giulianelli
Juan Luis Gastaldi
Brian DuSell
John Terilla
Timothy J. O'Donnell
Ryan Cotterell
81
8
0
04 Dec 2024
DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators
Tejumade Afonja
Hui-Po Wang
Raouf Kerkouche
Mario Fritz
SyDa
118
2
0
03 Dec 2024
Command-line Risk Classification using Transformer-based Neural Architectures
Paolo Notaro
Soroush Haeri
Jorge Cardoso
Michael Gerndt
64
0
0
02 Dec 2024
Concept Based Continuous Prompts for Interpretable Text Classification
Qian Chen
Dongyang Li
Xiaofeng He
92
0
0
02 Dec 2024
A Wave is Worth 100 Words: Investigating Cross-Domain Transferability in Time Series
Xiangkai Ma
Xiaobin Hong
Wenzhong Li
Sanglu Lu
AI4TS
64
0
0
01 Dec 2024
Scaling Particle Collision Data Analysis
Hengkui Wu
Panpan Chi
Yongfeng Zhu
Liujiang Liu
Shuyang Hu
...
Yingsi Xin
Bruce Liu
Dahao Liang
Xiaojun Jia
Manqi Ruan
79
0
0
28 Nov 2024
Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning
Zhu Xu
Zhiqiang Zhao
Zihan Zhang
Yuchi Liu
Quanwei Shen
Fei Liu
Yu Kuang
Jian He
Conglin Liu
83
1
0
26 Nov 2024
Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition
Hyeonseung Lee
J. Yoon
Sungsoo Kim
N. Kim
71
0
0
26 Nov 2024
Development of Pre-Trained Transformer-based Models for the Nepali Language
Prajwal Thapa
Jinu Nyachhyon
Mridul Sharma
Bal Krishna Bal
81
0
0
24 Nov 2024
Efficient Online Inference of Vision Transformers by Training-Free Tokenization
Leonidas Gee
Wing Yan Li
V. Sharmanska
Novi Quadrianto
ViT
93
0
0
23 Nov 2024
Generative Timelines for Instructed Visual Assembly
Alejandro Pardo
Jui-hsien Wang
Guohao Li
Josef Sivic
Bryan C. Russell
Fabian Caba Heilbron
VGen
72
0
0
19 Nov 2024
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
Shitong Shao
Zikai Zhou
Tian Ye
Lichen Bai
Zhiqiang Xu
Zeke Xie
DiffM
51
0
0
16 Nov 2024
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Tim Elsner
Paula Usinger
Julius Nehring-Wirxel
Gregor Kobsik
Victor Czech
Yanjiang He
I. Lim
Leif Kobbelt
39
0
0
15 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
Hao Fei
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
48
9
0
08 Nov 2024
HeartBERT: A Self-Supervised ECG Embedding Model for Efficient and Effective Medical Signal Analysis
Saedeh Tahery
Fatemeh Hamid Akhlaghi
Termeh Amirsoleimani
OOD
88
1
0
08 Nov 2024
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM
Jingwei Xu
Chenyu Wang
Zibo Zhao
Wen Liu
Yi Ma
Shenghua Gao
58
13
0
07 Nov 2024
Classification Done Right for Vision-Language Pre-Training
Zilong Huang
Qinghao Ye
Bingyi Kang
Jiashi Feng
Haoqi Fan
CLIP
VLM
50
2
0
05 Nov 2024
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning
Yangqiu Song
Tong Zheng
Ran Wang
Jiahao Liu
Qingyan Guo
...
Xu Tan
Tong Xiao
Jingbo Zhu
Jie Wang
Xunliang Cai
60
1
0
05 Nov 2024
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
Mufei Li
Viraj Shitole
Eli Chien
Changhai Man
Zhaodong Wang
Srinivas Sridharan
Ying Zhang
Tushar Krishna
P. Li
41
0
0
04 Nov 2024
MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation
Langlin Huang
Mengyu Bu
Yang Feng
33
0
0
03 Nov 2024
SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation
Dennis Fucci
Marco Gaido
Beatrice Savoldi
Matteo Negri
Mauro Cettolo
L. Bentivogli
57
1
0
03 Nov 2024
Morphological Typology in BPE Subword Productivity and Language Modeling
Iñigo Parra
36
0
0
31 Oct 2024
From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
Zébulon Goriely
Richard Diehl Martinez
Andrew Caines
Lisa Beinborn
P. Buttery
CLL
50
5
0
30 Oct 2024
Discrete Modeling via Boundary Conditional Diffusion Processes
Yuxuan Gu
Xiaocheng Feng
Lei Huang
Yingsheng Wu
Zekun Zhou
Weihong Zhong
Kun Zhu
Bing Qin
DiffM
28
0
0
29 Oct 2024
MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression
Noel Elias
H. Esfahanizadeh
Kaan Kale
S. Vishwanath
Muriel Médard
38
0
0
28 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
40
2
0
28 Oct 2024
Graph Neural Networks on Discriminative Graphs of Words
Yassine Abbahaddou
J. Lutzeyer
Michalis Vazirgiannis
24
0
0
27 Oct 2024
CodePurify: Defend Backdoor Attacks on Neural Code Models via Entropy-based Purification
Fangwen Mu
Junjie Wang
Zhuohao Yu
Lin Shi
Song Wang
Mingyang Li
Qing Wang
AAML
41
1
0
26 Oct 2024
Previous
1
2
3
4
5
6
...
75
76
77
Next