ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.07909
  4. Cited By
Neural Machine Translation of Rare Words with Subword Units

Neural Machine Translation of Rare Words with Subword Units

31 August 2015
Rico Sennrich
Barry Haddow
Alexandra Birch
ArXivPDFHTML

Papers citing "Neural Machine Translation of Rare Words with Subword Units"

50 / 3,808 papers shown
Title
Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models
Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models
Hui-Po Wang
Mario Fritz
35
3
0
26 Sep 2024
Data-Centric AI Governance: Addressing the Limitations of Model-Focused
  Policies
Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies
Ritwik Gupta
Leah Walker
Rodolfo Corona
Stephanie Fu
Suzanne Petryk
Janet Napolitano
Trevor Darrell
Andrew W. Reddie
ELM
43
3
0
25 Sep 2024
dnaGrinder: a lightweight and high-capacity genomic foundation model
dnaGrinder: a lightweight and high-capacity genomic foundation model
Qihang Zhao
Chi Zhang
Weixiong Zhang
31
0
0
24 Sep 2024
RAMBO: Enhancing RAG-based Repository-Level Method Body Completion
RAMBO: Enhancing RAG-based Repository-Level Method Body Completion
Tuan-Dung Bui
Duc-Thieu Luu-Van
Thanh-Phat Nguyen
Thu-Trang Nguyen
Son Nguyen
H. Vo
39
4
0
23 Sep 2024
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond
Hong Chen
Xin Wang
Yuwei Zhou
Bin Huang
Yipeng Zhang
Wei Feng
Houlun Chen
Zeyang Zhang
Siao Tang
Wenwu Zhu
DiffM
55
7
0
23 Sep 2024
HW-TSC's Submission to the CCMT 2024 Machine Translation Tasks
HW-TSC's Submission to the CCMT 2024 Machine Translation Tasks
Zhanglin Wu
Yuanchang Luo
Daimeng Wei
Jiawei Zheng
Bin Wei
...
Jiaxin Guo
Shaojun Li
Mengli Zhu
Ning Xie
Hao Yang
45
1
0
23 Sep 2024
Temporally Consistent Factuality Probing for Large Language Models
Temporally Consistent Factuality Probing for Large Language Models
Ashutosh Bajpai
Aaryan Goyal
Atif Anwer
Tanmoy Chakraborty
HILM
32
1
0
21 Sep 2024
Demystifying and Extracting Fault-indicating Information from Logs for
  Failure Diagnosis
Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis
Junjie Huang
Zhihan Jiang
Jinyang Liu
Yintong Huo
Jiazhen Gu
Zhuangbin Chen
Cong Feng
Hui Dong
Zengyin Yang
Michael R. Lyu
33
3
0
20 Sep 2024
LM-assisted keyword biasing with Aho-Corasick algorithm for
  Transducer-based ASR
LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR
Iuliia Thorbecke
Juan Zuluaga-Gomez
Esaú Villatoro-Tello
Andres Carofilis
Shashi Kumar
P. Motlícek
Karthik Pandia
A. Ganapathiraju
37
0
0
20 Sep 2024
Smirk: An Atomically Complete Tokenizer for Molecular Foundation Models
Smirk: An Atomically Complete Tokenizer for Molecular Foundation Models
Alexius Wadell
Anoushka Bhutani
Venkatasubramanian Viswanathan
186
0
0
19 Sep 2024
Mixture of Diverse Size Experts
Mixture of Diverse Size Experts
Manxi Sun
Wei Liu
Jian Luan
Pengzhi Gao
Bin Wang
MoE
28
1
0
18 Sep 2024
DocMamba: Efficient Document Pre-training with State Space Model
DocMamba: Efficient Document Pre-training with State Space Model
Pengfei Hu
Zhenrong Zhang
Jiefeng Ma
Shuhang Liu
Jun Du
Jianshu Zhang
Mamba
42
1
0
18 Sep 2024
Egalitarian Language Representation in Language Models: It All Begins
  with Tokenizers
Egalitarian Language Representation in Language Models: It All Begins with Tokenizers
Menan Velayuthan
Kengatharaiyer Sarveswaran
40
5
0
17 Sep 2024
Linear Recency Bias During Training Improves Transformers' Fit to
  Reading Times
Linear Recency Bias During Training Improves Transformers' Fit to Reading Times
Christian Clark
Byung-Doh Oh
William Schuler
39
3
0
17 Sep 2024
Surveying the MLLM Landscape: A Meta-Review of Current Surveys
Surveying the MLLM Landscape: A Meta-Review of Current Surveys
Ming Li
Keyu Chen
Ziqian Bi
Ming Liu
Benji Peng
...
Jinlang Wang
Sen Zhang
X. Pan
Jiawei Xu
Pohsun Feng
OffRL
54
2
0
17 Sep 2024
Context-Aware Membership Inference Attacks against Pre-trained Large
  Language Models
Context-Aware Membership Inference Attacks against Pre-trained Large Language Models
Hongyan Chang
Ali Shahin Shamsabadi
Kleomenis Katevas
Hamed Haddadi
Reza Shokri
MIALM
63
6
0
11 Sep 2024
TeXBLEU: Automatic Metric for Evaluate LaTeX Format
TeXBLEU: Automatic Metric for Evaluate LaTeX Format
Kyudan Jung
N. Kim
Hyongon Ryu
Sieun Hyeon
Seung-jun Lee
Hyeok-jae Lee
37
0
0
10 Sep 2024
Scaling Law Hypothesis for Multimodal Model
Scaling Law Hypothesis for Multimodal Model
Qingyun Sun
Zhen Guo
48
0
0
10 Sep 2024
SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization
SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization
Kohei Tsuji
Tatsuya Hiraoka
Yuchang Cheng
Tomoya Iwakura
45
1
0
10 Sep 2024
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer
  Training
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training
Pavel Chizhov
Catherine Arnett
Elizaveta Korotkova
Ivan P. Yamshchikov
48
2
0
06 Sep 2024
STAB: Speech Tokenizer Assessment Benchmark
STAB: Speech Tokenizer Assessment Benchmark
Shikhar Vashishth
Harman Singh
Shikhar Bharadwaj
Sriram Ganapathy
Chulayuth Asawaroengchai
Kartik Audhkhasi
Andrew Rosenberg
Ankur Bapna
Bhuvana Ramabhadran
57
1
0
04 Sep 2024
THInC: A Theory-Driven Framework for Computational Humor Detection
THInC: A Theory-Driven Framework for Computational Humor Detection
Victor De Marez
Thomas Winters
Ayla Rigouts Terryn
27
2
0
02 Sep 2024
Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slip Scripts
Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slip Scripts
Yingfa Chen
Chenlong Hu
Cong Feng
Chenyang Song
Shi Yu
Xu Han
Zhiyuan Liu
Maosong Sun
33
0
0
02 Sep 2024
Post-OCR Text Correction for Bulgarian Historical Documents
Post-OCR Text Correction for Bulgarian Historical Documents
Angel Beshirov
Milena Dobreva
Dimitar Dimitrov
Momchil Hardalov
Ivan Koychev
Preslav Nakov
44
1
0
31 Aug 2024
ProGRes: Prompted Generative Rescoring on ASR n-Best
ProGRes: Prompted Generative Rescoring on ASR n-Best
Ada Defne Tur
Adel Moumen
Mirco Ravanelli
36
1
0
30 Aug 2024
Unintentional Security Flaws in Code: Automated Defense via Root Cause
  Analysis
Unintentional Security Flaws in Code: Automated Defense via Root Cause Analysis
Nafis Tanveer Islam
Mazal Bethany
Dylan Manuel
Murtuza Jadliwala
Peyman Najafirad
33
0
0
30 Aug 2024
Large-Scale Multi-omic Biosequence Transformers for Modeling Protein-Nucleic Acid Interactions
Large-Scale Multi-omic Biosequence Transformers for Modeling Protein-Nucleic Acid Interactions
Sully F. Chen
Robert J. Steele
Beakal Lemeneh
S. Lad
Eric Oermann
Eric K. Oermann
AI4CE
47
0
0
29 Aug 2024
CrossInspector: A Static Analysis Approach for Cross-Contract
  Vulnerability Detection
CrossInspector: A Static Analysis Approach for Cross-Contract Vulnerability Detection
Xiao Chen
26
3
0
27 Aug 2024
Towards Lifelong Learning Embeddings: An Algorithmic Approach to
  Dynamically Extend Embeddings
Towards Lifelong Learning Embeddings: An Algorithmic Approach to Dynamically Extend Embeddings
Miguel Alves Gomes
Philipp Meisen
Tobias Meisen
31
0
0
26 Aug 2024
Bidirectional Awareness Induction in Autoregressive Seq2Seq Models
Bidirectional Awareness Induction in Autoregressive Seq2Seq Models
J. Hu
Roberto Cavicchioli
Alessandro Capotondi
BDL
34
0
0
25 Aug 2024
Positional Description for Numerical Normalization
Positional Description for Numerical Normalization
Deepanshu Gupta
Javier Latorre
3DGS
34
0
0
22 Aug 2024
Distributional Properties of Subword Regularization
Distributional Properties of Subword Regularization
Marco Cognetta
Vilém Zouhar
Naoaki Okazaki
37
0
0
21 Aug 2024
Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking
  Across Diverse Vocabularies
Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking Across Diverse Vocabularies
Sai Koneru
Matthias Huck
M. Exel
Jan Niehues
32
0
0
21 Aug 2024
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering
  LLM Weaknesses
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Yulong Chen
Yang Liu
Jianhao Yan
X. Bai
Ming Zhong
Yinghao Yang
Ziyi Yang
Chenguang Zhu
Yue Zhang
ALM
ELM
37
7
0
16 Aug 2024
SC-Rec: Enhancing Generative Retrieval with Self-Consistent Reranking
  for Sequential Recommendation
SC-Rec: Enhancing Generative Retrieval with Self-Consistent Reranking for Sequential Recommendation
Tongyoung Kim
Soojin Yoon
SeongKu Kang
Jinyoung Yeo
Dongha Lee
RALM
30
2
0
16 Aug 2024
ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction
  Based on Large Language Model
ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction Based on Large Language Model
Xuanqing Yu
Wangtao Sun
Jingwei Li
Kang Liu
Chengbao Liu
Jie Tan
OffRL
AI4TS
44
3
0
14 Aug 2024
Unlocking Efficiency: Adaptive Masking for Gene Transformer Models
Unlocking Efficiency: Adaptive Masking for Gene Transformer Models
Soumyadeep Roy
S. Sural
Niloy Ganguly
MedIm
43
0
0
13 Aug 2024
Retrieval-augmented code completion for local projects using large
  language models
Retrieval-augmented code completion for local projects using large language models
Marko Hostnik
Marko Robnik-Sikonja
RALM
35
0
0
09 Aug 2024
Semantics or spelling? Probing contextual word embeddings with
  orthographic noise
Semantics or spelling? Probing contextual word embeddings with orthographic noise
Jacob A. Matthews
John R. Starr
Marten van Schijndel
40
2
0
08 Aug 2024
MPC-Minimized Secure LLM Inference
MPC-Minimized Secure LLM Inference
Deevashwer Rathee
Dacheng Li
Ion Stoica
Hao Zhang
Raluca A. Popa
47
1
0
07 Aug 2024
MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh
  Tokenization
MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization
Yiwen Chen
Yikai Wang
Yihao Luo
Zihan Wang
Zilong Chen
Jun Zhu
Chi Zhang
Guosheng Lin
33
23
0
05 Aug 2024
Batching BPE Tokenization Merges
Batching BPE Tokenization Merges
Alexander P. Morgan
32
0
0
05 Aug 2024
Can LLMs predict the convergence of Stochastic Gradient Descent?
Can LLMs predict the convergence of Stochastic Gradient Descent?
Hiroki Sakaji
Abdelhakim Benechehab
Wataru Kuramoto
LRM
62
2
0
03 Aug 2024
PC$^2$: Pseudo-Classification Based Pseudo-Captioning for Noisy
  Correspondence Learning in Cross-Modal Retrieval
PC2^22: Pseudo-Classification Based Pseudo-Captioning for Noisy Correspondence Learning in Cross-Modal Retrieval
Yue Duan
Zhangxuan Gu
ZhenZhe Ying
Wei Li
Yu Zhang
Zibin Zheng
26
2
0
02 Aug 2024
Reconsidering Token Embeddings with the Definitions for Pre-trained
  Language Models
Reconsidering Token Embeddings with the Definitions for Pre-trained Language Models
Ying Zhang
Zhuoran Liu
Manabu Okumura
18
1
0
02 Aug 2024
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End
  Modeling with LM Knowledge Distillation
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation
Kohei Matsuura
Takanori Ashihara
Takafumi Moriya
Masato Mimura
Takatomo Kano
A. Ogawa
Marc Delcroix
27
2
0
01 Aug 2024
Towards interfacing large language models with ASR systems using
  confidence measures and prompting
Towards interfacing large language models with ASR systems using confidence measures and prompting
Maryam Naderi
Xingrui Yang
Weihan Wang
Sevada Hovsepyan
Weichen Dai
KELM
37
1
0
31 Jul 2024
Look Hear: Gaze Prediction for Speech-directed Human Attention
Look Hear: Gaze Prediction for Speech-directed Human Attention
Sounak Mondal
Seoyoung Ahn
Zhibo Yang
Niranjan Balasubramanian
Dimitris Samaras
G. Zelinsky
Minh Hoai
47
1
0
28 Jul 2024
Improving noisy student training for low-resource languages in
  End-to-End ASR using CycleGAN and inter-domain losses
Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses
C. Li
Ngoc Thang Vu
24
3
0
26 Jul 2024
On the Effect of Purely Synthetic Training Data for Different Automatic
  Speech Recognition Architectures
On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures
Nick Rossenbach
Benedikt Hilmes
Ralf Schluter
35
1
0
25 Jul 2024
Previous
123456...757677
Next