ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.07909
  4. Cited By
Neural Machine Translation of Rare Words with Subword Units

Neural Machine Translation of Rare Words with Subword Units

31 August 2015
Rico Sennrich
Barry Haddow
Alexandra Birch
ArXivPDFHTML

Papers citing "Neural Machine Translation of Rare Words with Subword Units"

50 / 3,808 papers shown
Title
Explicit Foundation Model Optimization with Self-Attentive Feed-Forward
  Neural Units
Explicit Foundation Model Optimization with Self-Attentive Feed-Forward Neural Units
Jake Ryland Williams
Haoran Zhao
21
0
0
13 Nov 2023
Reducing the Need for Backpropagation and Discovering Better Optima With
  Explicit Optimizations of Neural Networks
Reducing the Need for Backpropagation and Discovering Better Optima With Explicit Optimizations of Neural Networks
Jake Ryland Williams
Haoran Zhao
29
0
0
13 Nov 2023
Context Consistency between Training and Testing in Simultaneous Machine
  Translation
Context Consistency between Training and Testing in Simultaneous Machine Translation
M. Zhong
Lemao Liu
Kehai Chen
Mingming Yang
Min Zhang
LRM
52
0
0
13 Nov 2023
Decoupling and Interacting Multi-Task Learning Network for Joint Speech
  and Accent Recognition
Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition
Qijie Shao
Pengcheng Guo
Jinghao Yan
Pengfei Hu
Lei Xie
32
8
0
13 Nov 2023
To Transformers and Beyond: Large Language Models for the Genome
To Transformers and Beyond: Large Language Models for the Genome
Micaela Elisa Consens
Cameron Dufault
Michael Wainberg
Duncan Forster
Mehran Karimzadeh
Hani Goodarzi
Fabian J. Theis
Alan Moses
Bo Wang
LM&MA
MedIm
26
28
0
13 Nov 2023
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training
  Regime and Better Alignment to Human Preferences
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences
Yuanhe Tian
Ruyi Gan
Yan Song
Jiaxing Zhang
Yongdong Zhang
AI4MH
AI4CE
LM&MA
36
31
0
10 Nov 2023
TransformCode: A Contrastive Learning Framework for Code Embedding via
  Subtree Transformation
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree Transformation
Zixiang Xian
Rubing Huang
Dave Towey
Chunrong Fang
Zhenyu Chen
25
5
0
10 Nov 2023
Memorisation Cartography: Mapping out the Memorisation-Generalisation
  Continuum in Neural Machine Translation
Memorisation Cartography: Mapping out the Memorisation-Generalisation Continuum in Neural Machine Translation
Verna Dankers
Ivan Titov
Dieuwke Hupkes
48
5
0
09 Nov 2023
Improving Whispered Speech Recognition Performance using
  Pseudo-whispered based Data Augmentation
Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation
Zhaofeng Lin
T. Patel
O. Scharenborg
16
2
0
09 Nov 2023
Mental Health Diagnosis in the Digital Age: Harnessing Sentiment
  Analysis on Social Media Platforms upon Ultra-Sparse Feature Content
Mental Health Diagnosis in the Digital Age: Harnessing Sentiment Analysis on Social Media Platforms upon Ultra-Sparse Feature Content
Haijian Shao
Ming Zhu
Shengjie Zhai
AI4MH
30
3
0
09 Nov 2023
Loss Masking Is Not Needed in Decoder-only Transformer for
  Discrete-token-based ASR
Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token-based ASR
Qian Chen
Wen Wang
Qinglin Zhang
Siqi Zheng
Shiliang Zhang
Chong Deng
Yukun Ma
Hai Yu
Jiaqing Liu
Chong Zhang
26
8
0
08 Nov 2023
Multitask Multimodal Prompted Training for Interactive Embodied Task
  Completion
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion
Georgios Pantazopoulos
Malvina Nikandrou
Amit Parekh
Bhathiya Hemanthage
Arash Eshghi
Ioannis Konstas
Verena Rieser
Oliver Lemon
Alessandro Suglia
LM&Ro
39
7
0
07 Nov 2023
Improving Korean NLP Tasks with Linguistically Informed Subword
  Tokenization and Sub-character Decomposition
Improving Korean NLP Tasks with Linguistically Informed Subword Tokenization and Sub-character Decomposition
Tae-Hee Jeon
Bongseok Yang
ChangHwan Kim
Yoonseob Lim
27
0
0
07 Nov 2023
Ziya2: Data-centric Learning is All LLMs Need
Ziya2: Data-centric Learning is All LLMs Need
Ruyi Gan
Ziwei Wu
Renliang Sun
Junyu Lu
Xiaojun Wu
...
Ping Yang
Qi Yang
Hao Wang
Jiaxing Zhang
Yan Song
VLM
ALM
25
17
0
06 Nov 2023
Mini Minds: Exploring Bebeshka and Zlata Baby Models
Mini Minds: Exploring Bebeshka and Zlata Baby Models
Irina Proskurina
Guillaume Metzler
Julien Velcin
ALM
31
1
0
06 Nov 2023
Replicable Benchmarking of Neural Machine Translation (NMT) on
  Low-Resource Local Languages in Indonesia
Replicable Benchmarking of Neural Machine Translation (NMT) on Low-Resource Local Languages in Indonesia
Lucky Susanto
Ryandito Diandaru
Adila Alfa Krisnadhi
Ayu Purwarianti
Derry Wijaya
29
2
0
02 Nov 2023
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech
  Translation
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Juan Pablo Zuluaga
Zhaocheng Huang
Xing Niu
Rohit Paturi
S. Srinivasan
Prashant Mathur
Brian Thompson
Marcello Federico
BDL
37
2
0
01 Nov 2023
Explicit Morphological Knowledge Improves Pre-training of Language
  Models for Hebrew
Explicit Morphological Knowledge Improves Pre-training of Language Models for Hebrew
Eylon Gueta
Omer Goldman
Reut Tsarfaty
24
1
0
01 Nov 2023
De-Diffusion Makes Text a Strong Cross-Modal Interface
De-Diffusion Makes Text a Strong Cross-Modal Interface
Chen Wei
Chenxi Liu
Siyuan Qiao
Zhishuai Zhang
Alan Yuille
Jiahui Yu
VLM
DiffM
42
10
0
01 Nov 2023
Increasing The Performance of Cognitively Inspired Data-Efficient
  Language Models via Implicit Structure Building
Increasing The Performance of Cognitively Inspired Data-Efficient Language Models via Implicit Structure Building
Omar Momen
David Arps
Laura Kallmeyer
AI4CE
31
2
0
31 Oct 2023
Unified Representation for Non-compositional and Compositional
  Expressions
Unified Representation for Non-compositional and Compositional Expressions
Ziheng Zeng
Suma Bhat
30
3
0
29 Oct 2023
Probing LLMs for Joint Encoding of Linguistic Categories
Probing LLMs for Joint Encoding of Linguistic Categories
Giulio Starace
Konstantinos Papakostas
Rochelle Choenni
Apostolos Panagiotopoulos
Matteo Rosati
Alina Leidinger
Ekaterina Shutova
27
5
0
28 Oct 2023
ArcheType: A Novel Framework for Open-Source Column Type Annotation
  using Large Language Models
ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Ben Feuer
Yurong Liu
Chinmay Hegde
Juliana Freire
AI4TS
VLM
27
9
0
27 Oct 2023
Unified Segment-to-Segment Framework for Simultaneous Sequence
  Generation
Unified Segment-to-Segment Framework for Simultaneous Sequence Generation
Shaolei Zhang
Yang Feng
30
7
0
27 Oct 2023
Words, Subwords, and Morphemes: What Really Matters in the
  Surprisal-Reading Time Relationship?
Words, Subwords, and Morphemes: What Really Matters in the Surprisal-Reading Time Relationship?
Sathvik Nair
Philip Resnik
30
10
0
26 Oct 2023
DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct
  Speech-to-Speech Translation
DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation
Yongxin Zhu
Zhujin Gao
Xinyuan Zhou
Zhongyi Ye
Linli Xu
36
2
0
26 Oct 2023
LightLM: A Lightweight Deep and Narrow Language Model for Generative
  Recommendation
LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation
Kai Mei
Yongfeng Zhang
VLM
105
11
0
26 Oct 2023
Learning to Abstract with Nonparametric Variational Information
  Bottleneck
Learning to Abstract with Nonparametric Variational Information Bottleneck
Melika Behjati
Fabio Fehr
James Henderson
SSL
32
1
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How
  Does Information Loss Affect Performance?
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
26
1
0
26 Oct 2023
Beyond MLE: Convex Learning for Text Generation
Beyond MLE: Convex Learning for Text Generation
Chenze Shao
Zhengrui Ma
Min Zhang
Yang Feng
34
3
0
26 Oct 2023
miditok: A Python package for MIDI file tokenization
miditok: A Python package for MIDI file tokenization
Nathan Fradet
Jean-Pierre Briot
F. Chhel
A. E. Seghrouchni
Nicolas Gutowski
39
39
0
26 Oct 2023
SpikingJelly: An open-source machine learning infrastructure platform
  for spike-based intelligence
SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence
Wei Fang
Yanqing Chen
Jianhao Ding
Zhaofei Yu
T. Masquelier
Ding Chen
Liwei Huang
Huihui Zhou
Guoqi Li
Yonghong Tian
36
206
0
25 Oct 2023
Enhanced Simultaneous Machine Translation with Word-level Policies
Enhanced Simultaneous Machine Translation with Word-level Policies
Kang Kim
Hankyu Cho
64
3
0
25 Oct 2023
Samsung R&D Institute Philippines at WMT 2023
Samsung R&D Institute Philippines at WMT 2023
Jan Christian Blaise Cruz
23
6
0
25 Oct 2023
GenKIE: Robust Generative Multimodal Document Key Information Extraction
GenKIE: Robust Generative Multimodal Document Key Information Extraction
Panfeng Cao
Ye Wang
Qiang Zhang
Zaiqiao Meng
SyDa
29
6
0
24 Oct 2023
Machine Translation for Nko: Tools, Corpora and Baseline Results
Machine Translation for Nko: Tools, Corpora and Baseline Results
M. Doumbouya
Baba Mamadi Diané
Solo Farabado Cissé
Djibrila Diané
Abdoulaye Sow
...
Fodé Moriba Bayo
Ibrahima Sory 2. Condé
Kalo Mory Diané
Chris Piech
Christopher D. Manning
51
3
0
24 Oct 2023
Reference Free Domain Adaptation for Translation of Noisy Questions with
  Question Specific Rewards
Reference Free Domain Adaptation for Translation of Noisy Questions with Question Specific Rewards
Baban Gain
Ramakrishna Appicharla
Soumya Chennabasavaraj
Nikesh Garera
Asif Ekbal
M. Chelliah
43
0
0
23 Oct 2023
Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into
  the Morphological Capabilities of a Large Language Model
Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model
Leonie Weissweiler
Valentin Hofmann
Anjali Kantharuban
Anna Cai
Ritam Dutt
...
Abhishek Vijayakumar
Haofei Yu
Hinrich Schütze
Kemal Oflazer
David R. Mortensen
41
10
0
23 Oct 2023
PartialFormer: Modeling Part Instead of Whole for Machine Translation
PartialFormer: Modeling Part Instead of Whole for Machine Translation
Tong Zheng
Bei Li
Huiwen Bao
Jiale Wang
Weiqiao Shan
Tong Xiao
Jingbo Zhu
MoE
AI4CE
16
0
0
23 Oct 2023
Non-autoregressive Streaming Transformer for Simultaneous Translation
Non-autoregressive Streaming Transformer for Simultaneous Translation
Zhengrui Ma
Shaolei Zhang
Shoutao Guo
Chenze Shao
Min Zhang
Yang Feng
37
13
0
23 Oct 2023
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
Zhiyuan Liu
Yaorui Shi
An Zhang
Enzhi Zhang
Kenji Kawaguchi
Xiang Wang
Tat-Seng Chua
AI4CE
44
36
0
23 Oct 2023
Rethinking Word-Level Auto-Completion in Computer-Aided Translation
Rethinking Word-Level Auto-Completion in Computer-Aided Translation
Xingyu Chen
Lemao Liu
Guoping Huang
Zhirui Zhang
Mingming Yang
Shuming Shi
Rui Wang
29
2
0
23 Oct 2023
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce
Baohao Liao
Michael Kozielski
Sanjika Hewavitharana
Jiangbo Yuan
Shahram Khadivi
Tomer Lancewicki
SSL
23
0
0
22 Oct 2023
Boosting Unsupervised Machine Translation with Pseudo-Parallel Data
Boosting Unsupervised Machine Translation with Pseudo-Parallel Data
Ivana Kvapilíková
Ondrej Bojar
MoE
30
0
0
22 Oct 2023
On Synthetic Data for Back Translation
On Synthetic Data for Back Translation
Jiahao Xu
Yubin Ruan
Wei Bi
Guoping Huang
Shuming Shi
Lihui Chen
Lemao Liu
38
12
0
20 Oct 2023
Simultaneous Machine Translation with Tailored Reference
Simultaneous Machine Translation with Tailored Reference
Shoutao Guo
Shaolei Zhang
Yang Feng
37
9
0
20 Oct 2023
Bridging the Gap between Synthetic and Authentic Images for Multimodal
  Machine Translation
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
Wenyu Guo
Qingkai Fang
Dong Yu
Yang Feng
27
6
0
20 Oct 2023
Analyzing Cognitive Plausibility of Subword Tokenization
Analyzing Cognitive Plausibility of Subword Tokenization
Lisa Beinborn
Yuval Pinter
31
17
0
20 Oct 2023
A Predictive Factor Analysis of Social Biases and Task-Performance in
  Pretrained Masked Language Models
A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models
Yi Zhou
Jose Camacho-Collados
Danushka Bollegala
86
6
0
19 Oct 2023
Character-level Chinese Backpack Language Models
Character-level Chinese Backpack Language Models
Hao Sun
John Hewitt
32
0
0
19 Oct 2023
Previous
123...121314...757677
Next