ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06961
  4. Cited By
Mimicking Word Embeddings using Subword RNNs

Mimicking Word Embeddings using Subword RNNs

21 July 2017
Yuval Pinter
Robert Guthrie
Jacob Eisenstein
ArXivPDFHTML

Papers citing "Mimicking Word Embeddings using Subword RNNs"

50 / 68 papers shown
Title
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren
Yihong Liu
Hinrich Schütze
31
0
0
21 Apr 2025
Overcoming Vocabulary Constraints with Pixel-level Fallback
Overcoming Vocabulary Constraints with Pixel-level Fallback
Jonas F. Lotz
Hendra Setiawan
Stephan Peitz
Yova Kementchedjhieva
43
0
0
02 Apr 2025
What is the Role of Small Models in the LLM Era: A Survey
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
63
23
0
10 Sep 2024
Reconsidering Token Embeddings with the Definitions for Pre-trained
  Language Models
Reconsidering Token Embeddings with the Definitions for Pre-trained Language Models
Ying Zhang
Zhuoran Liu
Manabu Okumura
18
0
0
02 Aug 2024
Zero-Shot Tokenizer Transfer
Zero-Shot Tokenizer Transfer
Benjamin Minixhofer
E. Ponti
Ivan Vulić
VLM
44
9
0
13 May 2024
Advancements in eHealth Data Analytics through Natural Language
  Processing and Deep Learning
Advancements in eHealth Data Analytics through Natural Language Processing and Deep Learning
Elena Simona Apostol
Ciprian-Octavian Truică
14
0
0
19 Jan 2024
Representation Learning via Variational Bayesian Networks
Representation Learning via Variational Bayesian Networks
Oren Barkan
Avi Caciularu
Idan Rejwan
Ori Katz
Jonathan Weill
Itzik Malkiel
Noam Koenigstein
BDL
22
15
0
28 Jun 2023
Revisit Out-Of-Vocabulary Problem for Slot Filling: A Unified
  Contrastive Frameword with Multi-level Data Augmentations
Revisit Out-Of-Vocabulary Problem for Slot Filling: A Unified Contrastive Frameword with Multi-level Data Augmentations
Daichi Guo
Guanting Dong
Dayuan Fu
Yuxiang Wu
Chen Zeng
...
Xuefeng Li
Zechen Wang
Keqing He
Xinyue Cui
Weiran Xu
11
8
0
27 Feb 2023
Inducing Character-level Structure in Subword-based Language Models with
  Type-level Interchange Intervention Training
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training
Jing-ling Huang
Zhengxuan Wu
Kyle Mahowald
Christopher Potts
24
13
0
19 Dec 2022
API-Miner: an API-to-API Specification Recommendation Engine
API-Miner: an API-to-API Specification Recommendation Engine
S. Moon
Gregory Kerr
Fran Silavong
Sean J. Moran
AI4TS
20
2
0
14 Dec 2022
Leveraging knowledge graphs to update scientific word embeddings using
  latent semantic imputation
Leveraging knowledge graphs to update scientific word embeddings using latent semantic imputation
J. Hoelscher-Obermaier
Edward Stevinson
V. Stauber
Ivaylo Zhelev
Victor Botev
R. Wu
J. Minton
14
0
0
27 Oct 2022
Learning to Learn to Predict Performance Regressions in Production at
  Meta
Learning to Learn to Predict Performance Regressions in Production at Meta
M. Beller
Hongyu Li
V. Nair
V. Murali
Imad Ahmad
Jürgen Cito
Drew Carlson
Gareth Ari Aye
Wes Dyer
31
5
0
08 Aug 2022
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an
  Information Theoretic Perspective
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective
Xiao Wang
Shihan Dou
Li Xiong
Yicheng Zou
Qi Zhang
Tao Gui
Liang Qiao
Zhanzhan Cheng
Xuanjing Huang
11
26
0
09 Apr 2022
drsphelps at SemEval-2022 Task 2: Learning idiom representations using
  BERTRAM
drsphelps at SemEval-2022 Task 2: Learning idiom representations using BERTRAM
Dylan Phelps
20
5
0
06 Apr 2022
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models
  Robust with Little Cost
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
13
14
0
15 Mar 2022
Between words and characters: A Brief History of Open-Vocabulary
  Modeling and Tokenization in NLP
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
30
140
0
20 Dec 2021
Integrating Approaches to Word Representation
Integrating Approaches to Word Representation
Yuval Pinter
NAI
48
5
0
10 Sep 2021
Log-based Anomaly Detection Without Log Parsing
Log-based Anomaly Detection Without Log Parsing
Van-Hoang Le
Hongyu Zhang
22
179
0
04 Aug 2021
Dynamic Language Models for Continuously Evolving Content
Dynamic Language Models for Continuously Evolving Content
Spurthi Amba Hombaiah
Tao Chen
Mingyang Zhang
Michael Bendersky
Marc Najork
CLL
KELM
34
37
0
11 Jun 2021
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language
  Generation
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation
Xin Liu
Baosong Yang
Dayiheng Liu
Haibo Zhang
Weihua Luo
Min Zhang
Haiying Zhang
Jinsong Su
18
18
0
11 Jun 2021
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language
  Representation
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation
J. Clark
Dan Garrette
Iulia Turc
John Wieting
27
210
0
11 Mar 2021
Trajectory-Based Meta-Learning for Out-Of-Vocabulary Word Embedding
  Learning
Trajectory-Based Meta-Learning for Out-Of-Vocabulary Word Embedding Learning
Gordon Buck
Andreas Vlachos
26
1
0
24 Feb 2021
Towards generalisable hate speech detection: a review on obstacles and
  solutions
Towards generalisable hate speech detection: a review on obstacles and solutions
Wenjie Yin
A. Zubiaga
117
164
0
17 Feb 2021
Recent Trends in Named Entity Recognition (NER)
Recent Trends in Named Entity Recognition (NER)
Aryan Roy
23
37
0
25 Jan 2021
Recomposition vs. Prediction: A Novel Anomaly Detection for Discrete
  Events Based On Autoencoder
Recomposition vs. Prediction: A Novel Anomaly Detection for Discrete Events Based On Autoencoder
Lun-Pin Yuan
Peng Liu
Sencun Zhu
AI4TS
17
15
0
27 Dec 2020
A Comprehensive Survey on Word Representation Models: From Classical to
  State-Of-The-Art Word Representation Language Models
A Comprehensive Survey on Word Representation Models: From Classical to State-Of-The-Art Word Representation Language Models
Usman Naseem
Imran Razzak
S. Khan
M. Prasad
12
156
0
28 Oct 2020
Char2Subword: Extending the Subword Embedding Space Using Robust
  Character Compositionality
Char2Subword: Extending the Subword Embedding Space Using Robust Character Compositionality
Gustavo Aguilar
Bryan McCann
Tong Niu
Nazneen Rajani
N. Keskar
Thamar Solorio
47
12
0
24 Oct 2020
PBoS: Probabilistic Bag-of-Subwords for Generalizing Word Embedding
PBoS: Probabilistic Bag-of-Subwords for Generalizing Word Embedding
Jinman Zhao
Shawn Zhong
Xiaomin Zhang
Yingyu Liang
BDL
31
5
0
21 Oct 2020
Knowledge Efficient Deep Learning for Natural Language Processing
Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang
12
2
0
28 Aug 2020
Deep learning models for representing out-of-vocabulary words
Deep learning models for representing out-of-vocabulary words
Johannes V. Lochter
Renato M. Silva
Tiago A. Almeida
13
15
0
14 Jul 2020
Quantifying the Contextualization of Word Representations with Semantic
  Class Probing
Quantifying the Contextualization of Word Representations with Semantic Class Probing
Mengjie Zhao
Philipp Dufter
Yadollah Yaghoobzadeh
Hinrich Schütze
12
27
0
25 Apr 2020
NYTWIT: A Dataset of Novel Words in the New York Times
NYTWIT: A Dataset of Novel Words in the New York Times
Yuval Pinter
Cassandra L. Jacobs
Max Bittker
8
14
0
06 Mar 2020
Hierarchical Character Embeddings: Learning Phonological and Semantic
  Representations in Languages of Logographic Origin using Recursive Neural
  Networks
Hierarchical Character Embeddings: Learning Phonological and Semantic Representations in Languages of Logographic Origin using Recursive Neural Networks
Minh Nguyen
G. Ngo
Nancy F. Chen
19
19
0
20 Dec 2019
Attending Form and Context to Generate Specialized
  Out-of-VocabularyWords Representations
Attending Form and Context to Generate Specialized Out-of-VocabularyWords Representations
Nicolas Garneau
Jean-Samuel Leboeuf
Yuval Pinter
Luc Lamontagne
8
1
0
14 Dec 2019
Learning to Learn Words from Visual Scenes
Learning to Learn Words from Visual Scenes
Dídac Surís
Dave Epstein
Heng Ji
Shih-Fu Chang
Carl Vondrick
VLM
CLIP
SSL
OffRL
22
4
0
25 Nov 2019
Word Embedding Algorithms as Generalized Low Rank Models and their
  Canonical Form
Word Embedding Algorithms as Generalized Low Rank Models and their Canonical Form
Kian Kenyon-Dean
12
3
0
06 Nov 2019
BPE-Dropout: Simple and Effective Subword Regularization
BPE-Dropout: Simple and Effective Subword Regularization
Ivan Provilkov
Dmitrii Emelianenko
Elena Voita
24
276
0
29 Oct 2019
A Hybrid Semantic Parsing Approach for Tabular Data Analysis
A Hybrid Semantic Parsing Approach for Tabular Data Analysis
Yan Gao
Jian-Guang Lou
Dongmei Zhang
LMTD
8
3
0
23 Oct 2019
Estimator Vectors: OOV Word Embeddings based on Subword and Context Clue
  Estimates
Estimator Vectors: OOV Word Embeddings based on Subword and Context Clue Estimates
R. Patel
C. Domeniconi
17
5
0
18 Oct 2019
BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized
  Model Performance
BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance
Timo Schick
Hinrich Schütze
8
48
0
16 Oct 2019
Improving Pre-Trained Multilingual Models with Vocabulary Expansion
Improving Pre-Trained Multilingual Models with Vocabulary Expansion
Hai Wang
Dian Yu
Kai Sun
Jianshu Chen
Dong Yu
28
40
0
26 Sep 2019
On the Importance of Subword Information for Morphological Tasks in
  Truly Low-Resource Languages
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages
Yi Zhu
Benjamin Heinzerling
Ivan Vulić
Michael Strube
Roi Reichart
Anna Korhonen
19
19
0
26 Sep 2019
Multimodal deep networks for text and image-based document
  classification
Multimodal deep networks for text and image-based document classification
Nicolas Audebert
Catherine Herold
K. Slimani
Cédric Vidal
17
96
0
15 Jul 2019
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Ziniu Hu
Ting-Li Chen
Kai-Wei Chang
Yizhou Sun
16
75
0
01 Jul 2019
Out-of-Vocabulary Embedding Imputation with Grounded Language
  Information by Graph Convolutional Networks
Out-of-Vocabulary Embedding Imputation with Grounded Language Information by Graph Convolutional Networks
Ziyi Yang
Chenguang Zhu
Vin Sachidananda
Eric F. Darve
13
4
0
10 Jun 2019
Learning Task-specific Representation for Novel Words in Sequence
  Labeling
Learning Task-specific Representation for Novel Words in Sequence Labeling
Minlong Peng
Qi Zhang
Xiaoyu Xing
Tao Gui
Jinlan Fu
Xuanjing Huang
23
8
0
29 May 2019
Misspelling Oblivious Word Embeddings
Misspelling Oblivious Word Embeddings
Bora Edizel
Aleksandra Piktus
Piotr Bojanowski
Rui A. Ferreira
Edouard Grave
Fabrizio Silvestri
12
63
0
23 May 2019
A Systematic Study of Leveraging Subword Information for Learning Word
  Representations
A Systematic Study of Leveraging Subword Information for Learning Word Representations
Yi Zhu
Ivan Vulić
Anna Korhonen
19
30
0
16 Apr 2019
Rare Words: A Major Problem for Contextualized Embeddings And How to Fix
  it by Attentive Mimicking
Rare Words: A Major Problem for Contextualized Embeddings And How to Fix it by Attentive Mimicking
Timo Schick
Hinrich Schütze
8
95
0
14 Apr 2019
Attentive Mimicking: Better Word Embeddings by Attending to Informative
  Contexts
Attentive Mimicking: Better Word Embeddings by Attending to Informative Contexts
Timo Schick
Hinrich Schütze
17
47
0
02 Apr 2019
12
Next