2101.00403
Cited By
Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words
2 January 2021
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
Papers citing
"Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"
15 / 15 papers shown
Title
Evaluating Morphological Compositional Generalization in Large Language Models
Mete Ismayilzada
Defne Çirci
Jonne Sälevä
Hale Sirin
Abdullatif Köksal
Bhuwan Dhingra
Antoine Bosselut
Lonneke van der Plas
Duygu Ataman
26
2
0
16 Oct 2024
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
19
1
0
06 Apr 2024
The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations
Aina Garí Soler
Matthieu Labeau
Chloé Clavel
VLM
27
2
0
22 Feb 2024
Analyzing Cognitive Plausibility of Subword Tokenization
Lisa Beinborn
Yuval Pinter
17
17
0
20 Oct 2023
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data
Xinzhe Li
Ming Liu
Shang Gao
MU
17
8
0
02 Jul 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip H. S. Torr
Adel Bibi
16
96
0
17 May 2023
Incorporating Context into Subword Vocabularies
Shaked Yehezkel
Yuval Pinter
16
8
0
13 Oct 2022
Morphological Processing of Low-Resource Languages: Where We Are and What's Next
Adam Wiemerslage
Miikka Silfverberg
Changbing Yang
Arya D. McCarthy
Garrett Nicolai
Eliana Colunga
Katharina Kann
23
12
0
16 Mar 2022
Signal in Noise: Exploring Meaning Encoded in Random Character Sequences with Character-Aware Language Models
Mark Chu
Bhargav Srinivasa Desikan
E. Nadler
Ruggerio L. Sardo
Elise Darragh-Ford
Douglas Guilbeault
11
0
0
15 Mar 2022
Morphology Without Borders: Clause-Level Morphology
Omer Goldman
Reut Tsarfaty
AILaw
29
3
0
25 Feb 2022
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
23
137
0
20 Dec 2021
Efficient Intent Detection with Dual Sentence Encoders
I. Casanueva
Tadas Temčinas
D. Gerz
Matthew Henderson
Ivan Vulić
VLM
164
444
0
10 Mar 2020
Probabilistic FastText for Multi-Sense Word Embeddings
Ben Athiwaratkun
A. Wilson
Anima Anandkumar
10
136
0
07 Jun 2018
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,435
0
26 Sep 2016
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
29,632
0
16 Jan 2013