Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.10158
Cited By
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
20 April 2023
Verena Blaschke
Hinrich Schütze
Barbara Plank
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages"
9 / 9 papers shown
Title
Evaluating Pixel Language Models on Non-Standardized Languages
Alberto Muñoz-Ortiz
Verena Blaschke
Barbara Plank
59
1
0
12 Dec 2024
We're Calling an Intervention: Exploring Fundamental Hurdles in Adapting Language Models to Nonstandard Text
Aarohi Srivastava
David Chiang
54
0
0
10 Apr 2024
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Fahim Faisal
Orevaoghene Ahia
Aarohi Srivastava
Kabir Ahuja
David Chiang
Yulia Tsvetkov
Antonios Anastasopoulos
56
27
0
16 Mar 2024
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Verena Blaschke
Barbara Kovavcić
Siyao Peng
Hinrich Schütze
Barbara Plank
29
4
0
15 Mar 2024
Natural Language Processing for Dialects of a Language: A Survey
Aditya Joshi
Raj Dabre
Diptesh Kanojia
Zhuang Li
Haolan Zhan
Gholamreza Haffari
Doris Dippold
LM&MA
26
27
0
11 Jan 2024
BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text
Aarohi Srivastava
David Chiang
25
6
0
31 Oct 2023
Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization
Ningyu Xu
Qi Zhang
Jingting Ye
Menghan Zhang
Xuanjing Huang
38
4
0
19 Oct 2023
CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
Kaushal Kumar Maurya
Rahul Kejriwal
M. Desarkar
Anoop Kunchukuttan
23
1
0
09 May 2023
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri
Olivier Ferret
Thomas Lavergne
Hiroshi Noji
Pierre Zweigenbaum
Junichi Tsujii
66
156
0
20 Oct 2020
1