ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.09805
  4. Cited By
A Survey of Corpora for Germanic Low-Resource Languages and Dialects

A Survey of Corpora for Germanic Low-Resource Languages and Dialects

19 April 2023
Verena Blaschke
Hinrich Schütze
Barbara Plank
ArXivPDFHTML

Papers citing "A Survey of Corpora for Germanic Low-Resource Languages and Dialects"

11 / 11 papers shown
Title
RLHF Can Speak Many Languages: Unlocking Multilingual Preference
  Optimization for LLMs
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
John Dang
Arash Ahmadian
Kelly Marchisio
Julia Kreutzer
A. Ustun
Sara Hooker
31
21
0
02 Jul 2024
Voices Unheard: NLP Resources and Models for Yorùbá Regional
  Dialects
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Orevaoghene Ahia
Anuoluwapo Aremu
Diana Abagyan
Hila Gonen
David Ifeoluwa Adelani
Daud Abolade
Noah A. Smith
Yulia Tsvetkov
59
3
0
27 Jun 2024
Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian
  Dialectal Data
Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data
Siyao Peng
Zihang Sun
Huangyan Shan
Marie Kolm
Verena Blaschke
Ekaterina Artemova
Barbara Plank
24
2
0
19 Mar 2024
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Verena Blaschke
Barbara Kovavcić
Siyao Peng
Hinrich Schütze
Barbara Plank
21
4
0
15 Mar 2024
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language
  Technology for German Dialects
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects
Verena Blaschke
Christoph Purschke
Hinrich Schütze
Barbara Plank
16
9
0
19 Feb 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction
  Tuning
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Shivalika Singh
Freddie Vargus
Daniel D'souza
Börje F. Karlsson
Abinaya Mahendiran
...
Max Bartolo
Julia Kreutzer
A. Ustun
Marzieh Fadaee
Sara Hooker
115
115
0
09 Feb 2024
AlbNews: A Corpus of Headlines for Topic Modeling in Albanian
AlbNews: A Corpus of Headlines for Topic Modeling in Albanian
Erion Çano
Dario Lamaj
8
1
0
06 Feb 2024
GlotLID: Language Identification for Low-Resource Languages
GlotLID: Language Identification for Low-Resource Languages
Amir Hossein Kargaran
Ayyoob Imani
François Yvon
Hinrich Schütze
14
10
0
24 Oct 2023
Low-resource Bilingual Dialect Lexicon Induction with Large Language
  Models
Low-resource Bilingual Dialect Lexicon Induction with Large Language Models
Ekaterina Artemova
Barbara Plank
21
1
0
19 Apr 2023
SDS-200: A Swiss German Speech to Standard German Text Corpus
SDS-200: A Swiss German Speech to Standard German Text Corpus
Michel Plüss
Manuela Hurlimann
Marc Cuny
Alla Stockli
Nikolaos Kapotis
...
Yanick Schraner
Amit Jain
Jan Deriu
Mark Cieliebak
Manfred Vogel
16
20
0
19 May 2022
Challenges of language technologies for the indigenous languages of the
  Americas
Challenges of language technologies for the indigenous languages of the Americas
Manuel Mager
Ximena Gutierrez-Vasques
Gerardo E Sierra
Ivan Vladimir Meza Ruiz
VLM
184
87
0
12 Jun 2018
1