2505.20428
The UD-NewsCrawl Treebank: Reflections and Challenges from a Large-scale Tagalog Syntactic Annotation Project
26 May 2025
Angelina A. Aquino
Lester James V. Miranda
Elsie Marie T. Or
Papers citing
"The UD-NewsCrawl Treebank: Reflections and Challenges from a Large-scale Tagalog Syntactic Annotation Project"
19 / 19 papers shown
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Xin Zhang
Yanzhao Zhang
Dingkun Long
Wen Xie
Ziqi Dai
...
Pengjun Xie
Fei Huang
Meishan Zhang
Wenjie Li
Min Zhang
190
176
0
29 Jul 2024
Extrinsic Evaluation of Cultural Competence in Large Language Models
Shaily Bhatt
Fernando Diaz
ELM
EGVM
175
16
0
17 Jun 2024
Joint Lemmatization and Morphological Tagging with LEMMING
Thomas Müller
Robert Bamler
Kangyang Luo
Hinrich Schütze
85
124
0
28 May 2024
On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction
Jianwei Wang
Tianyin Wang
Huiping Zhuang
174
3
0
28 Feb 2024
Unintended Impacts of LLM Alignment on Global Representation
Michael Joseph Ryan
William B. Held
Diyi Yang
164
50
0
22 Feb 2024
calamanCy: A Tagalog Natural Language Processing Toolkit
Lester James V. Miranda
76
2
0
13 Nov 2023
Developing a Named Entity Recognition Dataset for Tagalog
Lester James V. Miranda
105
8
0
13 Nov 2023
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
David Ifeoluwa Adelani
Hannah Liu
Xiaoyu Shen
Nikita Vassilyev
Jesujoba Oluwadara Alabi
Yanke Mao
Haonan Gao
Annie En-Shiun Lee
ELM
184
111
0
14 Sep 2023
Benchmarking zero-shot and few-shot approaches for tokenization, tagging, and dependency parsing of Tagalog text
Angelina A. Aquino
Franz A. de Leon
88
2
0
03 Aug 2022
Improving Large-scale Language Models and Resources for Filipino
Jan Christian Blaise Cruz
C. Cheng
AI4CE
93
38
0
11 Nov 2021
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
International Conference on Learning Representations (ICLR), 2021
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
371
3,004
0
05 Jun 2020
Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
Joakim Nivre
M. Marneffe
Filip Ginter
Jan Hajič
Christopher D. Manning
S. Pyysalo
Sebastian Schuster
Francis M. Tyers
Daniel Zeman
VLM
187
542
0
22 Apr 2020
The State and Fate of Linguistic Diversity and Inclusion in the NLP World
Pratik M. Joshi
Sebastin Santy
A. Budhiraja
Kalika Bali
Monojit Choudhury
LMTD
270
964
0
20 Apr 2020
Stanza: A Python Natural Language Processing Toolkit for Many Human Languages
Peng Qi
Yuhao Zhang
Yuhui Zhang
Jason Bolton
Christopher D. Manning
AI4TS
475
1,794
0
16 Mar 2020
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
352
7,082
0
05 Nov 2019
On the Cross-lingual Transferability of Monolingual Representations
Mikel Artetxe
Sebastian Ruder
Dani Yogatama
411
839
0
25 Oct 2019
75 Languages, 1 Model: Parsing Universal Dependencies Universally
Dan Kondratyuk
Milan Straka
242
266
0
03 Apr 2019
Enriching Word Vectors with Subword Information
Piotr Bojanowski
Edouard Grave
Armand Joulin
Tomas Mikolov
NAI
SSL
VLM
437
10,194
0
15 Jul 2016
Learning Dependency-Based Compositional Semantics
Percy Liang
Michael I. Jordan
Dan Klein
CoGe
195
611
0
30 Sep 2011