Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.00679
Cited By
Liputan6: A Large-scale Indonesian Dataset for Text Summarization
2 November 2020
Fajri Koto
Jey Han Lau
Timothy Baldwin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Liputan6: A Large-scale Indonesian Dataset for Text Summarization"
15 / 15 papers shown
BOE-XSUM: Extreme Summarization in Clear Language of Spanish Legal Decrees and Notifications
Andrés Fernández García
Javier de la Rosa
Julio Gonzalo
Roser Morante
Enrique Amigó
...
Victor Fresno
Adrián Ghajari
Guillermo Marco
Laura Plaza
Eva Sánchez Salido
AILaw
ELM
198
1
0
29 Sep 2025
SEA-BED: How Do Embedding Models Represent Southeast Asian Languages?
Wuttikorn Ponwitayarat
Raymond Ng
Jann Railey Montalan
Thura Aung
Jian Gang Ngui
...
Panuthep Tasawong
Erik Cambria
Ekapol Chuangsuwanich
Sarana Nutanong
Peerat Limkonchotiwat
FedML
224
2
0
17 Aug 2025
IndoPref: A Multi-Domain Pairwise Preference Dataset for Indonesian
Vanessa Rebecca Wiyono
David Anugraha
Ayu Purwarianti
Genta Indra Winata
381
1
0
29 Jul 2025
The State and Fate of Summarization Datasets: A Survey
Noam Dahan
Gabriel Stanovsky
HILM
606
0
0
07 Nov 2024
Investigating Text Shortening Strategy in BERT: Truncation vs Summarization
Mirza Alim Mutasodirin
Radityo Eko Prasojo
165
14
0
19 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
865
200
0
05 Mar 2024
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Fajri Koto
Nurul Aisyah
Jinyan Su
Timothy Baldwin
AI4Ed
LRM
ELM
354
63
0
07 Oct 2023
LR-Sum: Summarization for Less-Resourced Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Chester Palen-Michel
Constantine Lignos
262
8
0
19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Samuel Cahyawijaya
Holy Lovenia
Alham Fikri Aji
Genta Indra Winata
Bryan Wilie
...
Timothy Baldwin
Sebastian Ruder
Herry Sujaini
S. Sakti
Ayu Purwarianti
550
71
0
19 Dec 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
262
138
0
24 Mar 2022
BEAMetrics: A Benchmark for Language Generation Evaluation Evaluation
Thomas Scialom
Felix Hill
254
7
0
18 Oct 2021
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Fajri Koto
Jey Han Lau
Timothy Baldwin
VLM
339
123
0
10 Sep 2021
Evaluating the Efficacy of Summarization Evaluation across Languages
Findings (Findings), 2021
Fajri Koto
Jey Han Lau
Timothy Baldwin
310
21
0
02 Jun 2021
FFCI: A Framework for Interpretable Automatic Evaluation of Summarization
Journal of Artificial Intelligence Research (JAIR), 2020
Fajri Koto
Timothy Baldwin
Jey Han Lau
HILM
338
40
0
27 Nov 2020
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP
International Conference on Computational Linguistics (COLING), 2020
Fajri Koto
Afshin Rahimi
Jey Han Lau
Timothy Baldwin
283
384
0
02 Nov 2020
1
Page 1 of 1