Liputan6: A Large-scale Indonesian Dataset for Text Summarization

2 November 2020

Papers citing "Liputan6: A Large-scale Indonesian Dataset for Text Summarization"

BOE-XSUM: Extreme Summarization in Clear Language of Spanish Legal Decrees and Notifications

Andrés Fernández García

...

198

29 Sep 2025

SEA-BED: How Do Embedding Models Represent Southeast Asian Languages?

Wuttikorn Ponwitayarat

...

Ekapol Chuangsuwanich

Sarana Nutanong

Peerat Limkonchotiwat

FedML

224

17 Aug 2025

IndoPref: A Multi-Domain Pairwise Preference Dataset for Indonesian

Vanessa Rebecca Wiyono

David Anugraha

Ayu Purwarianti

Genta Indra Winata

381

29 Jul 2025

The State and Fate of Summarization Datasets: A Survey

Noam Dahan

Gabriel Stanovsky

HILM

606

07 Nov 2024

Investigating Text Shortening Strategy in BERT: Truncation vs Summarization

Mirza Alim Mutasodirin

Radityo Eko Prasojo

165

19 Mar 2024

A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods

865

200

05 Mar 2024

Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

354

07 Oct 2023

LR-Sum: Summarization for Less-Resourced Languages

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Chester Palen-Michel

Constantine Lignos

262

19 Dec 2022

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

...

550

19 Dec 2022

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

...

262

138

24 Mar 2022

BEAMetrics: A Benchmark for Language Generation Evaluation Evaluation

Thomas Scialom

Felix Hill

254

18 Oct 2021

IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

339

123

10 Sep 2021

Evaluating the Efficacy of Summarization Evaluation across Languages

Findings, 2021

Fajri Koto

Jey Han Lau

Timothy Baldwin

310

02 Jun 2021

FFCI: A Framework for Interpretable Automatic Evaluation of Summarization

Journal of Artificial Intelligence Research (JAIR), 2020

338

27 Nov 2020

IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP

International Conference on Computational Linguistics (COLING), 2020

283

384

02 Nov 2020