ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.10118
  4. Cited By
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

14 June 2024
Holy Lovenia
Rahmad Mahendra
Salsabil Maulana Akbar
Lester James Validad Miranda
Jennifer Santoso
Elyanah Aco
Akhdan Fadhilah
Jonibek Mansurov
Joseph Marvin Imperial
Onno P. Kampman
Joel Ruben Antony Moniz
Muhammad Ravi Shulthan Habibi
Frederikus Hudi
Railey Montalan
Ryan Ignatius
Joanito Agili Lopo
William Nixon
Börje F. Karlsson
James Jaya
Ryandito Diandaru
Yuze Gao
Patrick Amadeus
Bin Wang
Jan Christian Blaise Cruz
Chenxi Whitehouse
Ivan Halim Parmonangan
Maria Khelli
Wenyu Zhang
Lucky Susanto
Reynard Adha Ryanda
Sonny Lazuardi Hermawan
Dan John Velasco
Muhammad Dehan Al Kautsar
Willy Fitra Hendria
Yasmin Moslem
Noah Flynn
Muhammad Farid Adilazuarda
Haochen Li
Johanes Lee
R. Damanhuri
Shuo Sun
M. Qorib
Amirbek Djanibekov
Wei Qi Leong
Quyet V. Do
Niklas Muennighoff
T. Pansuwan
Ilham Firdausi Putra
Yan Xu
Ngee Chia Tai
Ayu Purwarianti
Sebastian Ruder
William-Chandra Tjhi
Peerat Limkonchotiwat
Alham Fikri Aji
Sedrick Scott Keh
Genta Indra Winata
Ruochen Zhang
Fajri Koto
Zheng-Xin Yong
Samuel Cahyawijaya
ArXivPDFHTML

Papers citing "SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages"

17 / 17 papers shown
Title
Bemba Speech Translation: Exploring a Low-Resource African Language
Bemba Speech Translation: Exploring a Low-Resource African Language
Muhammad Hazim Al Farouq
Aman Kassahun Wassie
Yasmin Moslem
31
0
0
05 May 2025
Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation
Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation
Toqeer Ehsan
Thamar Solorio
26
0
0
07 Apr 2025
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Longxu Dou
Qian Liu
Fan Zhou
Changyu Chen
Zili Wang
...
Tianyu Pang
Chao Du
Xinyi Wan
Wei Lu
Min Lin
82
1
0
18 Feb 2025
Large Multimodal Models for Low-Resource Languages: A Survey
Large Multimodal Models for Low-Resource Languages: A Survey
Marian Lupascu
Ana-Cristina Rogoz
Mihai-Sorin Stupariu
Radu Tudor Ionescu
46
1
0
08 Feb 2025
Thank You, Stingray: Multilingual Large Language Models Can Not (Yet)
  Disambiguate Cross-Lingual Word Sense
Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense
Samuel Cahyawijaya
Ruochen Zhang
Holy Lovenia
Jan Christian Blaise Cruz
Elisa Gilbert
Hiroki Nomoto
Alham Fikri Aji
LRM
23
0
0
28 Oct 2024
Eir: Thai Medical Large Language Models
Eir: Thai Medical Large Language Models
Yutthakorn Thiprak
Rungtam Ngodngamthaweesuk
Songtam Ngodngamtaweesuk
LM&MA
ELM
30
0
0
13 Sep 2024
Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech
  Translation
Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech Translation
Yasmin Moslem
SyDa
33
0
0
25 Jun 2024
Resilience of Large Language Models for Noisy Instructions
Resilience of Large Language Models for Noisy Instructions
Bin Wang
Chengwei Wei
Zhengyuan Liu
Geyu Lin
Nancy F. Chen
26
10
0
15 Apr 2024
Cendol: Open Instruction-tuned Generative Large Language Models for
  Indonesian Languages
Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages
Samuel Cahyawijaya
Holy Lovenia
Fajri Koto
Rifki Afina Putri
Emmanuel Dave
...
Bryan Wilie
Genta Indra Winata
Alham Fikri Aji
Ayu Purwarianti
Pascale Fung
34
15
0
09 Apr 2024
IndoCulture: Exploring Geographically-Influenced Cultural Commonsense
  Reasoning Across Eleven Indonesian Provinces
IndoCulture: Exploring Geographically-Influenced Cultural Commonsense Reasoning Across Eleven Indonesian Provinces
Fajri Koto
Rahmad Mahendra
Nurul Aisyah
Timothy Baldwin
LRM
51
16
0
02 Apr 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction
  Tuning
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Shivalika Singh
Freddie Vargus
Daniel D'souza
Börje F. Karlsson
Abinaya Mahendiran
...
Max Bartolo
Julia Kreutzer
A. Ustun
Marzieh Fadaee
Sara Hooker
113
115
0
09 Feb 2024
Scaling Speech Technology to 1,000+ Languages
Scaling Speech Technology to 1,000+ Languages
Vineel Pratap
Andros Tjandra
Bowen Shi
Paden Tomasello
Arun Babu
...
Yossi Adi
Xiaohui Zhang
Wei-Ning Hsu
Alexis Conneau
Michael Auli
VLM
73
297
0
22 May 2023
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset
Ashish V. Thapliyal
Jordi Pont-Tuset
Xi Chen
Radu Soricut
VGen
55
71
0
25 May 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
XTREME-S: Evaluating Cross-lingual Speech Representations
Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
...
Orhan Firat
Michael Auli
Sebastian Ruder
Jason Riesa
Melvin Johnson
VLM
AILaw
ELM
38
21
0
21 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
203
1,651
0
15 Oct 2021
Systematic Inequalities in Language Technology Performance across the
  World's Languages
Systematic Inequalities in Language Technology Performance across the World's Languages
Damián E. Blasi
Antonios Anastasopoulos
Graham Neubig
98
130
0
13 Oct 2021
1