ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.18951
  4. Cited By
BnMMLU: Measuring Massive Multitask Language Understanding in Bengali
v1v2 (latest)

BnMMLU: Measuring Massive Multitask Language Understanding in Bengali

25 May 2025
Saman Sarker Joy
Swakkhar Shatabda
    ELM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (28717★)

Papers citing "BnMMLU: Measuring Massive Multitask Language Understanding in Bengali"

6 / 6 papers shown
CRaFT: An Explanation-Based Framework for Evaluating Cultural Reasoning in Multilingual Language Models
CRaFT: An Explanation-Based Framework for Evaluating Cultural Reasoning in Multilingual Language Models
Shehenaz Hossain
Haithem Afli
ELMLRM
107
0
0
15 Oct 2025
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark
  for Chinese Large Language Models
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Linhao Yu
Tianyu Dong
...
Peiyi Zhang
Qingqing Lyu
Xiaowen Su
Qun Liu
Deyi Xiong
ELMALM
270
31
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Language Model Tokenizers Introduce Unfairness Between LanguagesNeural Information Processing Systems (NeurIPS), 2023
Aleksandar Petrov
Emanuele La Malfa
Juil Sock
Adel Bibi
345
169
0
17 May 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large
  Language Models in Multilingual Learning
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELMLM&MA
239
356
0
12 Apr 2023
Measuring Massive Multitask Language Understanding
Measuring Massive Multitask Language UnderstandingInternational Conference on Learning Representations (ICLR), 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELMRALM
2.3K
6,566
0
07 Sep 2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training,
  Understanding and Generation
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Yaobo Liang
Nan Duan
Yeyun Gong
Ning Wu
Fenfei Guo
...
Shuguang Liu
Fan Yang
Daniel Fernando Campos
Rangan Majumder
Ming Zhou
ELMVLM
310
370
0
03 Apr 2020
1