ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.09212
  4. Cited By
CMMLU: Measuring massive multitask language understanding in Chinese
v1v2 (latest)

CMMLU: Measuring massive multitask language understanding in Chinese

Annual Meeting of the Association for Computational Linguistics (ACL), 2023
15 June 2023
Jinyan Su
Yixuan Zhang
Fajri Koto
Yifei Yang
Hai Zhao
Yeyun Gong
Nan Duan
Tim Baldwin
    ALMELM
ArXiv (abs)PDFHTML

Papers citing "CMMLU: Measuring massive multitask language understanding in Chinese"

17 / 267 papers shown
Title
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model
  Pre-trained from Scratch
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from ScratchScience China Information Sciences (Sci China Inf Sci), 2023
Juntao Li
Zecheng Tang
Yuyang Ding
Pinzheng Wang
Pei Guo
...
Wenliang Chen
Guohong Fu
Qiaoming Zhu
Guodong Zhou
Hao Fei
304
6
0
19 Sep 2023
Baichuan 2: Open Large-scale Language Models
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Guosheng Dong
Zhiying Wu
ELMLRM
711
903
0
19 Sep 2023
Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation
  into Multicultural Proverbs and Sayings
Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and SayingsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Chen Cecilia Liu
Fajri Koto
Timothy Baldwin
Iryna Gurevych
LRM
171
28
0
15 Sep 2023
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment
  to Cultural Reasoning
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural ReasoningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Bin Wang
Zhengyuan Liu
Xin Huang
Fangkai Jiao
Yang Ding
Ai Ti Aw
Nancy F. Chen
LRM
230
95
0
09 Sep 2023
HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models
HAE-RAE Bench: Evaluation of Korean Knowledge in Language ModelsInternational Conference on Language Resources and Evaluation (LREC), 2023
Seunghyeok Hong
Hanwool Albert Lee
Suwan Kim
Huiseo Kim
Jaecheol Lee
Je Won Yeom
Jihyu Jung
Jung Woo Kim
Songseong Kim
RALMELM
310
37
0
06 Sep 2023
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open
  Generative Large Language Models
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Neha Sengupta
Sunil Kumar Sahu
Bokang Jia
Satheesh Katipomu
Jinyan Su
...
A. Jackson
Hector Xuguang Ren
Preslav Nakov
Timothy Baldwin
Eric P. Xing
LRM
288
58
0
30 Aug 2023
Through the Lens of Core Competency: Survey on Evaluation of Large
  Language Models
Through the Lens of Core Competency: Survey on Evaluation of Large Language ModelsChina National Conference on Chinese Computational Linguistics (CNCCL), 2023
Ziyu Zhuang
Qiguang Chen
Longxuan Ma
Mingda Li
Yi Han
Yushan Qian
Haopeng Bai
Zixian Feng
Weinan Zhang
Ting Liu
ELM
141
22
0
15 Aug 2023
Evaluating the Generation Capabilities of Large Chinese Language Models
Evaluating the Generation Capabilities of Large Chinese Language ModelsAI Open (AO), 2023
Hui Zeng
Jingyuan Xue
Meng Hao
Chen Sun
Bin Ning
Na Zhang
ELM
138
13
0
09 Aug 2023
CLEVA: Chinese Language Models EVAluation Platform
CLEVA: Chinese Language Models EVAluation PlatformConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yanyang Li
Jianqiao Zhao
Duo Zheng
Zi-Yuan Hu
Zhi Chen
...
Yongfeng Huang
Shijia Huang
Dahua Lin
Michael R. Lyu
Liwei Wang
ALMELM
247
15
0
09 Aug 2023
ChatHome: Development and Evaluation of a Domain-Specific Language Model
  for Home Renovation
ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation
Cheng Wen
Xianghui Sun
Shuaijiang Zhao
Xiaoquan Fang
Liang Chen
Wei Zou
ALM
86
31
0
28 Jul 2023
ArcGPT: A Large Language Model Tailored for Real-world Archival
  Applications
ArcGPT: A Large Language Model Tailored for Real-world Archival Applications
Shitou Zhang
Jingrui Hou
Siyuan Peng
Z. Li
Qibiao Hu
Peijie Wang
KELMRALMLLMAG
125
4
0
27 Jul 2023
A Survey on Evaluation of Large Language Models
A Survey on Evaluation of Large Language ModelsACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
Yu-Chu Chang
Xu Wang
Yongfeng Zhang
Yuanyi Wu
Linyi Yang
...
Yue Zhang
Yi-Ju Chang
Philip S. Yu
Qian Yang
Xingxu Xie
ELMLM&MAALM
576
2,595
0
06 Jul 2023
Style Over Substance: Evaluation Biases for Large Language Models
Style Over Substance: Evaluation Biases for Large Language ModelsInternational Conference on Computational Linguistics (COLING), 2023
Minghao Wu
Alham Fikri Aji
ALMELM
550
61
0
06 Jul 2023
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained
  Transformer
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer
Z. Li
Shitou Zhang
Hai Zhao
Yifei Yang
Dongjie Yang
LM&MA
217
26
0
01 Jul 2023
Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge
  Evaluation
Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge EvaluationAAAI Conference on Artificial Intelligence (AAAI), 2023
Zhouhong Gu
Xiaoxuan Zhu
Haoning Ye
Lin Zhang
Jianchen Wang
...
Zili Wang
Shusen Wang
Weiguo Zheng
Hongwei Feng
Yanghua Xiao
ALMELM
241
73
0
09 Jun 2023
Evaluating the Performance of Large Language Models on GAOKAO Benchmark
Evaluating the Performance of Large Language Models on GAOKAO Benchmark
Xiaotian Zhang
Chun-yan Li
Yi Zong
Zhengyu Ying
Liang He
Xipeng Qiu
ALMELM
260
151
0
21 May 2023
Huatuo-26M, a Large-scale Chinese Medical QA Dataset
Huatuo-26M, a Large-scale Chinese Medical QA DatasetNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Jianquan Li
Xidong Wang
Xiangbo Wu
Zhiyi Zhang
Xiaolong Xu
Jie Fu
Prayag Tiwari
Xiang Wan
Benyou Wang
LM&MA
314
61
0
02 May 2023
Previous
123456