Measuring Massive Multitask Language Understanding

International Conference on Learning Representations (ICLR), 2020
7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
    ELM, RALM

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 4,486 papers shown
Should We Attend More or Less? Modulating Attention for Fairness
A. Zayed
Gonçalo Mordido
Samira Shabanian
Sarath Chandar
266
15
0
22 May 2023
RWKV: Reinventing RNNs for the Transformer Era
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bo Peng
Eric Alcaide
Quentin G. Anthony
Alon Albalak
Samuel Arcadinho
...
Qihang Zhao
P. Zhou
Qinghua Zhou
Jian Zhu
Rui-Jie Zhu
598
873
0
22 May 2023
Iterative Forward Tuning Boosts In-Context Learning in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jiaxi Yang
Binyuan Hui
Min Yang
Bailin Wang
Bowen Li
Binhua Li
Fei Huang
Yongbin Li
285
19
0
22 May 2023
ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Dongfang Li
Jindi Yu
Baotian Hu
Zhenran Xu
Hao Fei
ELM
182
15
0
22 May 2023
Meta-in-context learning in large language models
Neural Information Processing Systems (NeurIPS), 2023
Julian Coda-Forno
Marcel Binz
Zeynep Akata
M. Botvinick
Jane X. Wang
Eric Schulz
LRM
455
60
0
22 May 2023
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting
International Conference on Learning Representations (ICLR), 2023
Xinlu Zhang
Shiyang Li
Xianjun Yang
Chenxin Tian
Yao Qin
Linda R. Petzold
282
13
0
22 May 2023
Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Linyuan Gong
Chenyan Xiong
Xiaodong Liu
Payal Bajaj
Yiqing Xie
Alvin Cheung
Jianfeng Gao
Xia Song
VLM, AI4CE
162
2
0
21 May 2023
Evaluating the Performance of Large Language Models on GAOKAO Benchmark
Xiaotian Zhang
Chun-yan Li
Yi Zong
Zhengyu Ying
Liang He
Xipeng Qiu
ALM, ELM
391
167
0
21 May 2023
VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models
Dao Xuan-Quy
Le Ngoc-Bich
Vo The-Duy
Phan Xuan-Dung
Ngo Bac-Bien
Nguyen Van-Tien
Nguyen Thi-My-Thanh
Nguyen Hong-Phuoc
145
22
0
20 May 2023
Evaluation of medium-large Language Models at zero-shot closed book generative question answering
Artificial Intelligence and Applications (AIA), 2023
René Peinl
Johannes Wirth
ELM
232
8
0
19 May 2023
Prompting with Pseudo-Code Instructions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mayank Mishra
Praveen Venkateswaran
Riyaz Ahmad Bhat
V. Rudramurthy
Danish Contractor
Srikanth G. Tamilselvam
342
17
0
19 May 2023
Separating form and meaning: Using self-consistency to quantify task understanding across multiple senses
IEEE Games Entertainment Media Conference (IEEE GEM), 2023
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
LRM
309
17
0
19 May 2023
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kai Xiong
Xiao Ding
Yixin Cao
Ting Liu
Bing Qin
552
119
0
19 May 2023
Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt
Zhaozhuo Xu
Zirui Liu
Beidi Chen
Yuxin Tang
Jue Wang
Kaixiong Zhou
Helen Zhou
Anshumali Shrivastava
MQ
252
39
0
17 May 2023
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Linhao Yu
Tianyu Dong
...
Peiyi Zhang
Qingqing Lyu
Xiaowen Su
Qun Liu
Deyi Xiong
ELM, ALM
300
32
0
17 May 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
International Conference on Learning Representations (ICLR), 2023
Shangbin Feng
Weijia Shi
Yuyang Bai
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
KELM
396
50
0
17 May 2023
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
Neural Information Processing Systems (NeurIPS), 2023
Yuzhen Huang
Yuzhuo Bai
Zhihao Zhu
Junlei Zhang
Jinghan Zhang
...
Yikai Zhang
Jiayi Lei
Yao Fu
Maosong Sun
Junxian He
ELM, LRM
426
751
0
15 May 2023
Symbol tuning improves in-context learning in language models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jerry W. Wei
Le Hou
Andrew Kyle Lampinen
Xiangning Chen
Da Huang
...
Xinyun Chen
Yifeng Lu
Denny Zhou
Tengyu Ma
Quoc V. Le
LRM
341
103
0
15 May 2023
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haoyang Huang
Tianyi Tang
Dongdong Zhang
Wayne Xin Zhao
Ting Song
Yan Xia
Furu Wei
LRM
350
228
0
11 May 2023
Active Retrieval Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhengbao Jiang
Frank F. Xu
Luyu Gao
Zhiqing Sun
Qian Liu
Jane Dwivedi-Yu
Yiming Yang
Jamie Callan
Graham Neubig
RALM
405
508
0
11 May 2023
Taking Advice from ChatGPT
Peter Zhang
281
5
0
11 May 2023
Long-Tailed Question Answering in an Open World
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yinpei Dai
Hao Lang
Yinhe Zheng
Fei Huang
Yongbin Li
VLM
182
10
0
11 May 2023
RECKONING: Reasoning through Dynamic Knowledge Encoding
Neural Information Processing Systems (NeurIPS), 2023
Zeming Chen
Gail Weiss
E. Mitchell
Asli Celikyilmaz
Antoine Bosselut
KELM, LRM
358
15
0
10 May 2023
Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Eshaan Tanwar
Subhabrata Dutta
Manish Borthakur
Tanmoy Chakraborty
232
83
0
10 May 2023
StarCoder: may the source be with you!
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
...
Sean M. Hughes
Thomas Wolf
Arjun Guha
Leandro von Werra
H. D. Vries
515
1,077
0
09 May 2023
The Current State of Summarization
Fabian Retkowski
282
10
0
08 May 2023
How Do In-Context Examples Affect Compositional Generalization?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shengnan An
Zeqi Lin
Qiang Fu
B. Chen
Nanning Zheng
Jian-Guang Lou
Dongmei Zhang
408
70
0
08 May 2023
Improving Cross-Task Generalization with Step-by-Step Instructions
Science China Information Sciences (Sci China Inf Sci), 2023
Yang Wu
Yanyan Zhao
Zhongyang Li
Bing Qin
Kai Xiong
LRM, ALM
143
11
0
08 May 2023
Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Deepak Narayanan
Keshav Santhanam
Peter Henderson
Rishi Bommasani
Tony Lee
Abigail Z. Jacobs
307
4
0
03 May 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
433
940
0
26 Apr 2023
Measuring Massive Multitask Chinese Understanding
Hui Zeng
ALM, ELM, AILaw
152
34
0
25 Apr 2023
Why Does ChatGPT Fall Short in Providing Truthful Answers?
Shen Zheng
Jie Huang
Kevin Chen-Chuan Chang
HILM, AI4MH
498
75
0
20 Apr 2023
LongForm: Effective Instruction Tuning with Reverse Instructions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Abdullatif Köksal
Timo Schick
Anna Korhonen
Hinrich Schütze
SyDa, ALM
287
48
0
17 Apr 2023
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning
Qian Liu
Fan Zhou
Zhengbao Jiang
Longxu Dou
Min Lin
279
18
0
17 Apr 2023
nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales
Yiqun Yao
Siqi Fan
Xiusheng Huang
Xuezhi Fang
Xiang Li
...
Peng Han
Shuo Shang
Kang Liu
Aixin Sun
Yequan Wang
226
8
0
14 Apr 2023
Learning Personalized Decision Support Policies
AAAI Conference on Artificial Intelligence (AAAI), 2023
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
527
12
0
13 Apr 2023
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
Wanjun Zhong
Ruixiang Cui
Yiduo Guo
Yaobo Liang
Shuai Lu
Yanlin Wang
Amin Saied
Weizhu Chen
Nan Duan
ALM, ELM
382
740
0
13 Apr 2023
Can Large Language Models Transform Computational Social Science?
International Conference on Computational Logic (ICCL), 2023
Caleb Ziems
William B. Held
Omar Shaikh
Jiaao Chen
Zhehao Zhang
Diyi Yang
LLMAG
495
440
0
12 Apr 2023
Boosted Prompt Ensembles for Large Language Models
Silviu Pitis
Michael Ruogu Zhang
Andrew Wang
Jimmy Ba
LRM, LLMAG
174
55
0
12 Apr 2023
LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models
Patrik Puchert
Poonam Poonam
Christian van Onzenoodt
Timo Ropinski
157
11
0
02 Apr 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
688
1,170
0
30 Mar 2023
Whose Opinions Do Language Models Reflect?
International Conference on Machine Learning (ICML), 2023
Shibani Santurkar
Esin Durmus
Faisal Ladhak
Cinoo Lee
Abigail Z. Jacobs
Tatsunori Hashimoto
376
653
0
30 Mar 2023
Natural Language Reasoning, A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM, LRM
331
97
0
26 Mar 2023
$k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference
International Conference on Learning Representations (ICLR), 2023
Benfeng Xu
Quan Wang
Zhendong Mao
Yajuan Lyu
Qiaoqiao She
Yongdong Zhang
308
65
0
24 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
International Conference on Machine Learning (ICML), 2023
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
440
7
0
21 Mar 2023
Language Model Behavior: A Comprehensive Survey
International Conference on Computational Logic (ICCL), 2023
Tyler A. Chang
Benjamin Bergen
VLM, LRM, LM&MA
382
143
0
20 Mar 2023
eP-ALM: Efficient Perceptual Augmentation of Language Models
IEEE International Conference on Computer Vision (ICCV), 2023
Mustafa Shukor
Corentin Dancette
Matthieu Cord
MLLM, VLM
427
34
0
20 Mar 2023
Capabilities of GPT-4 on Medical Challenge Problems
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
LM&MA, ELM, AI4MH
480
1,075
0
20 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Computational Linguistics (CL), 2023
Renze Lou
Kai Zhang
Wenpeng Yin
ALM, LRM
858
40
0
18 Mar 2023
Can Generative Pre-trained Transformers (GPT) Pass Assessments in Higher Education Programming Courses?
Jaromír Šavelka
Arav Agarwal
Chris Bogart
Yifan Song
M. Sakr
ELM
183
119
0
16 Mar 2023