Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2009.03300
Cited By
v1
v2
v3 (latest)
Measuring Massive Multitask Language Understanding
International Conference on Learning Representations (ICLR), 2020
7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (3 upvotes)
Papers citing
"Measuring Massive Multitask Language Understanding"
50 / 4,486 papers shown
Should We Attend More or Less? Modulating Attention for Fairness
A. Zayed
Gonçalo Mordido
Samira Shabanian
Sarath Chandar
266
15
0
22 May 2023
RWKV: Reinventing RNNs for the Transformer Era
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bo Peng
Eric Alcaide
Quentin G. Anthony
Alon Albalak
Samuel Arcadinho
...
Qihang Zhao
P. Zhou
Qinghua Zhou
Jian Zhu
Rui-Jie Zhu
598
873
0
22 May 2023
Iterative Forward Tuning Boosts In-Context Learning in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jiaxi Yang
Binyuan Hui
Min Yang
Bailin Wang
Bowen Li
Binhua Li
Fei Huang
Yongbin Li
285
19
0
22 May 2023
ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Dongfang Li
Jindi Yu
Baotian Hu
Zhenran Xu
Hao Fei
ELM
182
15
0
22 May 2023
Meta-in-context learning in large language models
Neural Information Processing Systems (NeurIPS), 2023
Julian Coda-Forno
Marcel Binz
Zeynep Akata
M. Botvinick
Jane X. Wang
Eric Schulz
LRM
455
60
0
22 May 2023
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting
International Conference on Learning Representations (ICLR), 2023
Xinlu Zhang
Shiyang Li
Xianjun Yang
Chenxin Tian
Yao Qin
Linda R. Petzold
282
13
0
22 May 2023
Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Linyuan Gong
Chenyan Xiong
Xiaodong Liu
Payal Bajaj
Yiqing Xie
Alvin Cheung
Jianfeng Gao
Xia Song
VLM
AI4CE
162
2
0
21 May 2023
Evaluating the Performance of Large Language Models on GAOKAO Benchmark
Xiaotian Zhang
Chun-yan Li
Yi Zong
Zhengyu Ying
Liang He
Xipeng Qiu
ALM
ELM
391
167
0
21 May 2023
VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models
Dao Xuan-Quy
Le Ngoc-Bich
Vo The-Duy
Phan Xuan-Dung
Ngo Bac-Bien
Nguyen Van-Tien
Nguyen Thi-My-Thanh
Nguyen Hong-Phuoc
145
22
0
20 May 2023
Evaluation of medium-large Language Models at zero-shot closed book generative question answering
Artificial Intelligence and Applications (AIA), 2023
René Peinl
Johannes Wirth
ELM
232
8
0
19 May 2023
Prompting with Pseudo-Code Instructions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mayank Mishra
Praveen Venkateswaran
Riyaz Ahmad Bhat
V. Rudramurthy
Danish Contractor
Srikanth G. Tamilselvam
342
17
0
19 May 2023
Separating form and meaning: Using self-consistency to quantify task understanding across multiple senses
IEEE Games Entertainment Media Conference (IEEE GEM), 2023
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
LRM
309
17
0
19 May 2023
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kai Xiong
Xiao Ding
Yixin Cao
Ting Liu
Bing Qin
552
119
0
19 May 2023
Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt
Zhaozhuo Xu
Zirui Liu
Beidi Chen
Yuxin Tang
Jue Wang
Kaixiong Zhou
Helen Zhou
Anshumali Shrivastava
MQ
252
39
0
17 May 2023
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Linhao Yu
Tianyu Dong
...
Peiyi Zhang
Qingqing Lyu
Xiaowen Su
Qun Liu
Deyi Xiong
ELM
ALM
300
32
0
17 May 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
International Conference on Learning Representations (ICLR), 2023
Shangbin Feng
Weijia Shi
Yuyang Bai
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
KELM
396
50
0
17 May 2023
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
Neural Information Processing Systems (NeurIPS), 2023
Yuzhen Huang
Yuzhuo Bai
Zhihao Zhu
Junlei Zhang
Jinghan Zhang
...
Yikai Zhang
Jiayi Lei
Yao Fu
Maosong Sun
Junxian He
ELM
LRM
426
751
0
15 May 2023
Symbol tuning improves in-context learning in language models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jerry W. Wei
Le Hou
Andrew Kyle Lampinen
Xiangning Chen
Da Huang
...
Xinyun Chen
Yifeng Lu
Denny Zhou
Tengyu Ma
Quoc V. Le
LRM
341
103
0
15 May 2023
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haoyang Huang
Tianyi Tang
Dongdong Zhang
Wayne Xin Zhao
Ting Song
Yan Xia
Furu Wei
LRM
350
228
0
11 May 2023
Active Retrieval Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhengbao Jiang
Frank F. Xu
Luyu Gao
Zhiqing Sun
Qian Liu
Jane Dwivedi-Yu
Yiming Yang
Jamie Callan
Graham Neubig
RALM
405
508
0
11 May 2023
Taking Advice from ChatGPT
Peter Zhang
281
5
0
11 May 2023
Long-Tailed Question Answering in an Open World
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yinpei Dai
Hao Lang
Yinhe Zheng
Fei Huang
Yongbin Li
VLM
182
10
0
11 May 2023
RECKONING: Reasoning through Dynamic Knowledge Encoding
Neural Information Processing Systems (NeurIPS), 2023
Zeming Chen
Gail Weiss
E. Mitchell
Asli Celikyilmaz
Antoine Bosselut
KELM
LRM
358
15
0
10 May 2023
Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Eshaan Tanwar
Subhabrata Dutta
Manish Borthakur
Tanmoy Chakraborty
232
83
0
10 May 2023
StarCoder: may the source be with you!
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
...
Sean M. Hughes
Thomas Wolf
Arjun Guha
Leandro von Werra
H. D. Vries
515
1,077
0
09 May 2023
The Current State of Summarization
Fabian Retkowski
282
10
0
08 May 2023
How Do In-Context Examples Affect Compositional Generalization?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shengnan An
Zeqi Lin
Qiang Fu
B. Chen
Nanning Zheng
Jian-Guang Lou
Dongmei Zhang
408
70
0
08 May 2023
Improving Cross-Task Generalization with Step-by-Step Instructions
Science China Information Sciences (Sci China Inf Sci), 2023
Yang Wu
Yanyan Zhao
Zhongyang Li
Bing Qin
Kai Xiong
LRM
ALM
143
11
0
08 May 2023
Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Deepak Narayanan
Keshav Santhanam
Peter Henderson
Rishi Bommasani
Tony Lee
Abigail Z. Jacobs
307
4
0
03 May 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
433
940
0
26 Apr 2023
Measuring Massive Multitask Chinese Understanding
Hui Zeng
ALM
ELM
AILaw
152
34
0
25 Apr 2023
Why Does ChatGPT Fall Short in Providing Truthful Answers?
Shen Zheng
Jie Huang
Kevin Chen-Chuan Chang
HILM
AI4MH
498
75
0
20 Apr 2023
LongForm: Effective Instruction Tuning with Reverse Instructions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Abdullatif Köksal
Timo Schick
Anna Korhonen
Hinrich Schütze
SyDa
ALM
287
48
0
17 Apr 2023
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning
Qian Liu
Fan Zhou
Zhengbao Jiang
Longxu Dou
Min Lin
279
18
0
17 Apr 2023
nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales
Yiqun Yao
Siqi Fan
Xiusheng Huang
Xuezhi Fang
Xiang Li
...
Peng Han
Shuo Shang
Kang Liu
Aixin Sun
Yequan Wang
226
8
0
14 Apr 2023
Learning Personalized Decision Support Policies
AAAI Conference on Artificial Intelligence (AAAI), 2023
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
527
12
0
13 Apr 2023
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
Wanjun Zhong
Ruixiang Cui
Yiduo Guo
Yaobo Liang
Shuai Lu
Yanlin Wang
Amin Saied
Weizhu Chen
Nan Duan
ALM
ELM
382
740
0
13 Apr 2023
Can Large Language Models Transform Computational Social Science?
International Conference on Computational Logic (ICCL), 2023
Caleb Ziems
William B. Held
Omar Shaikh
Jiaao Chen
Zhehao Zhang
Diyi Yang
LLMAG
495
440
0
12 Apr 2023
Boosted Prompt Ensembles for Large Language Models
Silviu Pitis
Michael Ruogu Zhang
Andrew Wang
Jimmy Ba
LRM
LLMAG
174
55
0
12 Apr 2023
LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models
Patrik Puchert
Poonam Poonam
Christian van Onzenoodt
Timo Ropinski
157
11
0
02 Apr 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
688
1,170
0
30 Mar 2023
Whose Opinions Do Language Models Reflect?
International Conference on Machine Learning (ICML), 2023
Shibani Santurkar
Esin Durmus
Faisal Ladhak
Cinoo Lee
Abigail Z. Jacobs
Tatsunori Hashimoto
376
653
0
30 Mar 2023
Natural Language Reasoning, A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM
LRM
331
97
0
26 Mar 2023
k
k
k
NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference
International Conference on Learning Representations (ICLR), 2023
Benfeng Xu
Quan Wang
Zhendong Mao
Yajuan Lyu
Qiaoqiao She
Yongdong Zhang
308
65
0
24 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
International Conference on Machine Learning (ICML), 2023
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
440
7
0
21 Mar 2023
Language Model Behavior: A Comprehensive Survey
International Conference on Computational Logic (ICCL), 2023
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
382
143
0
20 Mar 2023
eP-ALM: Efficient Perceptual Augmentation of Language Models
IEEE International Conference on Computer Vision (ICCV), 2023
Mustafa Shukor
Corentin Dancette
Matthieu Cord
MLLM
VLM
427
34
0
20 Mar 2023
Capabilities of GPT-4 on Medical Challenge Problems
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
LM&MA
ELM
AI4MH
480
1,075
0
20 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Computational Linguistics (CL), 2023
Renze Lou
Kai Zhang
Wenpeng Yin
ALM
LRM
858
40
0
18 Mar 2023
Can Generative Pre-trained Transformers (GPT) Pass Assessments in Higher Education Programming Courses?
Jaromír Šavelka
Arav Agarwal
Chris Bogart
Yifan Song
M. Sakr
ELM
183
119
0
16 Mar 2023
Previous
1
2
3
...
87
88
89
90
Next
Page 88 of 90
Page
of 90
Go