Measuring Massive Multitask Language Understanding

International Conference on Learning Representations (ICLR), 2020

7 September 2020

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 4,486 papers shown

Should We Attend More or Less? Modulating Attention for Fairness

266

22 May 2023

RWKV: Reinventing RNNs for the Transformer Era

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

...

Rui-Jie Zhu

598

873

22 May 2023

Iterative Forward Tuning Boosts In-Context Learning in Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Jiaxi Yang

Binyuan Hui

Min Yang

Bailin Wang

Bowen Li

Binhua Li

Fei Huang

Yongbin Li

285

22 May 2023

ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Baotian Hu

182

22 May 2023

Meta-in-context learning in large language models

Neural Information Processing Systems (NeurIPS), 2023

455

22 May 2023

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

International Conference on Learning Representations (ICLR), 2023

282

22 May 2023

Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Xiaodong Liu

Xia Song

162

21 May 2023

Evaluating the Performance of Large Language Models on GAOKAO Benchmark

Xipeng Qiu

391

167

21 May 2023

VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

145

20 May 2023

Evaluation of medium-large Language Models at zero-shot closed book generative question answering

Artificial Intelligence and Applications (AIA), 2023

René Peinl

Johannes Wirth

ELM

232

19 May 2023

Prompting with Pseudo-Code Instructions

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Mayank Mishra

Praveen Venkateswaran

Riyaz Ahmad Bhat

V. Rudramurthy

Danish Contractor

Srikanth G. Tamilselvam

342

19 May 2023

Separating form and meaning: Using self-consistency to quantify task understanding across multiple senses

IEEE Games Entertainment Media Conference (IEEE GEM), 2023

309

19 May 2023

Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

552

119

19 May 2023

Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt

Anshumali Shrivastava

252

17 May 2023

M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models

...

Qun Liu

300

17 May 2023

Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models

International Conference on Learning Representations (ICLR), 2023

Shangbin Feng

Weijia Shi

Yuyang Bai

Vidhisha Balachandran

Tianxing He

Yulia Tsvetkov

KELM

396

17 May 2023

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

Neural Information Processing Systems (NeurIPS), 2023

...

Maosong Sun

426

751

15 May 2023

Symbol tuning improves in-context learning in language models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

...

341

103

15 May 2023

Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

350

228

11 May 2023

Active Retrieval Augmented Generation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Graham Neubig

405

508

11 May 2023

Taking Advice from ChatGPT

Peter Zhang

281

11 May 2023

Long-Tailed Question Answering in an Open World

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Fei Huang

182

11 May 2023

RECKONING: Reasoning through Dynamic Knowledge Encoding

Neural Information Processing Systems (NeurIPS), 2023

358

10 May 2023

Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

232

10 May 2023

StarCoder: may the source be with you!

Niklas Muennighoff

...

515

1,077

09 May 2023

The Current State of Summarization

Fabian Retkowski

282

08 May 2023

How Do In-Context Examples Affect Compositional Generalization?

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

408

08 May 2023

Improving Cross-Task Generalization with Step-by-Step Instructions

Science China Information Sciences (Sci China Inf Sci), 2023

143

08 May 2023

Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs

Deepak Narayanan

307

03 May 2023

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

ACM Transactions on Knowledge Discovery from Data (TKDD), 2023

433

940

26 Apr 2023

Measuring Massive Multitask Chinese Understanding

Hui Zeng

ALM ELM AILaw

152

25 Apr 2023

Why Does ChatGPT Fall Short in Providing Truthful Answers?

Shen Zheng

Jie Huang

Kevin Chen-Chuan Chang

HILM AI4MH

498

20 Apr 2023

LongForm: Effective Instruction Tuning with Reverse InstructionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Abdullatif Köksal

287

17 Apr 2023

From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning

279

17 Apr 2023

nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales

Xuezhi Fang

...

Kang Liu

226

14 Apr 2023

Learning Personalized Decision Support Policies

AAAI Conference on Artificial Intelligence (AAAI), 2023

Umang Bhatt

Valerie Chen

Katherine M. Collins

Parameswaran Kamalaruban

527

13 Apr 2023

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models

382

740

13 Apr 2023

Can Large Language Models Transform Computational Social Science?

International Conference on Computational Logic (ICCL), 2023

Jiaao Chen

Diyi Yang

495

440

12 Apr 2023

Boosted Prompt Ensembles for Large Language Models

Silviu Pitis

Michael Ruogu Zhang

Andrew Wang

Jimmy Ba

LRM LLMAG

174

12 Apr 2023

LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models

Patrik Puchert

Poonam Poonam

Christian van Onzenoodt

Timo Ropinski

157

02 Apr 2023

BloombergGPT: A Large Language Model for Finance

688

1,170

30 Mar 2023

Whose Opinions Do Language Models Reflect?

International Conference on Machine Learning (ICML), 2023

Esin Durmus

Tatsunori Hashimoto

376

653

30 Mar 2023

Natural Language Reasoning, A Survey

ACM Computing Surveys (ACM Comput. Surv.), 2023

Hongbo Zhang

331

26 Mar 2023

$k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference

International Conference on Learning Representations (ICLR), 2023

Benfeng Xu

308

24 Mar 2023

Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

International Conference on Machine Learning (ICML), 2023

440

21 Mar 2023

Language Model Behavior: A Comprehensive Survey

International Conference on Computational Logic (ICCL), 2023

Tyler A. Chang

Benjamin Bergen

VLM LRM LM&MA

382

143

20 Mar 2023

eP-ALM: Efficient Perceptual Augmentation of Language Models

IEEE International Conference on Computer Vision (ICCV), 2023

427

20 Mar 2023

Capabilities of GPT-4 on Medical Challenge Problems

480

1,075

20 Mar 2023

Large Language Model Instruction Following: A Survey of Progresses and Challenges

Computational Linguistics (CL), 2023

858

18 Mar 2023

Can Generative Pre-trained Transformers (GPT) Pass Assessments in Higher Education Programming Courses?

183

119

16 Mar 2023