ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.03300
  4. Cited By
Measuring Massive Multitask Language Understanding
v1v2v3 (latest)

Measuring Massive Multitask Language Understanding

International Conference on Learning Representations (ICLR), 2020
7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
    ELMRALM
ArXiv (abs)PDFHTMLHuggingFace (3 upvotes)

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 4,481 papers shown
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
Yuxi Liu
Renjia Deng
Yutong He
Xue Wang
Tao Yao
Kun Yuan
143
0
0
28 Oct 2025
Parallel Loop Transformer for Efficient Test-Time Computation Scaling
Parallel Loop Transformer for Efficient Test-Time Computation Scaling
Bohong Wu
Mengzhao Chen
Xiang Luo
Shen Yan
Qifan Yu
...
Hongrui Zhan
Zheng Zhong
Xun Zhou
Siyuan Qiao
Xingyan Bin
116
2
0
28 Oct 2025
Multi-Agent Evolve: LLM Self-Improve through Co-evolution
Multi-Agent Evolve: LLM Self-Improve through Co-evolution
Yixing Chen
Yiding Wang
Siqi Zhu
Haofei Yu
Tao Feng
Muhan Zhang
M. Patwary
Jiaxuan You
LLMAGLRM
295
5
0
27 Oct 2025
Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining
Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining
Xiaofan Zhou
Lu Cheng
CLL
378
0
0
27 Oct 2025
Probing Knowledge Holes in Unlearned LLMs
Probing Knowledge Holes in Unlearned LLMs
Myeongseob Ko
H. Just
Charles Fleming
Ming Jin
R. Jia
MU
302
0
0
27 Oct 2025
A Survey on LLM Mid-Training
A Survey on LLM Mid-Training
Chengying Tu
Xuemiao Zhang
Rongxiang Weng
Rumei Li
Chen Zhang
Yang Bai
Hongfei Yan
Jingang Wang
Xunliang Cai
OffRLLRM
239
2
0
27 Oct 2025
Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks
Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks
Amal Abed
Ivan Lukic
Jorg K. H. Franke
Frank Hutter
SyDaLRM
372
0
0
27 Oct 2025
Knocking-Heads Attention
Knocking-Heads Attention
Zhanchao Zhou
Xiaodong Chen
Haoxing Chen
Zhenzhong Lan
Jianguo Li
95
0
0
27 Oct 2025
PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models
PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models
Patrick Haller
Fabio Barth
Jonas Golde
Georg Rehm
Alan Akbik
LRM
373
0
0
27 Oct 2025
Offline Preference Optimization via Maximum Marginal Likelihood Estimation
Offline Preference Optimization via Maximum Marginal Likelihood Estimation
Saeed Najafi
Alona Fyshe
OffRL
144
0
0
27 Oct 2025
Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization
Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization
Yijia Fan
Jusheng Zhang
Jing Yang
Keze Wang
LLMAG
100
1
0
26 Oct 2025
Frustratingly Easy Task-aware Pruning for Large Language Models
Frustratingly Easy Task-aware Pruning for Large Language Models
Yuanhe Tian
Junjie Liu
Xican Yang
Haishan Ye
Yan Song
136
1
0
26 Oct 2025
TELL-TALE: Task Efficient LLMs with Task Aware Layer Elimination
TELL-TALE: Task Efficient LLMs with Task Aware Layer Elimination
Omar Naim
Krish Sharma
Nicholas M. Asher
88
0
0
26 Oct 2025
Backward-Friendly Optimization: Training Large Language Models with Approximate Gradients under Memory Constraints
Backward-Friendly Optimization: Training Large Language Models with Approximate Gradients under Memory Constraints
Jing Yang
Kaitong Cai
Yijia Fan
Yufeng Yang
Keze Wang
123
0
0
26 Oct 2025
Leveraging Large Language Models to Identify Conversation Threads in Collaborative Learning
Leveraging Large Language Models to Identify Conversation Threads in Collaborative Learning
Prerna Ravi
Dong Won Lee
Beatriz Flamia
Jasmine David
Brandon Hanks
C. Breazeal
Emma Anderson
Grace Lin
96
0
0
26 Oct 2025
SeeDNorm: Self-Rescaled Dynamic Normalization
SeeDNorm: Self-Rescaled Dynamic Normalization
Wenrui Cai
Defa Zhu
Qingjie Liu
Qiyang Min
145
0
0
26 Oct 2025
Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks
Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks
Peiyu Li
Xiuxiu Tang
Si Chen
Ying Cheng
Ronald A Metoyer
Ting Hua
Nitesh Chawla
65
1
0
26 Oct 2025
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
Jinzhe Liu
Junshu Sun
Shufan Shen
Chenxue Yang
Shuhui Wang
KELMCLL
352
1
0
25 Oct 2025
When Fewer Layers Break More Chains: Layer Pruning Harms Test-Time Scaling in LLMs
When Fewer Layers Break More Chains: Layer Pruning Harms Test-Time Scaling in LLMs
Keyu Wang
Tian Lyu
Guinan Su
Jonas Geiping
L. Yin
Marco Canini
Shiwei Liu
LRM
117
1
0
25 Oct 2025
The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models
The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models
Yao Lu
Yuqi Li
Wenbin Xie
Shanqing Yu
Qi Xuan
Zhaowei Zhu
Shiping Wen
85
1
0
25 Oct 2025
Model-Aware Tokenizer Transfer
Model-Aware Tokenizer Transfer
Mykola Haltiuk
Aleksander Smywiński-Pohl
120
0
0
24 Oct 2025
A Diagnostic Benchmark for Sweden-Related Factual Knowledge
A Diagnostic Benchmark for Sweden-Related Factual Knowledge
Jenny Kunz
HILM
179
0
0
24 Oct 2025
$δ$-STEAL: LLM Stealing Attack with Local Differential Privacy
δδδ-STEAL: LLM Stealing Attack with Local Differential Privacy
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
R. Jin
Abdallah Khreishah
AAML
132
1
0
24 Oct 2025
Transformer Based Linear Attention with Optimized GPU Kernel Implementation
Transformer Based Linear Attention with Optimized GPU Kernel Implementation
Armin Gerami
R. Duraiswami
143
0
0
24 Oct 2025
Risk Management for Mitigating Benchmark Failure Modes: BenchRisk
Risk Management for Mitigating Benchmark Failure Modes: BenchRisk
Sean McGregor
Victor Lu
Vassil Tashev
Armstrong Foundjem
Aishwarya Ramasethu
...
Chris Knotz
Kongtao Chen
Alicia Parrish
Anka Reuel
Heather Frase
145
0
0
24 Oct 2025
Model Merging with Functional Dual Anchors
Model Merging with Functional Dual Anchors
Kexuan Shi
Yandong Wen
Weiyang Liu
MoMe
272
0
0
24 Oct 2025
Estonian Native Large Language Model Benchmark
Estonian Native Large Language Model Benchmark
Helena Grete Lillepalu
Tanel Alumäe
ELM
116
0
0
24 Oct 2025
Chain of Execution Supervision Promotes General Reasoning in Large Language Models
Chain of Execution Supervision Promotes General Reasoning in Large Language Models
Nuo Chen
Zehua Li
Keqin Bao
Junyang Lin
Dayiheng Liu
LLMAGLRM
118
0
0
24 Oct 2025
On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?
On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?
Mingmeng Geng
Thierry Poibeau
DeLMO
217
0
0
23 Oct 2025
Robust Preference Alignment via Directional Neighborhood Consensus
Robust Preference Alignment via Directional Neighborhood Consensus
Ruochen Mao
Yuling Shi
Xiaodong Gu
Jiaheng Wei
173
0
0
23 Oct 2025
\textsc{CantoNLU}: A benchmark for Cantonese natural language understanding
\textsc{CantoNLU}: A benchmark for Cantonese natural language understanding
Junghyun Min
York Hay Ng
Sophia Chan
Helena Shunhua Zhao
En-Shiun Annie Lee
ELM
120
0
0
23 Oct 2025
What Does It Take to Build a Performant Selective Classifier?
What Does It Take to Build a Performant Selective Classifier?
Stephan Rabanser
Nicolas Papernot
210
0
0
23 Oct 2025
Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs
Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs
Yanlin Song
Ben Liu
Víctor Gutiérrez-Basulto
Zhiwei Hu
Qianqian Xie
Min Peng
Sophia Ananiadou
Jeff Z. Pan
RALMReLMLRM
279
0
0
23 Oct 2025
LM-mixup: Text Data Augmentation via Language Model based Mixup
LM-mixup: Text Data Augmentation via Language Model based Mixup
Zhijie Deng
Zhouan Shen
Ling Li
Yao Zhou
Zhaowei Zhu
Yanji He
Wei Wang
Jiaheng Wei
98
0
0
23 Oct 2025
Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks
Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks
Javier Marín
84
0
0
23 Oct 2025
ResearchGPT: Benchmarking and Training LLMs for End-to-End Computer Science Research Workflows
ResearchGPT: Benchmarking and Training LLMs for End-to-End Computer Science Research Workflows
Penghao Wang
Yuhao Zhou
Mengxuan Wu
Ziheng Qin
Bangyuan Zhu
...
J. Yang
Zheng Zhu
Tianlong Chen
Zinan Lin
Kai Wang
LLMAGAI4TSALMVLM
328
0
0
23 Oct 2025
The Dog the Cat Chased Stumped the Model: Measuring When Language Models Abandon Structure for Shortcuts
The Dog the Cat Chased Stumped the Model: Measuring When Language Models Abandon Structure for Shortcuts
Sangmitra Madhusudan
Kaige Chen
Ali Emami
ELMLRM
119
0
0
23 Oct 2025
DiSRouter: Distributed Self-Routing for LLM Selections
DiSRouter: Distributed Self-Routing for LLM Selections
Hang Zheng
Hongshen Xu
Yongkai Lin
Shuai Fan
Lu Chen
Kai Yu
132
1
0
22 Oct 2025
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts
S. S. Wang
Gaokai Zhang
Li Zhang
Ning Shang
Fan Yang
Dongyao Chen
M. Yang
OffRLRALMReLMLRM
241
0
0
22 Oct 2025
What is the Best Sequence Length for BABYLM?
What is the Best Sequence Length for BABYLM?
Suchir Salhan
Richard Diehl Martinez
Zébulon Goriely
P. Buttery
103
2
0
22 Oct 2025
Data-Centric Lessons To Improve Speech-Language Pretraining
Data-Centric Lessons To Improve Speech-Language Pretraining
Vishaal Udandarao
Zhiyun Lu
Xuankai Chang
Yongqiang Wang
Violet Z. Yao
Albin Madapally Jose
Fartash Faghri
Josh Gardner
Chung-Cheng Chiu
136
0
0
22 Oct 2025
LLM Unlearning with LLM Beliefs
LLM Unlearning with LLM Beliefs
Kemou Li
Qizhou Wang
Y. Wang
Fengpeng Li
Jun Liu
Bo Han
Jiantao Zhou
MUKELM
201
1
0
22 Oct 2025
Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges
Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges
Cheng Huang
Nyima Tashi
Fan Gao
Yutong Liu
J. Li
...
Guojie Tang
Xiangxiang Wang
Jia Zhang
Tsengdar J. Lee
Yongbin Yu
116
0
0
22 Oct 2025
Beyond MedQA: Towards Real-world Clinical Decision Making in the Era of LLMs
Beyond MedQA: Towards Real-world Clinical Decision Making in the Era of LLMs
Yunpeng Xiao
Carl Yang
Mark Mai
Xiao Hu
Kai Shu
LM&MAELM
264
0
0
22 Oct 2025
Teaming LLMs to Detect and Mitigate Hallucinations
Teaming LLMs to Detect and Mitigate Hallucinations
Demian Till
John Smeaton
Peter Haubrick
Gouse Saheb
Florian Graef
David Berman
HILM
319
0
0
22 Oct 2025
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
Guanzhong He
Zhen Yang
Jinxin Liu
Bin Xu
Lei Hou
Juanzi Li
105
1
0
21 Oct 2025
From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering
From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering
Lei Li
Xiao Zhou
Y. Zhang
X. Wu
RALMMedIm
156
0
0
21 Oct 2025
Investigating LLM Capabilities on Long Context Comprehension for Medical Question Answering
Investigating LLM Capabilities on Long Context Comprehension for Medical Question Answering
Feras AlMannaa
Talia Tseriotou
Jenny Chim
Maria Liakata
ELM
191
0
0
21 Oct 2025
Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation
Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation
Giovanni De Muri
Mark Vero
Robin Staab
Martin Vechev
155
0
0
21 Oct 2025
Some Attention is All You Need for Retrieval
Some Attention is All You Need for Retrieval
Felix Michalak
Steven Abreu
89
0
0
21 Oct 2025
Previous
123456...888990
Next