Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.05457
Cited By
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
14 March 2018
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"
50 / 1,882 papers shown
Title
Stratum: System-Hardware Co-Design with Tiered Monolithic 3D-Stackable DRAM for Efficient MoE Serving
Yue Pan
Zihan Xia
Po-Kai Hsu
Lanxiang Hu
Hyungyo Kim
...
Minxuan Zhou
Nam Sung Kim
Shimeng Yu
Tajana Rosing
Mingu Kang
MoE
72
0
0
06 Oct 2025
What Makes Diffusion Language Models Super Data Learners?
Zitian Gao
Haoming Luo
Lynx Chen
Jason Klein Liu
Ran Tao
Joey Zhou
Bryan Dai
72
0
0
05 Oct 2025
AgriGPT-VL: Agricultural Vision-Language Understanding Suite
Bo Yang
Yunkui Chen
Lanfei Feng
Y. Zhang
Xiao-Qiang Xu
...
Nueraili Aierken
Runhe Huang
Hongjian Lin
Yibin Ying
Shijian Li
VLM
247
3
0
05 Oct 2025
Measuring Language Model Hallucinations Through Distributional Correctness
Thomas F Burns
HILM
ELM
149
0
0
05 Oct 2025
The Debate on RLVR Reasoning Capability Boundary: Shrinkage, Expansion, or Both? A Two-Stage Dynamic View
Xinhao Yao
Lu Yu
Xiaolin Hu
Fengwei Teng
Qing Cui
Jun Zhou
Yong Liu
LRM
137
0
0
05 Oct 2025
SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading
Yuanzhe Shen
Y. Liu
Zisu Huang
Ruicheng Yin
Xiaoqing Zheng
Xuanjing Huang
84
1
0
04 Oct 2025
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Jiaxi Li
Yucheng Shi
Jin Lu
Ninghao Liu
LRM
120
0
0
04 Oct 2025
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Tianyu Fu
Zihan Min
Hanling Zhang
Jichao Yan
Guohao Dai
Wanli Ouyang
Yu Wang
92
1
0
03 Oct 2025
OpenStaxQA: A multilingual dataset based on open-source college textbooks
Pranav Gupta
49
0
0
03 Oct 2025
TravelBench : Exploring LLM Performance in Low-Resource Domains
Srinivas Billa
Xiaonan Jing
72
0
0
03 Oct 2025
Leave No TRACE: Black-box Detection of Copyrighted Dataset Usage in Large Language Models via Watermarking
Jingqi Zhang
Ruibo Chen
Yingqing Yang
Peihua Mai
Heng Huang
Yan Pang
WaLM
152
3
0
03 Oct 2025
Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization
Kezhao Liu
Jason Klein Liu
Mingtao Chen
Yiming Liu
OffRL
59
2
0
02 Oct 2025
More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration
Xiaoyang Yuan
Yujuan Ding
Yi Bin
Wenqi Shao
Jinyu Cai
Jingkuan Song
Yang Yang
H. Shen
LRM
139
0
1
02 Oct 2025
Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models
Shaoan Xie
Lingjing Kong
Xiangchen Song
Xinshuai Dong
Guangyi Chen
Eric P.Xing
Kun Zhang
LRM
97
3
0
02 Oct 2025
The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM
Kwanhee Lee
Hyeondo Jang
Dongyeop Lee
Dan Alistarh
Namhoon Lee
72
1
0
02 Oct 2025
ExGRPO: Learning to Reason from Experience
Runzhe Zhan
Yafu Li
Zhi Wang
Xiaoye Qu
Dongrui Liu
Jing Shao
Derek F. Wong
Yu Cheng
OffRL
LRM
117
1
1
02 Oct 2025
The Data-Quality Illusion: Rethinking Classifier-Based Quality Filtering for LLM Pretraining
Thiziri Nait Saada
Louis Béthune
Michal Klein
David Grangier
Marco Cuturi
Pierre Ablin
110
0
0
01 Oct 2025
When Silence Matters: The Impact of Irrelevant Audio on Text Reasoning in Large Audio-Language Models
Chen-An Li
Tzu-Han Lin
Hung-yi Lee
AuLLM
132
0
0
01 Oct 2025
Hearing the Order: Investigating Selection Bias in Large Audio-Language Models
Yu-Xiang Lin
Chen-An Li
Sheng-Lun Wei
Po-Chun Chen
Hsin-Hsi Chen
Hung-yi Lee
108
0
0
01 Oct 2025
Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability
Shojiro Yamabe
Jun Sakuma
AAML
116
0
0
01 Oct 2025
Composer: A Search Framework for Hybrid Neural Architecture Design
Bilge Acun
Prasoon Sinha
Newsha Ardalani
Sangmin Bae
Alicia Golden
Chien-Yu Lin
Meghana Madhyastha
Fei Sun
N. Yadwadkar
Carole-Jean Wu
200
1
0
01 Oct 2025
Fine-Tuning Jailbreaks under Highly Constrained Black-Box Settings: A Three-Pronged Approach
X. Li
Y. Wang
Bo Li
AAML
201
0
0
01 Oct 2025
Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning
Yicheng Lang
Yihua Zhang
Chongyu Fan
Changsheng Wang
Jinghan Jia
Sijia Liu
MU
337
0
0
01 Oct 2025
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training
Junlin Han
Shengbang Tong
David Fan
Yufan Ren
Koustuv Sinha
Juil Sock
Filippos Kokkinos
LRM
VLM
155
6
0
30 Sep 2025
CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models
Weiyu Huang
Yuezhou Hu
Jun Zhu
Jianfei Chen
CLL
88
0
0
30 Sep 2025
LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts
Yuan Zhuang
Yi Shen
Yuexin Bian
Qing Su
Shihao Ji
Yuanyuan Shi
Fei Miao
MoE
MoMe
200
1
0
30 Sep 2025
OPPO: Accelerating PPO-based RLHF via Pipeline Overlap
Kaizhuo Yan
Yingjie Yu
Yifan Yu
Haizhong Zheng
Fan Lai
VLM
84
0
0
30 Sep 2025
Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization
Yaoxiang Wang
Qingguo Hu
Yucheng Ding
Ruizhe Wang
Yeyun Gong
Jian Jiao
Yelong Shen
Peng Cheng
Jinsong Su
MoE
60
0
0
30 Sep 2025
Diversity-Incentivized Exploration for Versatile Reasoning
Zican Hu
Shilin Zhang
Yafu Li
Jianhao Yan
Xuyang Hu
Leyang Cui
Xiaoye Qu
C. L. Philip Chen
Yu Cheng
Zhi Wang
LRM
131
2
0
30 Sep 2025
Layer-wise dynamic rank for compressing large language models
Zhendong Mi
Bian Sun
Grace Li Zhang
Shaoyi Huang
ALM
144
0
0
30 Sep 2025
The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks
Arda Uzunoglu
Tianjian Li
Daniel Khashabi
140
0
0
30 Sep 2025
Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel
Chuanyang Zheng
Jiankai Sun
Yihang Gao
Enze Xie
Yuehao Wang
...
Kashif Rasul
Mac Schwager
Anderson Schneider
Zinan Lin
Yuriy Nevmyvaka
MoE
170
2
0
30 Sep 2025
Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems
Aakriti Agrawal
R. Aralikatti
Anirudh Satheesh
Souradip Chakraborty
Amrit Singh Bedi
Furong Huang
LRM
96
1
0
30 Sep 2025
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Huu Nguyen
Victor May
Harsh Raj
Marianna Nezhurina
Yishan Wang
...
Aleksandra Krasnodębska
Christoph Schuhmann
Mats Leon Richter
Xuan-Son
J. Jitsev
163
1
0
29 Sep 2025
Query Circuits: Explaining How Language Models Answer User Prompts
Tung-Yu Wu
Fazl Barez
ReLM
LRM
121
0
0
29 Sep 2025
Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models
Jitai Hao
Hao Liu
Xinyan Xiao
Qiang Huang
Jun Yu
160
0
0
29 Sep 2025
Beyond Repetition: Text Simplification and Curriculum Learning for Data-Constrained Pretraining
M. R
Dan John Velasco
73
0
0
29 Sep 2025
Rethinking Parameter Sharing for LLM Fine-Tuning with Multiple LoRAs
Hao Ban
Kaiyi Ji
MoE
161
0
0
29 Sep 2025
Expanding Computation Spaces of LLMs at Inference Time
Yoonna Jang
Kisu Yang
Isabelle Augenstein
LLMAG
ReLM
LRM
56
0
0
29 Sep 2025
SpecExit: Accelerating Large Reasoning Model via Speculative Exit
Rubing Yang
Huajun Bai
Song Liu
Guanghua Yu
Runzhi Fan
Yanbin Dang
Jiejing Zhang
Kai Liu
Jianchen Zhu
Peng Chen
ReLM
LRM
87
0
0
29 Sep 2025
Pretraining with hierarchical memories: separating long-tail and common knowledge
Hadi Pouransari
David Grangier
C Thomas
Michael Kirchhof
Oncel Tuzel
RALM
KELM
211
1
0
29 Sep 2025
Conda: Column-Normalized Adam for Training Large Language Models Faster
Junjie Wang
Pan Zhou
Yiming Dong
Huan Li
Jia Li
Xun Zhou
Qicheng Lao
Cong Fang
Zhouchen Lin
AI4CE
204
0
0
29 Sep 2025
LLM DNA: Tracing Model Evolution via Functional Representations
Zhaomin Wu
Haodong Zhao
Ziyang Wang
Jizhou Guo
Qian Wang
Bingsheng He
108
1
0
29 Sep 2025
CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task
Haosi Mo
Xinyu Ma
Xuebo Liu
Yang Li
Yu Li
Jie Liu
Min Zhang
ELM
114
0
0
29 Sep 2025
From Score Distributions to Balance: Plug-and-Play Mixture-of-Experts Routing
Rana Shahout
Colin Cai
Yilun Du
Minlan Yu
Michael Mitzenmacher
MoE
MoMe
135
2
0
29 Sep 2025
Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models
Yuhui Wang
Changjiang Li
Guangke Chen
Jiacheng Liang
Ting Wang
ReLM
KELM
LRM
85
1
0
29 Sep 2025
Short window attention enables long-term memorization
Loic Cabannes
Maximilian Beck
Gergely Szilvasy
Matthijs Douze
Maria Lomeli
Jade Copet
Pierre-Emmanuel Mazaré
Gabriel Synnaeve
Hervé Jégou
116
1
0
29 Sep 2025
Beyond Benchmarks: Understanding Mixture-of-Experts Models through Internal Mechanisms
Jiahao Ying
Mingbao Lin
Qianru Sun
Yixin Cao
MoE
36
0
0
28 Sep 2025
ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference
Haojie Ouyang
Jianwei Lv
Lei Ren
Chen Wei
Xiaojie Wang
Fangxiang Feng
VLM
132
0
0
28 Sep 2025
Sequential Diffusion Language Models
Yangzhou Liu
Yue Cao
Hao-Wen Li
Gen Luo
Z. Chen
...
Yuqiang Li
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
80
4
0
28 Sep 2025
Previous
1
2
3
...
5
6
7
...
36
37
38
Next