Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.05457
Cited By
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
14 March 2018
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"
50 / 1,910 papers shown
COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens
Eugene Kwek
Wenpeng Yin
VLM
265
0
0
08 Sep 2025
LoaQ: Layer-wise Output Approximation Quantization
Li Lin
Xiaojun Wan
MQ
90
1
0
08 Sep 2025
Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding
Ziheng Li
Guoqing Liu
Jinman Zhao
Erxue Min
Yongcheng Zeng
...
Hengyi Cai
Shuaiqiang Wang
D. Yin
Xu Chen
Zhi-Hong Deng
LRM
116
3
0
08 Sep 2025
Ban&Pick: Ehancing Performance and Efficiency of MoE-LLMs via Smarter Routing
Yuanteng Chen
Peisong Wang
Yuantian Shao
Nanxin Zeng
Chang Xu
Jian Cheng
MoE
178
0
0
08 Sep 2025
Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian
Michael Hoffmann
Jophin John
Stefan Schweter
Gokul Ramakrishnan
Hoi-Fong Mak
Alice Zhang
Dmitry Gaynullin
Nicolay J. Hammer
CLL
162
1
0
06 Sep 2025
Hyperbolic Large Language Models
Sarang Patil
Zeyong Zhang
Yiran Huang
Tengfei Ma
Mengjia Xu
AI4CE
215
0
0
06 Sep 2025
Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation
Hongyan Xie
Yitong Yao
Yikun Ban
Zixuan Huang
Deqing Wang
Zhenhe Wu
Haoxiang Su
Chao Wang
Shuangyong Song
LRM
211
3
0
06 Sep 2025
CTCC: A Robust and Stealthy Fingerprinting Framework for Large Language Models via Cross-Turn Contextual Correlation Backdoor
Zhenhua Xu
Xixiang Zhao
Xubin Yue
Shengwei Tian
C. D. Lin
Meng Han
189
8
0
05 Sep 2025
Set Block Decoding is a Language Model Inference Accelerator
Itai Gat
Heli Ben-Hamu
Marton Havasi
Daniel Haziza
Jeremy Reizenstein
Gabriel Synnaeve
David Lopez-Paz
Brian Karrer
Y. Lipman
149
6
0
04 Sep 2025
SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment
Yuqing Huang
Rongyang Zhang
Qimeng Wang
Chengqiang Lu
Yan Gao
...
Xuyang Zhi
Guiquan Liu
Xin Li
Hao Wang
Tong Xu
CLL
178
2
0
04 Sep 2025
Towards a Unified View of Large Language Model Post-Training
Xingtai Lv
Yuxin Zuo
Youbang Sun
Hongyi Liu
Yuntian Wei
...
Lixuan He
Xuekai Zhu
Kaiyan Zhang
Bingning Wang
Ning Ding
OffRL
108
11
0
04 Sep 2025
On Robustness and Reliability of Benchmark-Based Evaluation of LLMs
Riccardo Lunardi
V. D. Mea
Stefano Mizzaro
Kevin Roitero
165
5
0
04 Sep 2025
A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models
Yanbo Wang
Yongcan Yu
Jian Liang
Ran He
HILM
LRM
205
5
0
04 Sep 2025
EverTracer: Hunting Stolen Large Language Models via Stealthy and Robust Probabilistic Fingerprint
Zhenhua Xu
Meng Han
Wenpeng Xing
189
7
0
03 Sep 2025
Mixture-of-Clustered-Experts: Advancing Expert Specialization and Generalization in Instruction Tuning
Sugyeong Eo
Jungjun Lee
Chanjun Park
Heuiseok Lim
MoE
146
0
0
03 Sep 2025
From Construction to Injection: Edit-Based Fingerprints for Large Language Models
Yue Li
Xin Yi
Dongsheng Shi
Yongyi Cui
Gerard de Melo
Xiaoling Wang
KELM
AAML
212
1
0
03 Sep 2025
Binary Quantization For LLMs Through Dynamic Grouping
Xinzhe Zheng
Zhen-Qun Yang
H. Xie
S. J. Qin
Arlene Chen
Fangzhen Lin
MQ
208
0
0
03 Sep 2025
TeRA: Vector-based Random Tensor Network for High-Rank Adaptation of Large Language Models
Yuxuan Gu
Wuyang Zhou
Giorgos Iacovides
Danilo Mandic
89
1
0
03 Sep 2025
LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference
Krishna Teja Chitty-Venkata
Sandeep Madireddy
M. Emani
V. Vishwanath
MoE
160
1
0
02 Sep 2025
Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving
Fangzhou Wu
Sandeep Silwal
235
0
0
02 Sep 2025
JudgeAgent: Beyond Static Benchmarks for Knowledge-Driven and Dynamic LLM Evaluation
Zhichao Shi
Xuhui Jiang
Chengjin Xu
Cangli Yao
Zhenxin Huang
Shengjie Ma
Yinghan Shen
Jian Guo
Yuanzhuo Wang
LLMAG
ELM
296
0
0
02 Sep 2025
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Jindong Li
Yali Fu
Li Fan
Jiahong Liu
Yao Shu
Chengwei Qin
Menglin Yang
Irwin King
Rex Ying
OffRL
LRM
AI4CE
226
14
0
02 Sep 2025
Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs
Andong Hua
Kenan Tang
Chenhe Gu
Jindong Gu
Eric Wong
Yao Qin
LRM
115
2
0
01 Sep 2025
Dream-Coder 7B: An Open Diffusion Language Model for Code
Zhihui Xie
Jiacheng Ye
Lin Zheng
Lei Li
Jingwei Dong
...
Xueliang Zhao
Shansan Gong
Xin Jiang
Zhenguo Li
Lingpeng Kong
DiffM
139
22
0
01 Sep 2025
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Haiyuan Wan
Chen Yang
Junchi Yu
Meiqi Tu
Jiaxuan Lu
...
Jiaqing Xie
Aoran Wang
W. Zhang
Philip Torr
Dongzhan Zhou
166
8
0
01 Sep 2025
GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping
Qifu Wen
Xi Zeng
Zihan Zhou
Shuaijun Liu
M. Hosseinzadeh
Ningxin Su
Reza Rawassizadeh
268
0
0
01 Sep 2025
LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving
Huanqi Hu
Bowen Xiao
Shixuan Sun
Jianian Yin
Zhexi Zhang
...
Chengquan Jiang
Weiqi Xu
Xiaoying Jia
Xin Liu
Minyi Guo
MQ
VLM
118
5
0
01 Sep 2025
DTRNet: Dynamic Token Routing Network to Reduce Quadratic Costs in Transformers
Aman Sharma
Saeed Najafi
Parsa Farinneya
Benyamin Jamialahmadi
Marzieh S. Tahaei
Yuhe Fan
Mehdi Rezagholizadeh
Boxing Chen
A. Jafari
88
1
0
31 Aug 2025
Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling
Junfeng Ran
Guangxiang Zhao
Yuhan Wu
Dawei Zhu
Longyun Wu
Yikai Zhao
Tong Yang
Lin Sun
Xiangzheng Zhang
Sujian Li
MoE
MoMe
93
0
0
31 Aug 2025
Unlocking the Effectiveness of LoRA-FP for Seamless Transfer Implantation of Fingerprints in Downstream Models
Zhenhua Xu
Zhaokun Yan
Binhan Xu
Xin Tong
Haitao Xu
Y. Chen
Meng Han
129
8
0
31 Aug 2025
PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference
Hao Zhang
Mengsi Lyu
Zhuo Chen
Xingrun Xing
Yulong Ao
Yonghua Lin
479
1
0
29 Aug 2025
Diffusion Language Models Know the Answer Before Decoding
Pengxiang Li
Yefan Zhou
Dilxat Muhtar
L. Yin
Shilin Yan
Li Shen
Yi Liang
Soroush Vosoughi
Shiwei Liu
179
24
0
27 Aug 2025
Predicting the Order of Upcoming Tokens Improves Language Modeling
Zayd Muhammad Kawakibi Zuhri
Erland Hilman Fuadi
Alham Fikri Aji
AI4TS
48
0
0
26 Aug 2025
UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
Zihao Huang
Yu Bao
Qiyang Min
S. Chen
Ran Guo
...
Defa Zhu
Yutao Zeng
Banggu Wu
Xun Zhou
Siyuan Qiao
MoE
181
4
0
26 Aug 2025
Enabling MoE on the Edge via Importance-Driven Expert Scheduling
Guoying Zhu
Meng Li
Haipeng Dai
Xuechen Liu
Weijun Wang
Keran Li
Jun xiao
Ligeng Chen
Wei Wang
MoE
290
1
0
26 Aug 2025
Task-Stratified Knowledge Scaling Laws for Post-Training Quantized Large Language Models
Chenxi Zhou
Pengfei Cao
Jiang Li
Jun Zhao
Kang Liu
Jun Zhao
Kang Liu
MQ
188
0
0
26 Aug 2025
Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap
Jun Wang
Ninglun Gu
Kailai Zhang
Zijiao Zhang
Yelun Bao
...
Liwei Liu
Yihuan Liu
Pengyong Li
Gary G. Yen
Junchi Yan
ALM
ELM
228
0
0
26 Aug 2025
Dynamic Collaboration of Multi-Language Models based on Minimal Complete Semantic Units
Chao Hao
Zezheng Wang
Yanhua Huang
Ruiwen Xu
Wenzhe Niu
Xin Liu
Zitong Yu
114
1
0
26 Aug 2025
DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction
Weilin Cai
Le Qin
Shwai He
Junwei Cui
Ang Li
Jiayi Huang
MoE
124
0
0
25 Aug 2025
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
Yifan Wang
Binbin Liu
Fengze Liu
Yuanfan Guo
Jiyao Deng
Xuecheng Wu
Weidong Zhou
Xiaohuan Zhou
Taifeng Wang
141
0
0
25 Aug 2025
Integral Transformer: Denoising Attention, Not Too Much Not Too Little
I. Kobyzev
Abbas Ghaddar
Dingtao Hu
Boxing Chen
131
0
0
25 Aug 2025
Proximal Supervised Fine-Tuning
Wenhong Zhu
Ruobing Xie
R. Wang
Xingwu Sun
Di Wang
Pengfei Liu
OffRL
85
3
0
25 Aug 2025
Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models
Wataru Ikeda
Kazuki Yano
Ryosuke Takahashi
Jaesung Lee
Keigo Shibata
Jun Suzuki
90
1
0
25 Aug 2025
Weights-Rotated Preference Optimization for Large Language Models
Chenxu Yang
Ruipeng Jia
Mingyu Zheng
Naibin Gu
Zheng Lin
Siyuan Chen
Weichong Yin
Hua Wu
Weiping Wang
142
0
0
25 Aug 2025
Riemannian Optimization for LoRA on the Stiefel Manifold
JuneYoung Park
MinJae Kang
Seongbae Lee
Haegang Lee
S. Kim
Jaeho Lee
155
1
0
25 Aug 2025
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Krishna Teja Chitty-Venkata
Sylvia Howland
Golara Azar
Daria Soboleva
Natalia Vassilieva
Siddhisanket Raskar
M. Emani
V. Vishwanath
MoE
113
1
0
24 Aug 2025
Towards Safeguarding LLM Fine-tuning APIs against Cipher Attacks
Jack Youstra
Mohammed Mahfoud
Yang Yan
Henry Sleight
Ethan Perez
Mrinank Sharma
AAML
174
5
0
23 Aug 2025
Learning from Diverse Reasoning Paths with Routing and Collaboration
Zhenyu Lei
Zhen Tan
Song Wang
Yaochen Zhu
Zihan Chen
Yushun Dong
Jundong Li
LRM
200
6
0
23 Aug 2025
Being Kind Isn't Always Being Safe: Diagnosing Affective Hallucination in LLMs
Sewon Kim
Jiwon Kim
Seungwoo Shin
Hyejin Chung
Daeun Moon
Yejin Kwon
Hyunsoo Yoon
123
0
0
23 Aug 2025
Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Yakup Abrek Er
.Ilker Kesen
Gözde Gül Şahin
Aykut Erdem
ELM
VLM
158
1
0
22 Aug 2025
Previous
1
2
3
...
8
9
10
...
37
38
39
Next