1803.05457
Cited By
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
14 March 2018
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
Papers citing
"Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"
50 / 1,906 papers shown
Kad: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral
Ayoub Hammal
Pierre Zweigenbaum
Caio Corro
224
0
0
30 Oct 2025
From Amateur to Master: Infusing Knowledge into LLMs via Automated Curriculum Learning
Nishit Neema
Srinjoy Mukherjee
Sapan Shah
Gokul Ramakrishnan
Ganesh Venkatesh
CLL
256
0
0
30 Oct 2025
Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models
J. Curtò
I. D. Zarzà
Pablo García
Jordi Cabot
ELM
LRM
203
0
0
30 Oct 2025
OmniEduBench: A Comprehensive Chinese Benchmark for Evaluating Large Language Models in Education
Min Zhang
Hao Chen
Hao Chen
Wenqi Zhang
Didi Zhu
Xin Lin
Bo Jiang
Aimin Zhou
Fei Wu
Kun Kuang
ELM
161
0
0
30 Oct 2025
Angular Steering: Behavior Control via Rotation in Activation Space
Hieu M. Vu
T. Nguyen
LLMSV
324
3
0
30 Oct 2025
1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models
Zeliang Zong
Kai Zhang
Zheyang Li
Wenming Tan
Ye Ren
Yiyan Zhai
Jilin Hu
128
0
0
30 Oct 2025
MossNet: Mixture of State-Space Experts is a Multi-Head Attention
Shikhar Tuli
James Smith
Haris Jeelani
Chi-Heng Lin
Abhishek Patel
Vasili Ramanishka
Yen-Chang Hsu
Hongxia Jin
MoE
267
0
0
30 Oct 2025
EdgeRunner 20B: Military Task Parity with GPT-5 while Running on the Edge
Jack FitzGerald
Aristotelis Lazaridis
Dylan Bates
Aman Sharma
Jonnathan Castillo
...
Dave Anderson
Jonathan Beck
Jamie Cuticello
Colton Malkerson
Tyler Saltsman
ELM
314
0
0
30 Oct 2025
Scales++: Compute Efficient Evaluation Subset Selection with Cognitive Scales Embeddings
Andrew M. Bean
Nabeel Seedat
Shengzhuang Chen
Jonathan Richard Schwarz
92
1
0
30 Oct 2025
Kimi Linear: An Expressive, Efficient Attention Architecture
Kimi Team
Yu Zhang
Zongyu Lin
Xingcheng Yao
J. Hu
...
Guokun Lai
Yuxin Wu
Xinyu Zhou
Zhilin Yang
Yulun Du
132
8
0
30 Oct 2025
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model
Biao Zhang
Yong Cheng
Siamak Shakeri
Xinyi Wang
Min Ma
Orhan Firat
141
1
0
30 Oct 2025
Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs
Pranav Bhandari
Nicolas Fay
Sanjeevan Selvaganapathy
Amitava Datta
Usman Naseem
Mehwish Nasim
LLMSV
208
1
0
29 Oct 2025
NeuronMM: High-Performance Matrix Multiplication for LLM Inference on AWS Trainium
Dinghong Song
Jierui Xu
Weichu Yang
Pengfei Su
Dong Li
150
0
0
29 Oct 2025
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
Mengzhao Chen
Meng Wu
Hui Jin
Zhihang Yuan
Jing Liu
...
Jin Ma
Zeyue Xue
Zhiheng Liu
Xingyan Bin
Ping Luo
MQ
238
1
0
29 Oct 2025
A Survey on Unlearning in Large Language Models
Ruichen Qiu
Jiajun Tan
Jiayue Pu
Honglin Wang
Xiao-Shan Gao
Fei Sun
MU
AILaw
PILM
633
0
0
29 Oct 2025
Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale
J. Michaelov
Roger P. Levy
Benjamin Bergen
AI4TS
128
0
0
28 Oct 2025
Parallel Loop Transformer for Efficient Test-Time Computation Scaling
Bohong Wu
Mengzhao Chen
Xiang Luo
Shen Yan
Qifan Yu
...
Hongrui Zhan
Zheng Zhong
Xun Zhou
Siyuan Qiao
Xingyan Bin
116
2
0
28 Oct 2025
Information-Theoretic Discrete Diffusion
Moongyu Jeon
Sangwoo Shin
Dongjae Jeon
Albert No
DiffM
FedML
167
0
0
28 Oct 2025
Optimizing Retrieval for RAG via Reinforcement Learning
Jiawei Zhou
Lei Chen
135
1
0
28 Oct 2025
LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis
Qingyue Zhang
Chang Chu
Tianren Peng
Qi Li
Xiangyang Luo
Zhihao Jiang
Shao-Lun Huang
92
0
0
28 Oct 2025
Charting the European LLM Benchmarking Landscape: A New Taxonomy and a Set of Best Practices
Špela Vintar
Taja Kuzman Pungeršek
Mojca Brglez
Nikola Ljubešić
183
0
0
28 Oct 2025
BlackboxNLP-2025 MIB Shared Task: Improving Circuit Faithfulness via Better Edge Selection
Yaniv Nikankin
Dana Arad
Itay Itzhak
Anja Reusch
Adi Simhi
Gal Kesten-Pomeranz
Yonatan Belinkov
68
1
0
28 Oct 2025
FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic
Kanghyun Choi
Hyeyoon Lee
S. Park
Dain Kwon
Jinho Lee
MQ
162
0
0
28 Oct 2025
Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT
Da Chang
Peng Xue
Yu Li
Yongxiang Liu
P. Xu
Shixun Zhang
204
1
0
28 Oct 2025
Beyond Line-Level Filtering for the Pretraining Corpora of LLMs
Chanwoo Park
Suyoung Park
Yelim Ahn
Jongmin Kim
Jongyeon Park
Jaejin Lee
100
0
0
28 Oct 2025
ChessQA: Evaluating Large Language Models for Chess Understanding
Qianfeng Wen
Zhenwei Tang
Ashton Anderson
ELM
LRM
197
1
0
28 Oct 2025
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
Yuxi Liu
Renjia Deng
Yutong He
Xue Wang
Tao Yao
Kun Yuan
134
0
0
28 Oct 2025
From Cross-Task Examples to In-Task Prompts: A Graph-Based Pseudo-Labeling Framework for In-context Learning
Zihan Chen
Song Wang
Xingbo Fu
Chengshuai Shi
Zhenyu Lei
Cong Shen
Jundong Li
128
1
0
28 Oct 2025
Offline Preference Optimization via Maximum Marginal Likelihood Estimation
Saeed Najafi
Alona Fyshe
OffRL
140
0
0
27 Oct 2025
A Survey on LLM Mid-Training
Chengying Tu
Xuemiao Zhang
Rongxiang Weng
Rumei Li
Chen Zhang
Yang Bai
Hongfei Yan
Jingang Wang
Xunliang Cai
OffRL
LRM
237
1
0
27 Oct 2025
Multi-Agent Evolve: LLM Self-Improve through Co-evolution
Yixing Chen
Yiding Wang
Siqi Zhu
Haofei Yu
Tao Feng
Muhan Zhang
M. Patwary
Jiaxuan You
LLMAG
LRM
295
5
0
27 Oct 2025
Probing Knowledge Holes in Unlearned LLMs
Myeongseob Ko
H. Just
Charles Fleming
Ming Jin
R. Jia
MU
302
0
0
27 Oct 2025
Frustratingly Easy Task-aware Pruning for Large Language Models
Yuanhe Tian
Junjie Liu
Xican Yang
Haishan Ye
Yan Song
133
1
0
26 Oct 2025
TELL-TALE: Task Efficient LLMs with Task Aware Layer Elimination
Omar Naim
Krish Sharma
Nicholas M. Asher
88
0
0
26 Oct 2025
RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability
Kaitong Cai
Jusheng Zhang
Yijia Fan
Jing Yang
Keze Wang
LRM
120
11
0
26 Oct 2025
SeeDNorm: Self-Rescaled Dynamic Normalization
Wenrui Cai
Defa Zhu
Qingjie Liu
Qiyang Min
144
0
0
26 Oct 2025
The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models
Yao Lu
Yuqi Li
Wenbin Xie
Shanqing Yu
Qi Xuan
Zhaowei Zhu
Shiping Wen
84
1
0
25 Oct 2025
Transformer Based Linear Attention with Optimized GPU Kernel Implementation
Armin Gerami
R. Duraiswami
132
0
0
24 Oct 2025
Decoding-Free Sampling Strategies for LLM Marginalization
David Pohl
Marco Cognetta
Junyoung Lee
Naoaki Okazaki
52
0
0
23 Oct 2025
Context-level Language Modeling by Learning Predictive Context Embeddings
Beiya Dai
Y. Liu
Daozheng Xue
Qipeng Guo
Kai Chen
Xinbing Wang
Bowen Zhou
Zhouhan Lin
LRM
139
0
0
23 Oct 2025
What Does It Take to Build a Performant Selective Classifier?
Stephan Rabanser
Nicolas Papernot
210
0
0
23 Oct 2025
Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs
Yanlin Song
Ben Liu
Víctor Gutiérrez-Basulto
Zhiwei Hu
Qianqian Xie
Min Peng
Sophia Ananiadou
Jeff Z. Pan
RALM
ReLM
LRM
279
0
0
23 Oct 2025
Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning
M. H. I. Abdalla
Zhipin Wang
Christian M. M. Frey
Steffen Eger
Josif Grabocka
135
0
0
22 Oct 2025
DiSRouter: Distributed Self-Routing for LLM Selections
Hang Zheng
Hongshen Xu
Yongkai Lin
Shuai Fan
Lu Chen
Kai Yu
131
1
0
22 Oct 2025
GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters
Anand Choudhary
Yasser Sulaıman
Lukas Mauch
G. B. Hacene
Fabien Cardinaux
Antoine Bosselut
132
0
0
22 Oct 2025
ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices
Xin Nie
Liang Dong
H. Zhang
JiaWang Xiao
G. Sun
MQ
448
0
0
22 Oct 2025
Data-Centric Lessons To Improve Speech-Language Pretraining
Vishaal Udandarao
Zhiyun Lu
Xuankai Chang
Yongqiang Wang
Violet Z. Yao
Albin Madapally Jose
Fartash Faghri
Josh Gardner
Chung-Cheng Chiu
136
0
0
22 Oct 2025
CPSVD: Enhancing Large Language Model Compression via Column-Preserving Singular Value Decomposition
Lin Xv
Jingsheng Gao
Xian Gao
Ting Li
Yuzhuo Fu
72
0
0
22 Oct 2025
Energy-Efficient and Dequantization-Free Q-LLMs: A Spiking Neural Network Approach to Salient Value Mitigation
Chenyu Wang
Zhanglu Yan
Zhi Zhou
Xu Chen
Weng-Fai Wong
MQ
152
0
0
22 Oct 2025
Restoring Pruned Large Language Models via Lost Component Compensation
Zijian Feng
Hanzhang Zhou
Zixiao Zhu
Tianjiao Li
Jia Jim Deryl Chua
Lee Onn Mak
Gee Wah Ng
Kezhi Mao
137
0
0
22 Oct 2025