1803.05457
Cited By
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
14 March 2018
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
Papers citing
"Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"
50 / 1,910 papers shown
CEQuest: Benchmarking Large Language Models for Construction Estimation
Y. Wu
L. xilinx Wang
Rui Liu
99
1
0
22 Aug 2025
Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Tianyao Shi
Yi Ding
MQ
133
3
0
22 Aug 2025
QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting
Nicole Cho
William Watson
Alec Koppel
Sumitra Ganesh
Manuela Veloso
AAML
156
0
0
22 Aug 2025
WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling
Jiacheng Li
Jianchao Tan
Zhidong Yang
Pingwei Sun
Feiye Huo
...
Xiangyu Zhang
Maoxin He
Guangming Tan
Weile Jia
Tong Zhao
113
3
0
21 Aug 2025
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
Qitao Tan
Xiaoying Song
Jin Lu
Guoming Li
Jun Liu
...
Jundong Li
Xiaoming Zhai
Shaoyi Huang
Wei Niu
Geng Yuan
MQ
229
0
0
21 Aug 2025
CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression
Muchammad Daniyal Kautsar
Afra Majida Hariono
Widyawan
Syukron Abu Ishaq Alfarozi
Kuntpong Woraratpanya
162
0
0
21 Aug 2025
SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version
Nghiem Thanh Pham
Tung Kieu
Duc-Manh Nguyen
Son Ha Xuan
Nghia Duong-Trung
Danh Le-Phuoc
174
2
0
21 Aug 2025
Dream 7B: Diffusion Large Language Models
Jiacheng Ye
Zhihui Xie
Lin Zheng
Lei Li
Zirui Wu
Xin Jiang
Zhenguo Li
Lingpeng Kong
DiffM
VLM
1.0K
110
0
21 Aug 2025
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
Xiaojuan Tang
Fanxu Meng
Pingzhi Tang
Yuxuan Wang
Di Yin
Xing Sun
M. Zhang
197
0
0
21 Aug 2025
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
Yuxian Gu
Qinghao Hu
Shang Yang
Haocheng Xi
Junyu Chen
Song Han
Han Cai
255
14
0
21 Aug 2025
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Nvidia
Aarti Basant
Abhijit Khairnar
Abhijit Paithankar
Abhinav Khattar
...
Keith Wyss
Keshav Santhanam
Kezhi Kong
Krzysztof Pawelec
Kumar Anik
LRM
298
0
0
20 Aug 2025
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
Haokun Lin
Haobo Xu
Yichen Wu
Ziyu Guo
Renrui Zhang
Zhichao Lu
Ying Wei
Gang Qu
Zhenan Sun
DiffM
MQ
178
9
0
20 Aug 2025
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Xiao Liang
Zhongzhi Li
Yeyun Gong
Yelong Shen
Y. Wu
Zhijiang Guo
Weizhu Chen
LRM
241
25
0
19 Aug 2025
Revisiting RAG Ensemble: A Theoretical and Mechanistic Analysis of Multi-RAG System Collaboration
Yifei Chen
Guanting Dong
Yutao Zhu
Zhicheng Dou
169
2
0
19 Aug 2025
A Fully Spectral Neuro-Symbolic Reasoning Architecture with Graph Signal Processing as the Computational Backbone
Andrew Kiruluta
88
0
0
19 Aug 2025
GLASS: Test-Time Acceleration for LLMs via Global-Local Neural Importance Aggregation
Amirmohsen Sattarifard
Sepehr Lavasani
Ehsan Imani
Kunlin Zhang
Hanlin Xu
Fengyu Sun
Negar Hassanpour
Chao Gao
VLM
104
1
0
19 Aug 2025
Sycophancy under Pressure: Evaluating and Mitigating Sycophantic Bias via Adversarial Dialogues in Scientific QA
Kaiwei Zhang
Qi Jia
Z. Chen
Wei Sun
Xiangyang Zhu
Chunyi Li
D. Zhu
Guangtao Zhai
AAML
141
5
0
19 Aug 2025
Maximum Score Routing For Mixture-of-Experts
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Bowen Dong
Yilong Fan
Yutao Sun
Zhenyu Li
Tengyu Pan
Xun Zhou
Jianyong Wang
MoE
120
2
0
18 Aug 2025
RAJ-PGA: Reasoning-Activated Jailbreak and Principle-Guided Alignment Framework for Large Reasoning Models
Jianhao Chen
Mayi Xu
Xiaohu Li
Yongqi Li
Xiangyu Zhang
Jianjie Huang
T. Qian
Xiaochun Cao
Tieyun Qian
LRM
173
0
0
18 Aug 2025
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
David Heineman
Valentin Hofmann
Ian H. Magnusson
Yuling Gu
Noah A. Smith
Hannaneh Hajishirzi
Kyle Lo
Jesse Dodge
ALM
122
4
0
18 Aug 2025
ReaLM: Reflection-Enhanced Autonomous Reasoning with Small Language Models
Yuanfeng Xu
Zehui Dai
Jian Liang
Jiapeng Guan
Guangrun Wang
Liang Lin
Xiaohui Lv
LLMAG
LRM
140
0
0
17 Aug 2025
Mitigating Jailbreaks with Intent-Aware LLMs
Wei Jie Yeo
Frank Xing
Erik Cambria
AAML
141
0
0
16 Aug 2025
AgentCDM: Enhancing Multi-Agent Collaborative Decision-Making via ACH-Inspired Structured Reasoning
Xuyang Zhao
Shiwan Zhao
Hualong Yu
Liting Zhang
Qicheng Li
LRM
AI4CE
87
2
0
16 Aug 2025
Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends
Zhenhua Xu
Xubin Yue
Zhebo Wang
Qichen Liu
Xixiang Zhao
...
Wenjun Zeng
Wengpeng Xing
Dezhang Kong
C. D. Lin
Meng Han
AILaw
WaLM
254
11
0
15 Aug 2025
Every 28 Days the AI Dreams of Soft Skin and Burning Stars: Scaffolding AI Agents with Hormones and Emotions
Leigh Levinson
Christopher J. Agostino
58
0
0
15 Aug 2025
EffiEval: Efficient and Generalizable Model Evaluation via Capability Coverage Maximization
Yaoning Wang
Jiahao Ying
Yixin Cao
Yubo Ma
Yugang Jiang
ELM
36
2
0
13 Aug 2025
Amazon Nova AI Challenge -- Trusted AI: Advancing secure, AI-assisted software development
Sattvik Sahai
Prasoon Goyal
Michael Johnston
Anna Gottardi
Yao Lu
...
Lavina Vaz
Leslie Ball
Maureen Murray
Rahul Gupta
Shankar Ananthakrishna
113
1
0
13 Aug 2025
Slow Tuning and Low-Entropy Masking for Safe Chain-of-Thought Distillation
Ziyang Ma
Qingyue Yuan
Linhai Zhang
Deyu Zhou
LRM
123
2
0
13 Aug 2025
AgriGPT: a Large Language Model Ecosystem for Agriculture
Bo Yang
Yu Zhang
Lanfei Feng
Yunkui Chen
J. Zhang
...
Yuxuan Chen
Guijun Yang
Yong He
Runhe Huang
Shijian Li
LLMAG
KELM
222
4
0
12 Aug 2025
SinLlama -- A Large Language Model for Sinhala
Moratuwa Engineering Research Conference (MERCon), 2025
H.W.K.Aravinda
Rashad Sirajudeen
Samith Karunathilake
Nisansa de Silva
Surangika Ranathunga
Rishemjit Kaur
LRM
284
1
0
12 Aug 2025
TiMoE: Time-Aware Mixture of Language Experts
Robin Faro
Dongyang Fan
Tamar Alphaidze
Martin Jaggi
MoE
143
1
0
12 Aug 2025
Progressive Depth Up-scaling via Optimal Transport
Mingzi Cao
Xi Wang
Nikolaos Aletras
80
1
0
11 Aug 2025
OverFill: Two-Stage Models for Efficient Language Model Decoding
Woojeong Kim
Junxiong Wang
Jing Nathan Yan
Mohamed S. Abdelfattah
Alexander M Rush
108
0
0
11 Aug 2025
ThinkTuning: Instilling Cognitive Reflections without Distillation
Aswin Rrv
Jacob Dineen
Divij Handa
Md Nayem Uddin
Mihir Parmar
Chitta Baral
Ben Zhou
ReLM
LRM
204
4
0
11 Aug 2025
BharatBBQ: A Multilingual Bias Benchmark for Question Answering in the Indian Context
Aditya Tomar
Nihar Ranjan Sahoo
P. Bhattacharyya
123
1
0
09 Aug 2025
AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance
Lixuan He
Jie Feng
Yong Li
OffRL
LRM
235
3
0
09 Aug 2025
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
Zhijun Tu
Hanting Chen
Siqi Liu
Chuanjian Liu
Jian Li
Jie Hu
Yunhe Wang
MQ
122
0
0
09 Aug 2025
Train It and Forget It: Merge Lists are Unnecessary for BPE Inference in Language Models
Tomohiro Sawada
Kartik Goyal
MoMe
102
0
0
08 Aug 2025
Pruning Large Language Models by Identifying and Preserving Functional Networks
Yiheng Liu
Junhao Ning
Sichen Xia
Xiaohui Gao
Ning Qiang
Bao Ge
Junwei Han
Xiaoyan Cai
155
1
0
07 Aug 2025
Align, Don't Divide: Revisiting the LoRA Architecture in Multi-Task Learning
Jinda Liu
Bo Cheng
Yi-Ju Chang
Yuan Wu
MoMe
83
0
0
07 Aug 2025
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs
Xiaodong Chen
Mingming Ha
Zhenzhong Lan
Jing Zhang
Jianguo Li
MoE
121
1
0
07 Aug 2025
Cross-LoRA: A Data-Free LoRA Transfer Framework across Heterogeneous LLMs
Feifan Xia
Mingyang Liao
Yuyang Fang
Defang Li
Yantong Xie
Guanqiang Qi
Yang Li
Deguo Xia
Jizhou Huang
MoMe
120
3
0
07 Aug 2025
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards
Xu Guo
Tianyi Liang
Tong Jian
Xiaogui Yang
Ling-I Wu
Chenhui Li
Z. Lu
Qipeng Guo
Kai Chen
277
2
0
06 Aug 2025
Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning
Magauiya Zhussip
Dmitriy Shopkhoev
Ammar Ali
Stamatios Lefkimmiatis
109
2
0
06 Aug 2025
FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-Design
Hao Zhang
Aining Jia
Weifeng Bu
Y. Cai
Kai Sheng
Hao Chen
Xin He
MQ
128
0
0
06 Aug 2025
Tensorized Clustered LoRA Merging for Multi-Task Interference
Zhan Su
Fengran Mo
G. Liang
Jinghan Zhang
Bingbing Wen
Prayag Tiwari
Jian-Yun Nie
MoMe
182
0
0
06 Aug 2025
RCP-Merging: Merging Long Chain-of-Thought Models with Domain-Specific Models by Considering Reasoning Capability as Prior
Junyao Yang
Jianwei Wang
Huiping Zhuang
Cen Chen
Ziqian Zeng
MoMe
LRM
173
1
0
05 Aug 2025
Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models
He Xiao
Qingyao Yang
Dirui Xie
Wendong Xu
Wenyong Zhou
Haobo Liu
Zhengwu Liu
Ngai Wong
MQ
118
0
0
05 Aug 2025
RegMean++: Enhancing Effectiveness and Generalization of Regression Mean for Model Merging
The-Hai Nguyen
Dang Huu-Tien
Takeshi Suzuki
Le-Minh Nguyen
MoMe
283
2
0
05 Aug 2025
Trainable Dynamic Mask Sparse Attention
Jingze Shi
Yifan Wu
Yiran Peng
Liangdong Wang
Guang Liu
Yuyu Luo
354
3
0
04 Aug 2025