Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
14 March 2018 · arXiv 1803.05457
Peter Clark, Isaac Cowhey, Oren Etzioni, Tushar Khot, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord
ELM · RALM · LRM
Papers citing "Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge" (50 of 1,910 papers shown)

RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models
Zukang Xu, Yan Chen, Qiang Wu, Dawei Yang · MQ · 235 · 0 · 0 · 24 Sep 2025

Enhancing Linear Attention with Residual Learning
Xunhao Lai, Jialiang Kang, Jianqiao Lu, Tong Lin, Pengyu Zhao · KELM · CLL · 118 · 0 · 0 · 24 Sep 2025

Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
Deokjae Lee, Hyun Oh Song · MQ · 208 · 0 · 0 · 24 Sep 2025

Soft Tokens, Hard Truths
Natasha Butt, Ariel Kwiatkowski, Ismail Labiad, Julia Kempe, Yann Ollivier · OffRL · CLL · LRM · 165 · 1 · 0 · 23 Sep 2025

Prior-based Noisy Text Data Filtering: Fast and Strong Alternative For Perplexity
Yeongbin Seo, Gayoung Kim, Jaehyung Kim, Jinyoung Yeo · 150 · 0 · 0 · 23 Sep 2025

HyperAdapt: Simple High-Rank Adaptation
Abel Gurung, Joseph Campbell · 167 · 0 · 0 · 23 Sep 2025

CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs
Jin Young Kim, Ji Won Yoon · ReLM · LRM · 157 · 0 · 0 · 23 Sep 2025

On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs
Rongguang Ye, Ming Tang, Edith C. H. Ngai · MQ · 96 · 0 · 0 · 22 Sep 2025

QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models
Hyesung Jeon, Seojune Lee, Beomseok Kang, Yulhwa Kim, Jae-Joon Kim · MQ · 291 · 0 · 0 · 22 Sep 2025

TASO: Task-Aligned Sparse Optimization for Parameter-Efficient Model Adaptation
Daiye Miao, Yufang Liu, Jie Wang, Changzhi Sun, Yunke Zhang, Demei Yan, Shaokang Dong, Qi Zhang, Man Lan · 88 · 1 · 0 · 22 Sep 2025

Diagnosing Model Editing via Knowledge Spectrum
Tsung-Hsuan Pan, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen · KELM · 117 · 0 · 0 · 22 Sep 2025

Training-free Truthfulness Detection via Value Vectors in LLMs
Runheng Liu, Heyan Huang, Xingchen Xiao, Zhijing Wu · 93 · 0 · 0 · 22 Sep 2025

seqBench: A Tunable Benchmark to Quantify Sequential Reasoning Limits of LLMs
Mohammad Ramezanali, Mo Vazifeh, Paolo Santi · LRM · ELM · 91 · 0 · 0 · 21 Sep 2025

Dynamic Expert Specialization: Towards Catastrophic Forgetting-Free Multi-Domain MoE Adaptation
Junzhuo Li, Bo Wang, Xiuze Zhou, Xuming Hu · MoMe · CLL · MoE · 205 · 2 · 0 · 21 Sep 2025

MoEs Are Stronger than You Think: Hyper-Parallel Inference Scaling with RoE
Soheil Zibakhsh, Mohammad Samragh, K. Nishu, Lauren Hannah, Arnav Kundu, Minsik Cho · MoE · BDL · LRM · 267 · 0 · 0 · 21 Sep 2025

PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models
He Xiao, Runming Yang, Qingyao Yang, Wendong Xu, Zheng Li, Yupeng Su, Zhengwu Liu, Hongxia Yang, Ngai Wong · MQ · 143 · 2 · 0 · 21 Sep 2025

EG-MLA: Embedding-Gated Multi-head Latent Attention for Scalable and Efficient LLMs
Zhengge Cai, Haowen Hou · 70 · 0 · 0 · 20 Sep 2025

Rethinking the Role of Text Complexity in Language Model Pretraining
Dan John Velasco, M. R · 215 · 2 · 0 · 20 Sep 2025

SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection
Maithili Joshi, Palash Nandi, Tanmoy Chakraborty · AAML · LLMSV · 105 · 0 · 0 · 19 Sep 2025

Distribution-Aligned Decoding for Efficient LLM Task Adaptation
Senkang Hu, Xudong Han, Jinqi Jiang, Yihang Tao, Zihan Fang, Yong Dai, Sam Kwong, Yuguang Fang · 239 · 2 · 0 · 19 Sep 2025

DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning
Sikai Bai, Haoxi Li, Jie Zhang, Zicong Hong, Song Guo · MoE · 118 · 1 · 0 · 19 Sep 2025

Concept Unlearning in Large Language Models via Self-Constructed Knowledge Triplets
Tomoya Yamashita, Yuuki Yamanaka, M. Yamada, Takayuki Miura, Toshiki Shibahara, Tomoharu Iwata · MU · 98 · 1 · 0 · 19 Sep 2025

Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research
Richard Diehl Martinez, David Demitri Africa, Yuval Weiss, Suchir Salhan, Ryan Daniels, P. Buttery · 144 · 1 · 0 · 19 Sep 2025

Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining
Ping Guo, Y. Ren, Binbin Liu, Fengze Liu, Haobin Lin, Yifan Zhang, Bingni Zhang, Taifeng Wang, Yin Zheng · 155 · 1 · 0 · 19 Sep 2025

Pre-training under infinite compute
Konwoo Kim, Suhas Kotha, Abigail Z. Jacobs, Tatsunori Hashimoto · 229 · 3 · 0 · 18 Sep 2025

CARGO: A Framework for Confidence-Aware Routing of Large Language Models
Amine Barrak, Yosr Fourati, Michael Olchawa, Emna Ksontini, Khalil Zoghlami · 153 · 1 · 0 · 18 Sep 2025

Fair-GPTQ: Bias-Aware Quantization for Large Language Models
Irina Proskurina, Guillaume Metzler, Julien Velcin · MQ · 130 · 0 · 0 · 18 Sep 2025

FURINA: Free from Unmergeable Router via LINear Aggregation of mixed experts
Jiayi Han, Liang Du, Yinda Chen, Xiao Kang, Weiyang Ding, Donghong Han · MoE · MoMe · 128 · 0 · 0 · 18 Sep 2025

NIRVANA: Structured pruning reimagined for large language models compression
Mengting Ai, Tianxin Wei, Sirui Chen, Jingrui He · VLM · 1.6K · 1 · 0 · 17 Sep 2025

SBVR: Summation of BitVector Representation for Efficient LLM Quantization
Wonjun Bang, Jongseok Park, Hongseung Yu, Kyungmin Bin, Kyunghan Lee · MQ · 152 · 0 · 0 · 17 Sep 2025

Synthetic bootstrapped pretraining
Zitong Yang, Aonan Zhang, Hong Liu, Tatsunori Hashimoto, Emmanuel Candès, Chong-Jun Wang, Ruoming Pang · SyDa · 295 · 0 · 0 · 17 Sep 2025

DSFT: Inspiring Diffusion Large Language Models to Comprehend Mathematical and Logical Patterns
Ranfei Chen, Ming Chen · DiffM · AI4CE · 81 · 0 · 0 · 17 Sep 2025

SteeringSafety: A Systematic Safety Evaluation Framework of Representation Steering in LLMs
Vincent Siu, Nicholas Crispino, David Park, Nathan W. Henry, Yu Yang, Yang Liu, Kurt Thomas, Chenguang Wang · LLMSV · 333 · 1 · 0 · 16 Sep 2025

Preservation of Language Understanding Capabilities in Speech-aware Large Language Models
Marek Kubis, Paweł Skórzewski, Iwona Christop, Mateusz Czyżnikiewicz, Jakub Kubiak, Łukasz Bondaruk, Marcin Lewandowski · AuLLM · ELM · 190 · 0 · 0 · 15 Sep 2025

AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models
Sangjun Lee, Seung-taek Woo, Jungyu Jin, Changhun Lee, Eunhyeok Park · MQ · 116 · 3 · 0 · 15 Sep 2025

NeuroStrike: Neuron-Level Attacks on Aligned LLMs
Lichao Wu, Sasha Behrouzi, Mohamadreza Rostami, Maximilian Thang, S. Picek, A. Sadeghi · AAML · 240 · 1 · 0 · 15 Sep 2025

MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
Matteo Marcuzzo, A. Zangari, A. Albarelli, Jose Camacho-Collados, Mohammad Taher Pilehvar · 216 · 3 · 0 · 15 Sep 2025

CBP-Tuning: Efficient Local Customization for Black-box Large Language Models
Jiaxuan Zhao, Naibin Gu, Yuchen Feng, Xiyu Liu, Peng Fu, Zheng Lin, Weiping Wang · 112 · 0 · 0 · 15 Sep 2025

Fluid Language Model Benchmarking
Valentin Hofmann, David Heineman, Ian H. Magnusson, Kyle Lo, Jesse Dodge, Maarten Sap, Pang Wei Koh, Chun Wang, Hannaneh Hajishirzi, Noah A. Smith · 136 · 7 · 0 · 14 Sep 2025

From Parameters to Performance: A Data-Driven Study on LLM Structure and Development
Suqing Wang, Zuchao Li, Luohe Shi, Bo Du, Hai Zhao, Yun Li, Qianren Wang · 135 · 0 · 0 · 14 Sep 2025

Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs
Hang Guo, Yawei Li, Luca Benini · MQ · 215 · 0 · 0 · 14 Sep 2025

AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
S. Shah, Saurav Prakash, Balaraman Ravindran · 92 · 0 · 0 · 14 Sep 2025

Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs
Yixiao Zhou, Ziyu Zhao, Dongzhou Cheng, Zhiliang Wu, Jie Gui, Yi-feng Yang, Fei Wu, Yu Cheng, Hehe Fan · MoMe · MoE · 164 · 5 · 0 · 12 Sep 2025

Test-Time Warmup for Multimodal Large Language Models
Nikita Rajaneesh, Thomas P. Zollo, R. Zemel · MLLM · VLM · LRM · 209 · 0 · 0 · 12 Sep 2025

Automated MCQA Benchmarking at Scale: Evaluating Reasoning Traces as Retrieval Sources for Domain Adaptation of Small Language Models
Ozan Gokdemir, N. Getty, Robert Underwood, Sandeep Madireddy, Franck Cappello, Arvind Ramanathan, Ian Foster, R. Stevens · ELM · LRM · 112 · 1 · 0 · 12 Sep 2025

GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models
Zhaohan Zhang, Ziquan Liu, Ioannis Patras · 152 · 2 · 0 · 11 Sep 2025

TORSO: Template-Oriented Reasoning Towards General Tasks
Minhyuk Kim, Seungyoon Lee, Heuiseok Lim · LRM · 189 · 0 · 0 · 11 Sep 2025

ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
Bingxin Xu, Zhen Dong, Oussama Elachqar, Yuzhang Shang · MQ · 192 · 1 · 0 · 11 Sep 2025

Open-sci-ref-0.01: open and reproducible reference baselines for language model and dataset comparison
Marianna Nezhurina, Jörg Franke, Taishi Nakamura, Timur Carstensen, Niccolò Ajroldi, Ville Komulainen, David Salinas, J. Jitsev · 178 · 2 · 0 · 10 Sep 2025

Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models
Pranav Pawar, Kavish Shah, Akshat Bhalani, Komal Kasat, Dev Mittal, Hadi Gala, Deepali Patil, Nikita Raichada, Monali Deshmukh · ReLM · LRM · 80 · 0 · 0 · 10 Sep 2025