Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1803.05457
Cited By

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning
Challenge

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

14 March 2018

Ashish Sabharwal

Carissa Schoenick

Oyvind Tafjord

ArXiv (abs)PDF HTML

Papers citing "Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"

50 / 1,906 papers shown

Kad: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral

Kad: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral

Pierre Zweigenbaum

224

0

0

30 Oct 2025

From Amateur to Master: Infusing Knowledge into LLMs via Automated Curriculum Learning

From Amateur to Master: Infusing Knowledge into LLMs via Automated Curriculum Learning

Srinjoy Mukherjee

Gokul Ramakrishnan

Ganesh Venkatesh

256

0

0

30 Oct 2025

Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models

Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models

203

0

0

30 Oct 2025

OmniEduBench: A Comprehensive Chinese Benchmark for Evaluating Large Language Models in Education

OmniEduBench: A Comprehensive Chinese Benchmark for Evaluating Large Language Models in Education

161

0

0

30 Oct 2025

Angular Steering: Behavior Control via Rotation in Activation Space

Angular Steering: Behavior Control via Rotation in Activation Space

324

3

0

30 Oct 2025

1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models

1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models

128

0

0

30 Oct 2025

MossNet: Mixture of State-Space Experts is a Multi-Head Attention

MossNet: Mixture of State-Space Experts is a Multi-Head Attention

Vasili Ramanishka

267

0

0

30 Oct 2025

EdgeRunner 20B: Military Task Parity with GPT-5 while Running on the Edge

EdgeRunner 20B: Military Task Parity with GPT-5 while Running on the Edge

Jack FitzGerald

Aristotelis Lazaridis

Jonnathan Castillo

...

Jamie Cuticello

Colton Malkerson

314

0

0

30 Oct 2025

Scales++: Compute Efficient Evaluation Subset Selection with Cognitive Scales Embeddings

Scales++: Compute Efficient Evaluation Subset Selection with Cognitive Scales Embeddings

Shengzhuang Chen

Jonathan Richard Schwarz

92

1

0

30 Oct 2025

Kimi Linear: An Expressive, Efficient Attention Architecture

Kimi Linear: An Expressive, Efficient Attention Architecture

...

132

8

0

30 Oct 2025

Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model

Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model

141

1

0

30 Oct 2025

Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs

Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs

Pranav Bhandari

Sanjeevan Selvaganapathy

208

1

0

29 Oct 2025

NeuronMM: High-Performance Matrix Multiplication for LLM Inference on AWS Trainium

NeuronMM: High-Performance Matrix Multiplication for LLM Inference on AWS Trainium

150

0

0

29 Oct 2025

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

...

238

1

0

29 Oct 2025

A Survey on Unlearning in Large Language Models

A Survey on Unlearning in Large Language Models

633

0

0

29 Oct 2025

Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale

Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale

Benjamin Bergen

128

0

0

28 Oct 2025

Parallel Loop Transformer for Efficient Test-Time Computation Scaling

Parallel Loop Transformer for Efficient Test-Time Computation Scaling

...

116

2

0

28 Oct 2025

Information-Theoretic Discrete Diffusion

Information-Theoretic Discrete Diffusion

167

0

0

28 Oct 2025

Optimizing Retrieval for RAG via Reinforcement Learning

Optimizing Retrieval for RAG via Reinforcement Learning

135

1

0

28 Oct 2025

LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis

LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis

92

0

0

28 Oct 2025

Charting the European LLM Benchmarking Landscape: A New Taxonomy and a Set of Best Practices

Charting the European LLM Benchmarking Landscape: A New Taxonomy and a Set of Best Practices

Taja Kuzman Pungeršek

Nikola Ljubešić

183

0

0

28 Oct 2025

BlackboxNLP-2025 MIB Shared Task: Improving Circuit Faithfulness via Better Edge Selection

BlackboxNLP-2025 MIB Shared Task: Improving Circuit Faithfulness via Better Edge Selection

Gal Kesten-Pomeranz

Yonatan Belinkov

68

1

0

28 Oct 2025

FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic

FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic

162

0

0

28 Oct 2025

Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT

Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT

204

1

0

28 Oct 2025

Beyond Line-Level Filtering for the Pretraining Corpora of LLMs

Beyond Line-Level Filtering for the Pretraining Corpora of LLMs

100

0

0

28 Oct 2025

ChessQA: Evaluating Large Language Models for Chess Understanding

ChessQA: Evaluating Large Language Models for Chess Understanding

Ashton Anderson

197

1

0

28 Oct 2025

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling

134

0

0

28 Oct 2025

From Cross-Task Examples to In-Task Prompts: A Graph-Based Pseudo-Labeling Framework for In-context Learning

From Cross-Task Examples to In-Task Prompts: A Graph-Based Pseudo-Labeling Framework for In-context Learning

128

1

0

28 Oct 2025

Offline Preference Optimization via Maximum Marginal Likelihood Estimation

Offline Preference Optimization via Maximum Marginal Likelihood Estimation

140

0

0

27 Oct 2025

A Survey on LLM Mid-Training

A Survey on LLM Mid-Training

237

1

0

27 Oct 2025

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

295

5

0

27 Oct 2025

Probing Knowledge Holes in Unlearned LLMs

Probing Knowledge Holes in Unlearned LLMs

Charles Fleming

302

0

0

27 Oct 2025

Frustratingly Easy Task-aware Pruning for Large Language Models

Frustratingly Easy Task-aware Pruning for Large Language Models

133

1

0

26 Oct 2025

TELL-TALE: Task Efficient LLMs with Task Aware Layer Elimination

TELL-TALE: Task Efficient LLMs with Task Aware Layer Elimination

Nicholas M. Asher

88

0

0

26 Oct 2025

RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability

RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability

120

11

0

26 Oct 2025

SeeDNorm: Self-Rescaled Dynamic Normalization

SeeDNorm: Self-Rescaled Dynamic Normalization

144

0

0

26 Oct 2025

The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models

The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models

84

1

0

25 Oct 2025

Transformer Based Linear Attention with Optimized GPU Kernel Implementation

Transformer Based Linear Attention with Optimized GPU Kernel Implementation

132

0

0

24 Oct 2025

Decoding-Free Sampling Strategies for LLM Marginalization

Decoding-Free Sampling Strategies for LLM Marginalization

52

0

0

23 Oct 2025

Context-level Language Modeling by Learning Predictive Context Embeddings

Context-level Language Modeling by Learning Predictive Context Embeddings

139

0

0

23 Oct 2025

What Does It Take to Build a Performant Selective Classifier?

What Does It Take to Build a Performant Selective Classifier?

Stephan Rabanser

Nicolas Papernot

210

0

0

23 Oct 2025

Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs

Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs

Víctor Gutiérrez-Basulto

Sophia Ananiadou

279

0

0

23 Oct 2025

Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning

Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning

M. H. I. Abdalla

Christian M. M. Frey

135

0

0

22 Oct 2025

DiSRouter: Distributed Self-Routing for LLM Selections

DiSRouter: Distributed Self-Routing for LLM Selections

131

1

0

22 Oct 2025

GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters

GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters

Anand Choudhary

Yasser Sulaıman

Fabien Cardinaux

Antoine Bosselut

132

0

0

22 Oct 2025

ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices

ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices

448

0

0

22 Oct 2025

Data-Centric Lessons To Improve Speech-Language Pretraining

Data-Centric Lessons To Improve Speech-Language Pretraining

Vishaal Udandarao

Albin Madapally Jose

Chung-Cheng Chiu

136

0

0

22 Oct 2025

CPSVD: Enhancing Large Language Model Compression via Column-Preserving Singular Value Decomposition

CPSVD: Enhancing Large Language Model Compression via Column-Preserving Singular Value Decomposition

72

0

0

22 Oct 2025

Energy-Efficient and Dequantization-Free Q-LLMs: A Spiking Neural Network Approach to Salient Value Mitigation

Energy-Efficient and Dequantization-Free Q-LLMs: A Spiking Neural Network Approach to Salient Value Mitigation

152

0

0

22 Oct 2025

Restoring Pruned Large Language Models via Lost Component Compensation

Restoring Pruned Large Language Models via Lost Component Compensation

Jia Jim Deryl Chua

137

0

0

22 Oct 2025

1 2 3 4 5 6...37 38 39