Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1803.05457
Cited By

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning
Challenge

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

14 March 2018

Ashish Sabharwal

Carissa Schoenick

Oyvind Tafjord

ArXiv (abs)PDF HTML

Papers citing "Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"

50 / 1,910 papers shown

COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens

COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens

265

0

0

08 Sep 2025

LoaQ: Layer-wise Output Approximation Quantization

LoaQ: Layer-wise Output Approximation Quantization

90

1

0

08 Sep 2025

Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding

Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding

...

Shuaiqiang Wang

116

3

0

08 Sep 2025

Ban&Pick: Ehancing Performance and Efficiency of MoE-LLMs via Smarter Routing

Ban&Pick: Ehancing Performance and Efficiency of MoE-LLMs via Smarter Routing

178

0

0

08 Sep 2025

Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian

Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian

Michael Hoffmann

Stefan Schweter

Gokul Ramakrishnan

Dmitry Gaynullin

Nicolay J. Hammer

162

1

0

06 Sep 2025

Hyperbolic Large Language Models

Hyperbolic Large Language Models

215

0

0

06 Sep 2025

Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation

Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation

Shuangyong Song

211

3

0

06 Sep 2025

CTCC: A Robust and Stealthy Fingerprinting Framework for Large Language Models via Cross-Turn Contextual Correlation Backdoor

CTCC: A Robust and Stealthy Fingerprinting Framework for Large Language Models via Cross-Turn Contextual Correlation Backdoor

189

8

0

05 Sep 2025

Set Block Decoding is a Language Model Inference Accelerator

Set Block Decoding is a Language Model Inference Accelerator

Jeremy Reizenstein

Gabriel Synnaeve

David Lopez-Paz

149

6

0

04 Sep 2025

SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment

SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment

...

178

2

0

04 Sep 2025

Towards a Unified View of Large Language Model Post-Training

Towards a Unified View of Large Language Model Post-Training

...

108

11

0

04 Sep 2025

On Robustness and Reliability of Benchmark-Based Evaluation of LLMs

On Robustness and Reliability of Benchmark-Based Evaluation of LLMs

Riccardo Lunardi

Stefano Mizzaro

165

5

0

04 Sep 2025

A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models

A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models

205

5

0

04 Sep 2025

EverTracer: Hunting Stolen Large Language Models via Stealthy and Robust Probabilistic Fingerprint

EverTracer: Hunting Stolen Large Language Models via Stealthy and Robust Probabilistic Fingerprint

189

7

0

03 Sep 2025

Mixture-of-Clustered-Experts: Advancing Expert Specialization and Generalization in Instruction Tuning

Mixture-of-Clustered-Experts: Advancing Expert Specialization and Generalization in Instruction Tuning

146

0

0

03 Sep 2025

From Construction to Injection: Edit-Based Fingerprints for Large Language Models

From Construction to Injection: Edit-Based Fingerprints for Large Language Models

212

1

0

03 Sep 2025

Binary Quantization For LLMs Through Dynamic Grouping

Binary Quantization For LLMs Through Dynamic Grouping

208

0

0

03 Sep 2025

TeRA: Vector-based Random Tensor Network for High-Rank Adaptation of Large Language Models

TeRA: Vector-based Random Tensor Network for High-Rank Adaptation of Large Language Models

Giorgos Iacovides

89

1

0

03 Sep 2025

LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference

LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference

Krishna Teja Chitty-Venkata

Sandeep Madireddy

160

1

0

02 Sep 2025

Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving

Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving

235

0

0

02 Sep 2025

JudgeAgent: Beyond Static Benchmarks for Knowledge-Driven and Dynamic LLM Evaluation

JudgeAgent: Beyond Static Benchmarks for Knowledge-Driven and Dynamic LLM Evaluation

296

0

0

02 Sep 2025

Implicit Reasoning in Large Language Models: A Comprehensive Survey

Implicit Reasoning in Large Language Models: A Comprehensive Survey

OffRL LRM AI4CE

226

14

0

02 Sep 2025

Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs

Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs

115

2

0

01 Sep 2025

Dream-Coder 7B: An Open Diffusion Language Model for Code

Dream-Coder 7B: An Open Diffusion Language Model for Code

...

139

22

0

01 Sep 2025

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

...

166

8

0

01 Sep 2025

GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping

GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping

M. Hosseinzadeh

Reza Rawassizadeh

268

0

0

01 Sep 2025

LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving

LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving

...

Chengquan Jiang

118

5

0

01 Sep 2025

DTRNet: Dynamic Token Routing Network to Reduce Quadratic Costs in Transformers

DTRNet: Dynamic Token Routing Network to Reduce Quadratic Costs in Transformers

Parsa Farinneya

Benyamin Jamialahmadi

Marzieh S. Tahaei

Mehdi Rezagholizadeh

88

1

0

31 Aug 2025

Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling

Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling

Guangxiang Zhao

Xiangzheng Zhang

93

0

0

31 Aug 2025

Unlocking the Effectiveness of LoRA-FP for Seamless Transfer Implantation of Fingerprints in Downstream Models

Unlocking the Effectiveness of LoRA-FP for Seamless Transfer Implantation of Fingerprints in Downstream Models

129

8

0

31 Aug 2025

PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference

PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference

479

1

0

29 Aug 2025

Diffusion Language Models Know the Answer Before Decoding

Diffusion Language Models Know the Answer Before Decoding

Soroush Vosoughi

179

24

0

27 Aug 2025

Predicting the Order of Upcoming Tokens Improves Language Modeling

Predicting the Order of Upcoming Tokens Improves Language Modeling

Zayd Muhammad Kawakibi Zuhri

Erland Hilman Fuadi

Alham Fikri Aji

48

0

0

26 Aug 2025

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

...

181

4

0

26 Aug 2025

Enabling MoE on the Edge via Importance-Driven Expert Scheduling

Enabling MoE on the Edge via Importance-Driven Expert Scheduling

290

1

0

26 Aug 2025

Task-Stratified Knowledge Scaling Laws for Post-Training Quantized Large Language Models

Task-Stratified Knowledge Scaling Laws for Post-Training Quantized Large Language Models

Jun Zhao

Kang Liu

188

0

0

26 Aug 2025

Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap

Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap

...

228

0

0

26 Aug 2025

Dynamic Collaboration of Multi-Language Models based on Minimal Complete Semantic Units

Dynamic Collaboration of Multi-Language Models based on Minimal Complete Semantic Units

114

1

0

26 Aug 2025

DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction

DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction

124

0

0

25 Aug 2025

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

141

0

0

25 Aug 2025

Integral Transformer: Denoising Attention, Not Too Much Not Too Little

Integral Transformer: Denoising Attention, Not Too Much Not Too Little

131

0

0

25 Aug 2025

Proximal Supervised Fine-Tuning

Proximal Supervised Fine-Tuning

85

3

0

25 Aug 2025

Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models

Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models

Ryosuke Takahashi

90

1

0

25 Aug 2025

Weights-Rotated Preference Optimization for Large Language Models

Weights-Rotated Preference Optimization for Large Language Models

142

0

0

25 Aug 2025

Riemannian Optimization for LoRA on the Stiefel Manifold

Riemannian Optimization for LoRA on the Stiefel Manifold

155

1

0

25 Aug 2025

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

Krishna Teja Chitty-Venkata

Natalia Vassilieva

Siddhisanket Raskar

113

1

0

24 Aug 2025

Towards Safeguarding LLM Fine-tuning APIs against Cipher Attacks

Towards Safeguarding LLM Fine-tuning APIs against Cipher Attacks

Mohammed Mahfoud

174

5

0

23 Aug 2025

Learning from Diverse Reasoning Paths with Routing and Collaboration

Learning from Diverse Reasoning Paths with Routing and Collaboration

200

6

0

23 Aug 2025

Being Kind Isn't Always Being Safe: Diagnosing Affective Hallucination in LLMs

Being Kind Isn't Always Being Safe: Diagnosing Affective Hallucination in LLMs

123

0

0

23 Aug 2025

Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish

Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish

Gözde Gül Şahin

158

1

0

22 Aug 2025

1 2 3...8 9 10...37 38 39