Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1803.05457
Cited By

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning
Challenge

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

14 March 2018

Ashish Sabharwal

Carissa Schoenick

Oyvind Tafjord

ArXiv (abs)PDF HTML

Papers citing "Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"

50 / 1,907 papers shown

ARA: Adaptive Rank Allocation for Efficient Large Language Model SVD Compression

ARA: Adaptive Rank Allocation for Efficient Large Language Model SVD Compression

120

0

0

22 Oct 2025

Beyond Uniform SVD:Dual-Level Optimization across Columns and Modules for LLM Compression

Beyond Uniform SVD:Dual-Level Optimization across Columns and Modules for LLM Compression

72

0

0

22 Oct 2025

Latent Space Factorization in LoRA

Latent Space Factorization in LoRA

108

0

0

22 Oct 2025

NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning

NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning

Ekaterina Shutova

161

0

0

21 Oct 2025

ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning

ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning

Xiangdong Zhang

138

0

0

21 Oct 2025

Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation

Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation

Giovanni De Muri

155

0

0

21 Oct 2025

Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

Shivaram Venkataraman

118

0

0

21 Oct 2025

Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection

Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection

185

0

0

21 Oct 2025

Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression

Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression

Yasuyuki Okoshi

Kazushi Kawamura

Masato Motomura

212

0

0

21 Oct 2025

ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters

ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters

228

0

0

21 Oct 2025

ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts

ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts

...

192

0

0

20 Oct 2025

Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents

Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents

...

158

0

0

20 Oct 2025

EduAdapt: A Question Answer Benchmark Dataset for Evaluating Grade-Level Adaptability in LLMs

EduAdapt: A Question Answer Benchmark Dataset for Evaluating Grade-Level Adaptability in LLMs

Abdellah El Mekki

Muhammad Abdul-Mageed

238

0

0

20 Oct 2025

Mapping Post-Training Forgetting in Language Models at Scale

Mapping Post-Training Forgetting in Language Models at Scale

Andreas Hochlehnert

Matthias Bethge

153

0

0

20 Oct 2025

Unbiased Gradient Low-Rank Projection

Unbiased Gradient Low-Rank Projection

148

0

0

20 Oct 2025

DynaKV: Enabling Accurate and Efficient Long-Sequence LLM Decoding on Smartphones

DynaKV: Enabling Accurate and Efficient Long-Sequence LLM Decoding on Smartphones

186

1

0

20 Oct 2025

The Free Transformer

The Free Transformer

François Fleuret

64

0

0

20 Oct 2025

Learning from Generalization Patterns: An Evaluation-Driven Approach to Enhanced Data Augmentation for Fine-Tuning Small Language Models

Learning from Generalization Patterns: An Evaluation-Driven Approach to Enhanced Data Augmentation for Fine-Tuning Small Language Models

Arijit Ghosh Chowdhury

Sharlina Keshava

Hannah R Marlowe

122

1

0

20 Oct 2025

From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models

From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models

Evgeny Stupachenko

128

1

0

20 Oct 2025

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

Charu C. Aggarwal

554

1

0

19 Oct 2025

DistilLock: Safeguarding LLMs from Unauthorized Knowledge Distillation on the Edge

DistilLock: Safeguarding LLMs from Unauthorized Knowledge Distillation on the Edge

126

0

0

19 Oct 2025

Vocab Diet: Reshaping the Vocabulary of LLMs with Vector Arithmetic

Vocab Diet: Reshaping the Vocabulary of LLMs with Vector Arithmetic

161

0

0

19 Oct 2025

Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration

Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration

LLMAG AI4CE LRM

72

0

0

18 Oct 2025

Expert Merging in Sparse Mixture of Experts with Nash Bargaining

Expert Merging in Sparse Mixture of Experts with Nash Bargaining

188

1

0

17 Oct 2025

Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation

Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation

148

0

0

17 Oct 2025

When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling

When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling

147

0

0

17 Oct 2025

From Characters to Tokens: Dynamic Grouping with Hierarchical BPE

From Characters to Tokens: Dynamic Grouping with Hierarchical BPE

104

0

0

17 Oct 2025

ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning

ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning

Federico Bianchi

144

1

0

17 Oct 2025

KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models

KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models

142

0

0

17 Oct 2025

Planner and Executor: Collaboration between Discrete Diffusion And Autoregressive Models in Reasoning

Planner and Executor: Collaboration between Discrete Diffusion And Autoregressive Models in Reasoning

Muhammad Abdullah Sohail

172

0

0

17 Oct 2025

Predicting Task Performance with Context-aware Scaling Laws

Predicting Task Performance with Context-aware Scaling Laws

Kyle Montgomery

Michael Bendersky

128

1

0

16 Oct 2025

Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries

Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries

Badr Youbi Idrissi

Mohammad Pezeshki

Alexia Jolicoeur-Martineau

David Lopez-Paz

122

0

0

16 Oct 2025

MergeMoE: Efficient Compression of MoE Models via Expert Output Merging

MergeMoE: Efficient Compression of MoE Models via Expert Output Merging

164

1

0

16 Oct 2025

RLSR: Reinforcement Learning with Supervised Reward Outperforms SFT in Instruction Following

RLSR: Reinforcement Learning with Supervised Reward Outperforms SFT in Instruction Following

111

0

0

16 Oct 2025

Kelle: Co-design KV Caching and eDRAM for Efficient LLM Serving in Edge Computing

Kelle: Co-design KV Caching and eDRAM for Efficient LLM Serving in Edge Computing

88

1

0

16 Oct 2025

Tahakom LLM Guidelines and Recipes: From Pre-training Data to an Arabic LLM

Tahakom LLM Guidelines and Recipes: From Pre-training Data to an Arabic LLM

Raghad Alshabanah

Shahad Alfawzan

Shuruq Alarefei

...

193

0

0

15 Oct 2025

LLMs Can Get "Brain Rot"!

LLMs Can Get "Brain Rot"!

154

0

0

15 Oct 2025

Selective Adversarial Attacks on LLM Benchmarks

Selective Adversarial Attacks on LLM Benchmarks

Anastasia Orlova

112

0

0

15 Oct 2025

End-to-End Multi-Modal Diffusion Mamba

End-to-End Multi-Modal Diffusion Mamba

130

3

0

15 Oct 2025

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

Ivan Lazarevich

Nish Sinnadurai

Yani Andrew Ioannou

Vithursan Thangarasa

120

1

0

15 Oct 2025

Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models

Sparse Subnetwork Enhancement for Underrepresented Languages in Large Language Models

Daniil Gurgurov

Josef van Genabith

Simon Ostermann

198

0

0

15 Oct 2025

Closing the Gap Between Text and Speech Understanding in LLMs

Closing the Gap Between Text and Speech Understanding in LLMs

Santiago Cuervo

Maureen de Seyssel

Tatiana Likhomanenko

Zakaria Aldeneh

160

2

0

15 Oct 2025

Dr.LLM: Dynamic Layer Routing in LLMs

Dr.LLM: Dynamic Layer Routing in LLMs

335

1

1

14 Oct 2025

OPLoRA: Orthogonal Projection LoRA Prevents Catastrophic Forgetting during Parameter-Efficient Fine-Tuning

OPLoRA: Orthogonal Projection LoRA Prevents Catastrophic Forgetting during Parameter-Efficient Fine-Tuning

476

2

0

14 Oct 2025

CARVQ: Corrective Adaptor with Group Residual Vector Quantization for LLM Embedding Compression

CARVQ: Corrective Adaptor with Group Residual Vector Quantization for LLM Embedding Compression

Nilesh Malpeddi

Gabrielle De Micheli

Prathamesh Vaste

Woo Seong Chung

108

0

0

14 Oct 2025

Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities

Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities

Urs Spiegelhalter

Jorg K. H. Franke

136

0

0

13 Oct 2025

Neural Weight Compression for Language Models

Neural Weight Compression for Language Models

132

0

0

13 Oct 2025

Direct Multi-Token Decoding

Direct Multi-Token Decoding

96

0

0

13 Oct 2025

Deconstructing Attention: Investigating Design Principles for Effective Language Modeling

Deconstructing Attention: Investigating Design Principles for Effective Language Modeling

Nafise Sadat Moosavi

Nikolaos Aletras

120

0

0

13 Oct 2025

ShishuLM: Lightweight Language Model with Hybrid Decoder-MLP Architecture and Paired Weight Sharing

ShishuLM: Lightweight Language Model with Hybrid Decoder-MLP Architecture and Paired Weight Sharing

Shivanshu Kumar

Gopalakrishnan Srinivasan

80

0

0

13 Oct 2025

1 2 3 4 5...37 38 39