Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

14 March 2018

Oyvind Tafjord

Papers citing "Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"

50 / 1,910 papers shown

D4: Improving LLM Pretraining via Document De-Duplication and DiversificationNeural Information Processing Systems (NeurIPS), 2023

192

151

23 Aug 2023

Exploring Demonstration Ensembling for In-context Learning

167

17 Aug 2023

Shepherd: A Critic for Language Model Generation

Luke Zettlemoyer

207

105

08 Aug 2023

RecycleGPT: An Autoregressive Language Model with Recyclable Module

278

07 Aug 2023

LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning

211

161

07 Aug 2023

Teaching Smaller Language Models To Generalise To Unseen Compositional Questions

258

02 Aug 2023

TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer

Zhen Qin

...

Xiao Luo

Yu Qiao

Yiran Zhong

190

27 Jul 2023

Thrust: Adaptively Propels Large Language Models with External KnowledgeNeural Information Processing Systems (NeurIPS), 2023

Wenlin Yao

427

19 Jul 2023

Measuring Faithfulness in Chain-of-Thought Reasoning

...

235

313

17 Jul 2023

A Comprehensive Overview of Large Language ModelsACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023

Saeed Anwar

Muhammad Usman

865

1,229

12 Jul 2023

Analyzing Multiple-Choice Reading and Listening Comprehension Tests

217

03 Jul 2023

Stay on topic with Classifier-Free Guidance

Pawan Sasanka Ammanamanchi

Stella Biderman

3DV

241

30 Jun 2023

SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense ReasoningNeural Information Processing Systems (NeurIPS), 2023

Yunxiang Zhang

Xiaojun Wan

AILaw LRM

232

21 Jun 2023

A Simple and Effective Pruning Approach for Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

Mingjie Sun

Zhuang Liu

Anna Bair

J. Zico Kolter

496

659

20 Jun 2023

CMMLU: Measuring massive multitask language understanding in ChineseAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

439

413

15 Jun 2023

DiPlomat: A Dialogue Dataset for Situated Pragmatic ReasoningNeural Information Processing Systems (NeurIPS), 2023

Hengli Li

Songchun Zhu

Zilong Zheng

163

15 Jun 2023

One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning

319

110

13 Jun 2023

Gradient Ascent Post-training Enhances Language Model GeneralizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

208

12 Jun 2023

Judging LLM-as-a-Judge with MT-Bench and Chatbot ArenaNeural Information Processing Systems (NeurIPS), 2023

...

3.2K

6,725

09 Jun 2023

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning OptimizationInternational Conference on Learning Representations (ICLR), 2023

...

Yue Zhang

472

332

08 Jun 2023

K2: A Foundation Language Model for Geoscience Knowledge Understanding and UtilizationWeb Search and Data Mining (WSDM), 2023

Cheng Deng

...

Xinbing Wang

260

103

08 Jun 2023

Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models

Jose Berengueres

Marybeth Sandell

181

06 Jun 2023

LLM-QAT: Data-Free Quantization Aware Training for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Yashar Mehdad

Raghuraman Krishnamoorthi

Vikas Chandra

263

298

29 May 2023

Scaling Data-Constrained Language ModelsNeural Information Processing Systems (NeurIPS), 2023

687

329

25 May 2023

SAIL: Search-Augmented Instruction LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

237

24 May 2023

On Degrees of Freedom in Defining and Testing Natural Language UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Saku Sugawara

S. Tsugita

ELM

326

24 May 2023

Universal Self-Adaptive PromptingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xingchen Wan

Ruoxi Sun

Hootan Nakhost

H. Dai

Julian Martin Eisenschlos

Sercan O. Arik

Tomas Pfister

LRM

225

24 May 2023

Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

...

468

24 May 2023

Deduction under Perturbed Evidence: Probing Student Simulation Capabilities of Large Language Models

Shashank Sonkar

Richard G. Baraniuk

126

23 May 2023

Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer QuantizationNeural Information Processing Systems (NeurIPS), 2023

375

131

23 May 2023

Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

400

23 May 2023

RWKV: Reinventing RNNs for the Transformer EraConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

...

Rui-Jie Zhu

583

862

22 May 2023

VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

141

20 May 2023

LLM-Pruner: On the Structural Pruning of Large Language ModelsNeural Information Processing Systems (NeurIPS), 2023

Xinyin Ma

Gongfan Fang

Xinchao Wang

630

671

19 May 2023

Empower Large Language Model to Perform Better on Industrial Domain-Specific Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Fangkai Yang

Zezhong Wang

273

19 May 2023

A quantitative study of NLP approaches to question difficulty estimationInternational Conference on Artificial Intelligence in Education (AIED), 2023

Luca Benedetto

125

17 May 2023

Vera: A General-Purpose Plausibility Estimation Model for Commonsense StatementsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yejin Choi

299

05 May 2023

Faithful Question Answering with Monte-Carlo PlanningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

358

04 May 2023

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and BeyondACM Transactions on Knowledge Discovery from Data (TKDD), 2023

432

930

26 Apr 2023

In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT

Xinyue Shen

Sihao Lin

Michael Backes

Yang Zhang

232

18 Apr 2023

FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domainInternational Workshop on Health Text Mining and Information Analysis (LOUHI), 2023

149

09 Apr 2023

Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster

298

124

06 Apr 2023

LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Lei Wang

315

383

04 Apr 2023

RPTQ: Reorder-based Post-training Quantization for Large Language Models

585

113

03 Apr 2023

BloombergGPT: A Large Language Model for Finance

684

1,157

30 Mar 2023

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

Yu Qiao

590

940

28 Mar 2023

Natural Language Reasoning, A SurveyACM Computing Surveys (ACM Comput. Surv.), 2023

Hongbo Zhang

320

26 Mar 2023

Context-faithful Prompting for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

256

20 Mar 2023

Machine Learning Approaches in Agile Manufacturing with Recycled Materials for Sustainability

A. Varde

Jianyu Liang

AI4CE

146

15 Mar 2023

Multitask Prompt Tuning Enables Parameter-Efficient Transfer LearningInternational Conference on Learning Representations (ICLR), 2023

Huan Sun

224

151

06 Mar 2023