ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.05457
  4. Cited By
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning
  Challenge

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

14 March 2018
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
    ELMRALMLRM
ArXiv (abs)PDFHTML

Papers citing "Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"

50 / 1,882 papers shown
Title
Disentangling Reasoning Capabilities from Language Models with
  Compositional Reasoning Transformers
Disentangling Reasoning Capabilities from Language Models with Compositional Reasoning TransformersAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Wanjun Zhong
Tingting Ma
Jiahai Wang
Jian Yin
Tiejun Zhao
Chin-Yew Lin
Nan Duan
LRMCoGe
158
4
0
20 Oct 2022
Deep Bidirectional Language-Knowledge Graph Pretraining
Deep Bidirectional Language-Knowledge Graph PretrainingNeural Information Processing Systems (NeurIPS), 2022
Michihiro Yasunaga
Antoine Bosselut
Hongyu Ren
Xikun Zhang
Christopher D. Manning
Abigail Z. Jacobs
J. Leskovec
248
240
0
17 Oct 2022
Zero-Shot Learners for Natural Language Understanding via a Unified
  Multiple Choice Perspective
Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice PerspectiveConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ping Yang
Junjie Wang
Ruyi Gan
Xinyu Zhu
Lin Zhang
Ziwei Wu
Xinyu Gao
Jiaxing Zhang
Tetsuya Sakai
BDL
137
29
0
16 Oct 2022
Task Compass: Scaling Multi-task Pre-training with Task Prefix
Task Compass: Scaling Multi-task Pre-training with Task PrefixConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhuosheng Zhang
Shuohang Wang
Yichong Xu
Yuwei Fang
Wenhao Yu
Yang Liu
Han Zhao
Chenguang Zhu
Michael Zeng
SSLLRM
154
19
0
12 Oct 2022
EduQG: A Multi-format Multiple Choice Dataset for the Educational Domain
EduQG: A Multi-format Multiple Choice Dataset for the Educational DomainIEEE Access (IEEE Access), 2022
Amir Hadifar
Semere Kiros Bitew
Johannes Deleu
Chris Develder
Thomas Demeester
AI4Ed
141
23
0
12 Oct 2022
Rainier: Reinforced Knowledge Introspector for Commonsense Question
  Answering
Rainier: Reinforced Knowledge Introspector for Commonsense Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hamish Ivison
Skyler Hallinan
Ximing Lu
Pengfei He
Sean Welleck
Hannaneh Hajishirzi
Yejin Choi
RALM
230
62
0
06 Oct 2022
Guess the Instruction! Flipped Learning Makes Language Models Stronger
  Zero-Shot Learners
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot LearnersInternational Conference on Learning Representations (ICLR), 2022
Seonghyeon Ye
Doyoung Kim
Joel Jang
Joongbo Shin
Minjoon Seo
FedMLVLMUQCVLRM
371
25
0
06 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
GLM-130B: An Open Bilingual Pre-trained ModelInternational Conference on Learning Representations (ICLR), 2022
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng Zhang
Yuxiao Dong
Jie Tang
BDLLRM
690
1,206
0
05 Oct 2022
Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Knowledge Unlearning for Mitigating Privacy Risks in Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Joel Jang
Dongkeun Yoon
Sohee Yang
Sungmin Cha
Moontae Lee
Lajanugen Logeswaran
Minjoon Seo
KELMPILMMU
418
331
0
04 Oct 2022
Can Large Language Models Truly Understand Prompts? A Case Study with
  Negated Prompts
Can Large Language Models Truly Understand Prompts? A Case Study with Negated Prompts
Joel Jang
Seonghyeon Ye
Minjoon Seo
ELMLRM
231
77
0
26 Sep 2022
Dynamic Relevance Graph Network for Knowledge-Aware Question Answering
Dynamic Relevance Graph Network for Knowledge-Aware Question AnsweringInternational Conference on Computational Linguistics (COLING), 2022
Chen Zheng
Parisa Kordjamshidi
119
7
0
20 Sep 2022
Learn to Explain: Multimodal Reasoning via Thought Chains for Science
  Question Answering
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question AnsweringNeural Information Processing Systems (NeurIPS), 2022
Pan Lu
Swaroop Mishra
Tony Xia
Liang Qiu
Kai-Wei Chang
Song-Chun Zhu
Oyvind Tafjord
Peter Clark
Ashwin Kalyan
ELMReLMLRM
538
1,813
0
20 Sep 2022
Interactive Question Answering Systems: Literature Review
Interactive Question Answering Systems: Literature ReviewACM Computing Surveys (ACM CSUR), 2022
Giovanni Maria Biancofiore
Yashar Deldjoo
Tommaso Di Noia
E. Sciascio
Fedelucio Narducci
351
36
0
04 Sep 2022
Faithful Reasoning Using Large Language Models
Faithful Reasoning Using Large Language Models
Antonia Creswell
Murray Shanahan
ReLMLRM
166
137
0
30 Aug 2022
Going Beyond Approximation: Encoding Constraints for Explainable
  Multi-hop Inference via Differentiable Combinatorial Solvers
Going Beyond Approximation: Encoding Constraints for Explainable Multi-hop Inference via Differentiable Combinatorial Solvers
Mokanarangan Thayaparan
Marco Valentino
André Freitas
136
0
0
05 Aug 2022
Few-shot Adaptation Works with UnpredicTable Data
Few-shot Adaptation Works with UnpredicTable DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Jun Shern Chan
Michael Pieler
Jonathan Jao
Jérémy Scheurer
Ethan Perez
390
6
0
01 Aug 2022
Rationale-Augmented Ensembles in Language Models
Rationale-Augmented Ensembles in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Denny Zhou
ReLMLRM
219
135
0
02 Jul 2022
Modern Question Answering Datasets and Benchmarks: A Survey
Modern Question Answering Datasets and Benchmarks: A Survey
Zhen Wang
156
29
0
30 Jun 2022
Language Models are General-Purpose Interfaces
Language Models are General-Purpose Interfaces
Y. Hao
Haoyu Song
Li Dong
Shaohan Huang
Zewen Chi
Wenhui Wang
Shuming Ma
Furu Wei
MLLM
179
108
0
13 Jun 2022
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
CoSe-Co: Text Conditioned Generative CommonSense ContextualizerNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Rachit Bansal
Milan Aggarwal
S. Bhatia
Jivat Neet Kaur
Balaji Krishnamurthy
138
4
0
12 Jun 2022
Eliciting and Understanding Cross-Task Skills with Task-Level
  Mixture-of-Experts
Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-ExpertsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Qinyuan Ye
Juan Zha
Xiang Ren
MoE
251
16
0
25 May 2022
UL2: Unifying Language Learning Paradigms
UL2: Unifying Language Learning ParadigmsInternational Conference on Learning Representations (ICLR), 2022
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
486
357
0
10 May 2022
METGEN: A Module-Based Entailment Tree Generation Framework for Answer
  Explanation
METGEN: A Module-Based Entailment Tree Generation Framework for Answer Explanation
Ruixin Hong
Hongming Zhang
Xintong Yu
Changshui Zhang
ReLMLRM
185
37
0
05 May 2022
OPT: Open Pre-trained Transformer Language Models
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLMOSLMAI4CE
811
4,320
0
02 May 2022
Clues Before Answers: Generation-Enhanced Multiple-Choice QA
Clues Before Answers: Generation-Enhanced Multiple-Choice QANorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Zixian Huang
Ao Wu
Jiaying Zhou
Yu Gu
Yue Zhao
Gong Cheng
128
29
0
30 Apr 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
330
938
0
14 Apr 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning
  from Human Feedback
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
817
3,418
0
12 Apr 2022
What Language Model Architecture and Pretraining Objective Work Best for
  Zero-Shot Generalization?
What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?International Conference on Machine Learning (ICML), 2022
Thomas Wang
Adam Roberts
Daniel Hesslow
Teven Le Scao
Hyung Won Chung
Iz Beltagy
Julien Launay
Colin Raffel
262
211
0
12 Apr 2022
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning
  Tasks
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Peter Clark
Chitta Baral
Ashwin Kalyan
AIMatReLMELMLRM
224
121
0
12 Apr 2022
Metaethical Perspectives on 'Benchmarking' AI Ethics
Metaethical Perspectives on 'Benchmarking' AI EthicsAI and Ethics (AE), 2022
Travis LaCroix
A. Luccioni
99
10
0
11 Apr 2022
PaLM: Scaling Language Modeling with Pathways
PaLM: Scaling Language Modeling with PathwaysJournal of machine learning research (JMLR), 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILMLRM
1.2K
7,320
0
05 Apr 2022
REx: Data-Free Residual Quantization Error Expansion
REx: Data-Free Residual Quantization Error ExpansionNeural Information Processing Systems (NeurIPS), 2022
Edouard Yvinec
Arnaud Dapgony
Matthieu Cord
Kévin Bailly
MQ
310
9
0
28 Mar 2022
MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical
  domain Question Answering
MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question AnsweringACM Conference on Health, Inference, and Learning (ACM CHIL), 2022
Ankit Pal
Logesh Kumar Umapathi
Malaikannan Sankarasubbu
ELMLM&MA
381
504
0
27 Mar 2022
Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic
  Dataset for Narrative Comprehension
Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative ComprehensionAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Ying Xu
Dakuo Wang
Mo Yu
Daniel E. Ritchie
Bingsheng Yao
...
Xiaojuan Ma
Diyi Yang
Nanyun Peng
Zhou Yu
M. Warschauer
AI4Ed
185
123
0
26 Mar 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLMBDLLRMAI4CE
1.6K
5,319
0
21 Mar 2022
E-KAR: A Benchmark for Rationalizing Natural Language Analogical
  Reasoning
E-KAR: A Benchmark for Rationalizing Natural Language Analogical ReasoningFindings (Findings), 2022
Jiangjie Chen
Rui Xu
Ziquan Fu
Wei Shi
Zhongqiao Li
Xinbo Zhang
Changzhi Sun
Lei Li
Yanghua Xiao
Hao Zhou
ELM
126
46
0
16 Mar 2022
ScienceWorld: Is your Agent Smarter than a 5th Grader?
ScienceWorld: Is your Agent Smarter than a 5th Grader?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ruoyao Wang
Peter Alexander Jansen
Marc-Alexandre Côté
Prithviraj Ammanabrolu
LLMAGReLMLRM
302
164
0
14 Mar 2022
ILDAE: Instance-Level Difficulty Analysis of Evaluation Data
ILDAE: Instance-Level Difficulty Analysis of Evaluation DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Neeraj Varshney
Swaroop Mishra
Chitta Baral
147
21
0
07 Mar 2022
Feeding What You Need by Understanding What You Learned
Feeding What You Need by Understanding What You LearnedAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiaoqiang Wang
Bang Liu
Fangli Xu
Bowei Long
Siliang Tang
Lingfei Wu
163
6
0
05 Mar 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning
  Work?
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Sewon Min
Xinxi Lyu
Ari Holtzman
Mikel Artetxe
M. Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
LLMAGLRM
480
1,779
0
25 Feb 2022
UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training
UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training
Daniel Khashabi
Yeganeh Kordi
Hannaneh Hajishirzi
221
73
0
23 Feb 2022
ST-MoE: Designing Stable and Transferable Sparse Expert Models
ST-MoE: Designing Stable and Transferable Sparse Expert Models
Barret Zoph
Irwan Bello
Sameer Kumar
Nan Du
Yanping Huang
J. Dean
Noam M. Shazeer
W. Fedus
MoE
374
292
0
17 Feb 2022
Pirá: A Bilingual Portuguese-English Dataset for Question-Answering
  about the Ocean
Pirá: A Bilingual Portuguese-English Dataset for Question-Answering about the OceanInternational Conference on Information and Knowledge Management (CIKM), 2021
André F. A. Paschoal
Paulo Pirozelli
Valdinei Freire
K. V. Delgado
S. M. Peres
...
Flávio Nakasato
A. Oliveira
A. Brandão
A. H. R. Costa
Fabio Gagliardi Cozman
RALM
102
18
0
04 Feb 2022
Unified Question Generation with Continual Lifelong Learning
Unified Question Generation with Continual Lifelong LearningThe Web Conference (WWW), 2022
Wei Yuan
Hongzhi Yin
Tieke He
Tong Chen
Qiufeng Wang
Li-zhen Cui
207
11
0
24 Jan 2022
Leaf: Multiple-Choice Question Generation
Leaf: Multiple-Choice Question GenerationEuropean Conference on Information Retrieval (ECIR), 2022
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
220
28
0
22 Jan 2022
A Survey on non-English Question Answering Dataset
A Survey on non-English Question Answering Dataset
Andrea Chandra
Affandy Fahrizain
Ibrahim
Simon Willyanto Laufried
205
12
0
27 Dec 2021
An Inference Approach To Question Answering Over Knowledge Graphs
An Inference Approach To Question Answering Over Knowledge Graphs
Aayushee Gupta
K. Annervaz
Ambedkar Dukkipati
Shubhashis Sengupta
85
0
0
21 Dec 2021
Few-shot Learning with Multilingual Language Models
Few-shot Learning with Multilingual Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xi Lin
Todor Mihaylov
Mikel Artetxe
Tianlu Wang
Shuohui Chen
...
Luke Zettlemoyer
Zornitsa Kozareva
Mona T. Diab
Ves Stoyanov
Xian Li
BDLELMLRM
265
351
0
20 Dec 2021
ActKnow: Active External Knowledge Infusion Learning for Question
  Answering in Low Data Regime
ActKnow: Active External Knowledge Infusion Learning for Question Answering in Low Data Regime
K. Annervaz
Pritam Kumar Nath
Ambedkar Dukkipati
RALM
92
1
0
17 Dec 2021
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Nan Du
Yanping Huang
Andrew M. Dai
Simon Tong
Dmitry Lepikhin
...
Kun Zhang
Quoc V. Le
Yonghui Wu
Zhiwen Chen
Claire Cui
ALMMoE
613
1,025
0
13 Dec 2021
Previous
123...3435363738
Next