1803.05457
Cited By
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
14 March 2018
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
Papers citing
"Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge"
50 / 1,910 papers shown
D4: Improving LLM Pretraining via Document De-Duplication and Diversification
Neural Information Processing Systems (NeurIPS), 2023
Kushal Tirumala
Daniel Simig
Armen Aghajanyan
Ari S. Morcos
SyDa
192
151
0
23 Aug 2023
Exploring Demonstration Ensembling for In-context Learning
Muhammad Khalifa
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Lu Wang
AIMat
167
12
0
17 Aug 2023
Shepherd: A Critic for Language Model Generation
Tianlu Wang
Ping Yu
Xiaoqing Ellen Tan
Sean O'Brien
Ramakanth Pasunuru
Jane Dwivedi-Yu
O. Yu. Golovneva
Luke Zettlemoyer
Maryam Fazel-Zarandi
Asli Celikyilmaz
ALM
207
105
0
08 Aug 2023
RecycleGPT: An Autoregressive Language Model with Recyclable Module
Yu Jiang
Qiaozhi He
Xiaomin Zhuang
Zhihua Wu
Kunpeng Wang
Wenlai Zhao
Guangwen Yang
KELM
278
3
0
07 Aug 2023
LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning
Longteng Zhang
Lin Zhang
Shaoshuai Shi
Xiaowen Chu
Yue Liu
AI4CE
211
161
0
07 Aug 2023
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
Tim Hartill
N. Tan
Michael Witbrock
Patricia J. Riddle
ReLM
KELM
LRM
258
4
0
02 Aug 2023
TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer
Zhen Qin
Dong Li
Weigao Sun
Weixuan Sun
Xuyang Shen
...
Yunshen Wei
Baohong Lv
Xiao Luo
Yu Qiao
Yiran Zhong
190
32
0
27 Jul 2023
Thrust: Adaptively Propels Large Language Models with External Knowledge
Neural Information Processing Systems (NeurIPS), 2023
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Jianshu Chen
KELM
427
5
0
19 Jul 2023
Measuring Faithfulness in Chain-of-Thought Reasoning
Tamera Lanham
Anna Chen
Ansh Radhakrishnan
Benoit Steiner
Carson E. Denison
...
Zac Hatfield-Dodds
Jared Kaplan
J. Brauner
Sam Bowman
Ethan Perez
ReLM
LRM
235
313
0
17 Jul 2023
A Comprehensive Overview of Large Language Models
ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Lin Wang
OffRL
865
1,229
0
12 Jul 2023
Analyzing Multiple-Choice Reading and Listening Comprehension Tests
Vatsal Raina
Adian Liusie
Mark Gales
ELM
217
4
0
03 Jul 2023
Stay on topic with Classifier-Free Guidance
Guillaume Sanchez
Honglu Fan
Alexander Spangher
Elad Levi
Pawan Sasanka Ammanamanchi
Stella Biderman
3DV
241
70
0
30 Jun 2023
SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning
Neural Information Processing Systems (NeurIPS), 2023
Yunxiang Zhang
Xiaojun Wan
AILaw
LRM
232
9
0
21 Jun 2023
A Simple and Effective Pruning Approach for Large Language Models
International Conference on Learning Representations (ICLR), 2023
Mingjie Sun
Zhuang Liu
Anna Bair
J. Zico Kolter
496
659
0
20 Jun 2023
CMMLU: Measuring massive multitask language understanding in Chinese
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Haonan Li
Yixuan Zhang
Fajri Koto
Yifei Yang
Hai Zhao
Yeyun Gong
Nan Duan
Tim Baldwin
ALM
ELM
439
413
0
15 Jun 2023
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Neural Information Processing Systems (NeurIPS), 2023
Hengli Li
Songchun Zhu
Zilong Zheng
163
16
0
15 Jun 2023
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Arnav Chavan
Zhuang Liu
D. K. Gupta
Eric P. Xing
Zhiqiang Shen
319
110
0
13 Jun 2023
Gradient Ascent Post-training Enhances Language Model Generalization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Dongkeun Yoon
Joel Jang
Sungdong Kim
Minjoon Seo
VLM
AI4CE
208
3
0
12 Jun 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Neural Information Processing Systems (NeurIPS), 2023
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
3.2K
6,725
0
09 Jun 2023
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
International Conference on Learning Representations (ICLR), 2023
Yidong Wang
Zhuohao Yu
Zhengran Zeng
Linyi Yang
Cunxiang Wang
...
Yongfeng Zhang
Xingxu Xie
Wei Ye
Shi-Bo Zhang
Yue Zhang
ALM
ELM
472
332
0
08 Jun 2023
K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization
Web Search and Data Mining (WSDM), 2023
Cheng Deng
Tianhang Zhang
Zhongmou He
Yi Xu
Qiyuan Chen
...
Weinan Zhang
Xinbing Wang
Cheng Zhou
Zhouhan Lin
Junxian He
ALM
260
103
0
08 Jun 2023
Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models
Jose Berengueres
Marybeth Sandell
181
0
0
06 Jun 2023
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zechun Liu
Barlas Oğuz
Changsheng Zhao
Ernie Chang
Pierre Stock
Yashar Mehdad
Yangyang Shi
Raghuraman Krishnamoorthi
Vikas Chandra
MQ
263
298
0
29 May 2023
Scaling Data-Constrained Language Models
Neural Information Processing Systems (NeurIPS), 2023
Niklas Muennighoff
Alexander M. Rush
Boaz Barak
Teven Le Scao
Aleksandra Piktus
Nouamane Tazi
S. Pyysalo
Thomas Wolf
Colin Raffel
ALM
687
329
0
25 May 2023
SAIL: Search-Augmented Instruction Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongyin Luo
Yung-Sung Chuang
Yuan Gong
Tianhua Zhang
Yoon Kim
Xixin Wu
D. Fox
Helen Meng
James R. Glass
ALM
LRM
RALM
237
35
0
24 May 2023
On Degrees of Freedom in Defining and Testing Natural Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Saku Sugawara
S. Tsugita
ELM
326
2
0
24 May 2023
Universal Self-Adaptive Prompting
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xingchen Wan
Ruoxi Sun
Hootan Nakhost
H. Dai
Julian Martin Eisenschlos
Sercan O. Arik
Tomas Pfister
LRM
225
13
0
24 May 2023
Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models
International Conference on Learning Representations (ICLR), 2023
Sheng Shen
Le Hou
Yan-Quan Zhou
Nan Du
Shayne Longpre
...
Vincent Zhao
Hongkun Yu
Kurt Keutzer
Trevor Darrell
Denny Zhou
ALM
MoE
468
83
0
24 May 2023
Deduction under Perturbed Evidence: Probing Student Simulation Capabilities of Large Language Models
Shashank Sonkar
Richard G. Baraniuk
126
1
0
23 May 2023
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization
Neural Information Processing Systems (NeurIPS), 2023
Jeonghoon Kim
J. H. Lee
Sungdong Kim
Joonsuk Park
Kang Min Yoo
S. Kwon
Dongsoo Lee
MQ
375
131
0
23 May 2023
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Alfonso Amayuelas
Kyle Wong
Liangming Pan
Wenhu Chen
Wenjie Wang
400
40
0
23 May 2023
RWKV: Reinventing RNNs for the Transformer Era
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bo Peng
Eric Alcaide
Quentin G. Anthony
Alon Albalak
Samuel Arcadinho
...
Qihang Zhao
P. Zhou
Qinghua Zhou
Jian Zhu
Rui-Jie Zhu
583
862
0
22 May 2023
VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models
Dao Xuan-Quy
Le Ngoc-Bich
Vo The-Duy
Phan Xuan-Dung
Ngo Bac-Bien
Nguyen Van-Tien
Nguyen Thi-My-Thanh
Nguyen Hong-Phuoc
141
22
0
20 May 2023
LLM-Pruner: On the Structural Pruning of Large Language Models
Neural Information Processing Systems (NeurIPS), 2023
Xinyin Ma
Gongfan Fang
Xinchao Wang
630
671
0
19 May 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Fangkai Yang
Pu Zhao
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
273
72
0
19 May 2023
A quantitative study of NLP approaches to question difficulty estimation
International Conference on Artificial Intelligence in Education (AIED), 2023
Luca Benedetto
125
17
0
17 May 2023
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hamish Ivison
Wenya Wang
Dianzhuo Wang
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
VLM
299
61
0
05 May 2023
Faithful Question Answering with Monte-Carlo Planning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ruixin Hong
Hongming Zhang
Honghui Zhao
Dong Yu
Changshui Zhang
ReLM
LRM
358
25
0
04 May 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
432
930
0
26 Apr 2023
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Xinyue Shen
Sihao Lin
Michael Backes
Yang Zhang
232
72
0
18 Apr 2023
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain
International Workshop on Health Text Mining and Information Analysis (LOUHI), 2023
Yanis Labrak
Adrien Bazoge
Richard Dufour
Mickael Rouvier
Emmanuel Morin
B. Daille
P. Gourraud
149
43
0
09 Apr 2023
Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster
Nolan Dey
Gurpreet Gosal
Zhiming Chen
Hemant Khachane
William Marshall
Ribhu Pathria
Marvin Tom
Joel Hestness
MoE
LRM
298
124
0
06 Apr 2023
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhiqiang Hu
Lei Wang
Yihuai Lan
Wanyu Xu
Ee-Peng Lim
Lidong Bing
Xing Xu
Soujanya Poria
Roy Ka-wei Lee
ALM
315
383
0
04 Apr 2023
RPTQ: Reorder-based Post-training Quantization for Large Language Models
Zhihang Yuan
Lin Niu
Jia-Wen Liu
Wenyu Liu
Xinggang Wang
Yuzhang Shang
Guangyu Sun
Qiang Wu
Jiaxiang Wu
Bingzhe Wu
MQ
585
113
0
03 Apr 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
684
1,157
0
30 Mar 2023
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
Renrui Zhang
Jiaming Han
Chris Liu
Shiyang Feng
Aojun Zhou
Xiangfei Hu
Shilin Yan
Pan Lu
Jiaming Song
Yu Qiao
MLLM
590
940
0
28 Mar 2023
Natural Language Reasoning, A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM
LRM
320
96
0
26 Mar 2023
Context-faithful Prompting for Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wenxuan Zhou
Sheng Zhang
Hoifung Poon
Muhao Chen
KELM
256
81
0
20 Mar 2023
Machine Learning Approaches in Agile Manufacturing with Recycled Materials for Sustainability
A. Varde
Jianyu Liang
AI4CE
146
4
0
15 Mar 2023
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
International Conference on Learning Representations (ICLR), 2023
Zhen Wang
Yikang Shen
Leonid Karlinsky
Rogerio Feris
Huan Sun
Yoon Kim
VLM
VPVLM
224
151
0
06 Mar 2023