Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.03350
Cited By
Measuring and Narrowing the Compositionality Gap in Language Models
7 October 2022
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLM
KELM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Measuring and Narrowing the Compositionality Gap in Language Models"
50 / 419 papers shown
Title
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
Shubhra (Santu) Karmaker
Dongji Feng
25
48
0
19 May 2023
Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions
Shrestha Mohanty
Negar Arabzadeh
Julia Kiseleva
Artem Zholus
Milagro Teruel
Ahmed Hassan Awadallah
Yuxuan Sun
Kavya Srinet
Arthur Szlam
LM&Ro
11
10
0
18 May 2023
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling
Weijia Xu
Andrzej Banburski-Fahey
Nebojsa Jojic
ReLM
LRM
14
32
0
17 May 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
Shangbin Feng
Weijia Shi
Yuyang Bai
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
KELM
45
28
0
17 May 2023
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
Shangbin Feng
Vidhisha Balachandran
Yuyang Bai
Yulia Tsvetkov
KELM
HILM
11
37
0
14 May 2023
HPE:Answering Complex Questions over Text by Hybrid Question Parsing and Execution
Ye Liu
Semih Yavuz
Rui Meng
Dragomir R. Radev
Caiming Xiong
Yingbo Zhou
24
8
0
12 May 2023
Active Retrieval Augmented Generation
Zhengbao Jiang
Frank F. Xu
Luyu Gao
Zhiqing Sun
Qian Liu
Jane Dwivedi-Yu
Yiming Yang
Jamie Callan
Graham Neubig
RALM
9
248
0
11 May 2023
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
Xiaonan Li
Xipeng Qiu
ReLM
KELM
LRM
AI4MH
9
32
0
09 May 2023
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Gibbeum Lee
Volker Hartmann
Jongho Park
Dimitris Papailiopoulos
Kangwook Lee
19
62
0
08 May 2023
Artificial Neuropsychology: Are Large Language Models Developing Executive Functions?
H. Vázquez
ELM
LLMAG
11
0
0
06 May 2023
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-Wei Lee
Ee-Peng Lim
ReLM
LRM
15
308
0
06 May 2023
Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects
Kehui Tan
Tianqi Pang
Chenyou Fan
Song Yu
19
13
0
05 May 2023
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework
Ruochen Zhao
Xingxuan Li
Shafiq R. Joty
Chengwei Qin
Lidong Bing
LRM
KELM
13
152
0
05 May 2023
Learning to Reason and Memorize with Self-Notes
Jack Lanchantin
Shubham Toshniwal
Jason Weston
Arthur Szlam
Sainbayar Sukhbaatar
ReLM
LRM
LLMAG
85
27
0
01 May 2023
Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks
Shicheng Xu
Liang Pang
Huawei Shen
Xueqi Cheng
Tat-Seng Chua
RALM
KELM
LRM
27
37
0
28 Apr 2023
q2d: Turning Questions into Dialogs to Teach Models How to Search
Yonatan Bitton
Shlomi Cohen-Ganor
Ido Hakimi
Yoad Lewenberg
Roee Aharoni
Enav Weinreb
34
4
0
27 Apr 2023
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Ori Yoran
Tomer Wolfson
Ben Bogin
Uri Katz
Daniel Deutch
Jonathan Berant
ReLM
LRM
KELM
11
94
0
25 Apr 2023
ChatLLM Network: More brains, More intelligence
Rui Hao
Linmei Hu
Weijian Qi
Qingliu Wu
Yirui Zhang
Liqiang Nie
LLMAG
ALM
LRM
8
34
0
24 Apr 2023
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
Jiashuo Sun
Yi Luo
Yeyun Gong
Chen Lin
Yelong Shen
Jian Guo
Nan Duan
LRM
25
19
0
23 Apr 2023
Tool Learning with Foundation Models
Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
...
Cheng Yang
Tongshuang Wu
Heng Ji
Zhiyuan Liu
Maosong Sun
14
196
0
17 Apr 2023
ChemCrow: Augmenting large-language models with chemistry tools
Andres M Bran
Sam Cox
Oliver Schilter
Carlo Baldassari
Andrew D. White
P. Schwaller
LLMAG
8
347
0
11 Apr 2023
Exploring Effective Factors for Improving Visual In-Context Learning
Yanpeng Sun
Qiang Chen
Jian Wang
Jingdong Wang
Zechao Li
LRM
VLM
41
17
0
10 Apr 2023
Evaluating GPT-4 and ChatGPT on Japanese Medical Licensing Examinations
Jungo Kasai
Y. Kasai
Keisuke Sakaguchi
Yutaro Yamada
Dragomir R. Radev
LM&MA
ELM
20
98
0
31 Mar 2023
Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
...
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
32
1,389
0
30 Mar 2023
Language Models can Solve Computer Tasks
Geunwoo Kim
Pierre Baldi
Stephen Marcus McAleer
LLMAG
LM&Ro
12
336
0
30 Mar 2023
Natural Language Reasoning, A Survey
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM
LRM
28
49
0
26 Mar 2023
GPT is becoming a Turing machine: Here are some ways to program it
A. Jojic
Zhen Wang
Nebojsa Jojic
LRM
37
17
0
25 Mar 2023
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Yushi Hu
Benlin Liu
Jungo Kasai
Yizhong Wang
Mari Ostendorf
Ranjay Krishna
Noah A. Smith
EGVM
16
116
0
21 Mar 2023
Language Model Behavior: A Comprehensive Survey
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
27
100
0
20 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Renze Lou
Kai Zhang
Wenpeng Yin
ALM
LRM
19
19
0
18 Mar 2023
ART: Automatic multi-step reasoning and tool-use for large language models
Bhargavi Paranjape
Scott M. Lundberg
Sameer Singh
Hannaneh Hajishirzi
Luke Zettlemoyer
Marco Tulio Ribeiro
KELM
ReLM
LRM
11
138
0
16 Mar 2023
Query2doc: Query Expansion with Large Language Models
Liang Wang
Nan Yang
Furu Wei
81
96
0
14 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro
OffRL
LRM
AI4CE
87
148
0
07 Mar 2023
CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification
Seungone Kim
Se June Joo
Yul Jang
Hyungjoo Chae
Jinyoung Yeo
LRM
6
12
0
07 Mar 2023
OpenICL: An Open-Source Framework for In-context Learning
Zhenyu Wu
Yaoxiang Wang
Jiacheng Ye
Jiangtao Feng
Jingjing Xu
Yu Qiao
Zhiyong Wu
21
49
0
06 Mar 2023
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers
Tianlong Chen
Zhenyu (Allen) Zhang
Ajay Jaiswal
Shiwei Liu
Zhangyang Wang
MoE
14
46
0
02 Mar 2023
Understanding Natural Language Understanding Systems. A Critical Analysis
Alessandro Lenci
ELM
26
12
0
01 Mar 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
19
15
0
17 Feb 2023
Augmented Language Models: a Survey
Grégoire Mialon
Roberto Dessì
Maria Lomeli
Christoforos Nalmpantis
Ramakanth Pasunuru
...
Jane Dwivedi-Yu
Asli Celikyilmaz
Edouard Grave
Yann LeCun
Thomas Scialom
LRM
KELM
16
362
0
15 Feb 2023
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models
Zhihong Shao
Yeyun Gong
Yelong Shen
Minlie Huang
Nan Duan
Weizhu Chen
ReLM
LRM
18
67
0
01 Feb 2023
Large Language Models Can Be Easily Distracted by Irrelevant Context
Freda Shi
Xinyun Chen
Kanishka Misra
Nathan Scales
David Dohan
Ed H. Chi
Nathanael Scharli
Denny Zhou
ReLM
RALM
LRM
19
522
0
31 Jan 2023
What Makes Good Examples for Visual In-Context Learning?
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
MLLM
VPVLM
VLM
LRM
6
107
0
31 Jan 2023
ThoughtSource: A central hub for large language model reasoning data
Simon Ott
Konstantin Hebenstreit
Valentin Liévin
C. Hother
M. Moradi
Maximilian Mayrhauser
Robert Praas
Ole Winther
Matthias Samwald
ReLM
LRM
12
41
0
27 Jan 2023
Causal Reasoning of Entities and Events in Procedural Texts
Li Zhang
Hainiu Xu
Yue Yang
Shuyan Zhou
Weiqiu You
Manni Arora
Chris Callison-Burch
ReLM
LRM
13
35
0
26 Jan 2023
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELM
ReLM
21
205
0
16 Jan 2023
Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes
Justin Reppert
Ben Rachbach
Charlie George
Luke Stebbing
Ju-Seung Byun
Maggie Appleton
Andreas Stuhlmuller
ReLM
LRM
22
16
0
04 Jan 2023
A Survey on In-context Learning
Qingxiu Dong
Lei Li
Damai Dai
Ce Zheng
Jingyuan Ma
...
Zhiyong Wu
Baobao Chang
Xu Sun
Lei Li
Zhifang Sui
ReLM
AIMat
20
443
0
31 Dec 2022
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Omar Khattab
Keshav Santhanam
Xiang Lisa Li
David Leo Wright Hall
Percy Liang
Christopher Potts
Matei A. Zaharia
RALM
KELM
14
243
0
28 Dec 2022
Contrastive Distillation Is a Sample-Efficient Self-Supervised Loss Policy for Transfer Learning
Christopher T. Lengerich
Gabriel Synnaeve
Amy Zhang
Hugh Leather
Kurt Shuster
Franccois Charton
Charysse Redwood
SSL
OffRL
14
1
0
21 Dec 2022
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
KELM
RALM
LRM
6
371
0
20 Dec 2022
Previous
1
2
3
4
5
6
7
8
9
Next