2307.05113
Piecing Together Clues: A Benchmark for Evaluating the Detective Skills of Large Language Models
11 July 2023
Zhouhong Gu
Lin Zhang
Jiangjie Chen
Haoning Ye
Xiaoxuan Zhu
Zihan Li
Jianchen Wang
Yikai Zhang
Wenhao Huang
Yanghua Xiao
Hongwei Feng
RALM
ELM
Papers citing
"Piecing Together Clues: A Benchmark for Evaluating the Detective Skills of Large Language Models"
Complexity-Based Prompting for Multi-Step Reasoning
Yao Fu
Hao-Chun Peng
Ashish Sabharwal
Peter Clark
Tushar Khot
ReLM
LRM
162
411
0
03 Oct 2022
INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
Zeqiu Wu
Ryu Parish
Hao Cheng
Sewon Min
Prithviraj Ammanabrolu
Mari Ostendorf
Hannaneh Hajishirzi
65
14
0
02 Jul 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,217
0
21 Mar 2022
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
245
671
0
06 Jan 2021