ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.06729
  4. Cited By
STREET: A Multi-Task Structured Reasoning and Explanation Benchmark

STREET: A Multi-Task Structured Reasoning and Explanation Benchmark

13 February 2023
D. Ribeiro
Shen Wang
Xiaofei Ma
He Zhu
Rui Dong
Deguang Kong
Juliette Burger
Anjelica Ramos
William Yang Wang
Zhiheng Huang
George Karypis
Bing Xiang
Dan Roth
    LRM
    ReLM
ArXivPDFHTML

Papers citing "STREET: A Multi-Task Structured Reasoning and Explanation Benchmark"

23 / 23 papers shown
Title
MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training
MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training
Xinxin You
Xien Liu
Qixin Sun
Huan Zhang
Kaiyin Zhou
Shaohui Liu
Guoping Hu
Shijin Wang
Si Liu
Ji Wu
83
0
0
13 Feb 2025
Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation
Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation
Chengwen Qi
Ren Ma
Bowen Li
He Du
Binyuan Hui
Jinwang Wu
Yuanjun Laili
Conghui He
ReLM
LRM
72
2
0
10 Feb 2025
ExoViP: Step-by-step Verification and Exploration with Exoskeleton
  Modules for Compositional Visual Reasoning
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Y. Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
32
2
0
05 Aug 2024
LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations
LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations
Shashank Kirtania
Priyanshu Gupta
Arjun Radhakirshna
LRM
20
4
0
22 Jun 2024
Bi-Chainer: Automated Large Language Models Reasoning with Bidirectional
  Chaining
Bi-Chainer: Automated Large Language Models Reasoning with Bidirectional Chaining
Shuqi Liu
Bowei He
Linqi Song
LRM
40
1
0
05 Jun 2024
Evaluating Mathematical Reasoning of Large Language Models: A Focus on
  Error Identification and Correction
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
Xiaoyuan Li
Wenjie Wang
Moxin Li
Junrong Guo
Yang Zhang
Fuli Feng
ELM
LRM
33
15
0
02 Jun 2024
Eliciting Better Multilingual Structured Reasoning from LLMs through
  Code
Eliciting Better Multilingual Structured Reasoning from LLMs through Code
Bryan Li
Tamer Alkhouli
Daniele Bonadiman
Nikolaos Pappas
Saab Mansour
LRM
20
7
0
05 Mar 2024
SEER: Facilitating Structured Reasoning and Explanation via
  Reinforcement Learning
SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning
Guoxin Chen
Kexin Tang
Chao Yang
Fuying Ye
Yu Qiao
Yiming Qian
LRM
8
3
0
24 Jan 2024
Evaluating the Rationale Understanding of Critical Reasoning in Logical
  Reading Comprehension
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension
Akira Kawabata
Saku Sugawara
ELM
17
5
0
30 Nov 2023
A Survey on Hallucination in Large Language Models: Principles,
  Taxonomy, Challenges, and Open Questions
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Lei Huang
Weijiang Yu
Weitao Ma
Weihong Zhong
Zhangyin Feng
...
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
LRM
HILM
31
684
0
09 Nov 2023
Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning
  in Large Language Models
Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models
Song Jiang
Zahra Shakeri
Aaron Chan
Maziar Sanjabi
Hamed Firooz
...
Bugra Akyildiz
Yizhou Sun
Jinchao Li
Qifan Wang
Asli Celikyilmaz
LRM
ReLM
20
8
0
07 Oct 2023
Automatically Correcting Large Language Models: Surveying the landscape
  of diverse self-correction strategies
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies
Liangming Pan
Michael Stephen Saxon
Wenda Xu
Deepak Nathani
Xinyi Wang
William Yang Wang
KELM
LRM
31
200
0
06 Aug 2023
Deductive Verification of Chain-of-Thought Reasoning
Deductive Verification of Chain-of-Thought Reasoning
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLM
LRM
17
121
0
06 Jun 2023
Logic-LM: Empowering Large Language Models with Symbolic Solvers for
  Faithful Logical Reasoning
Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
Liangming Pan
Alon Albalak
Xinyi Wang
William Yang Wang
ReLM
LRM
AI4CE
15
230
0
20 May 2023
SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
Xi Ye
Qiaochu Chen
Işıl Dillig
Greg Durrett
ReLM
ReCod
LRM
33
61
0
16 May 2023
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine
  Reading Comprehension
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
30
3
0
05 Sep 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,163
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Explaining Answers with Entailment Trees
Explaining Answers with Entailment Trees
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Zhengnan Xie
Hannah Smith
Leighanna Pipatanangkura
Peter Clark
ReLM
FAtt
LRM
237
184
0
17 Apr 2021
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit
  Reasoning Strategies
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
245
671
0
06 Jan 2021
e-SNLI: Natural Language Inference with Natural Language Explanations
e-SNLI: Natural Language Inference with Natural Language Explanations
Oana-Maria Camburu
Tim Rocktaschel
Thomas Lukasiewicz
Phil Blunsom
LRM
252
618
0
04 Dec 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
Simpler Context-Dependent Logical Forms via Model Projections
Simpler Context-Dependent Logical Forms via Model Projections
R. Long
Panupong Pasupat
Percy Liang
202
102
0
16 Jun 2016
1