Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.06158
Cited By
Language Models can be Logical Solvers
10 November 2023
Jiazhan Feng
Ruochen Xu
Junheng Hao
Hiteshi Sharma
Yelong Shen
Dongyan Zhao
Weizhu Chen
ReLM
LRM
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Language Models can be Logical Solvers"
21 / 21 papers shown
Title
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
Weiye Xu
J. Wang
Weiyun Wang
Zhe Chen
Wengang Zhou
...
Xiaohua Wang
Xizhou Zhu
Wenhai Wang
Jifeng Dai
Jinguo Zhu
VLM
LRM
48
0
0
21 Apr 2025
Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification
Anqi Zhang
Yulin Chen
Jane Pan
Chen Zhao
Aurojit Panda
Jinyang Li
He He
ReLM
LRM
30
2
0
07 Apr 2025
Integrating Expert Knowledge into Logical Programs via LLMs
Franciszek Górski
Oskar Wysocki
Marco Valentino
André Freitas
12
0
0
17 Feb 2025
TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos
Leonardo Plini
Luca Scofano
Edoardo De Matteis
Guido Maria DÁmely di Melendugno
Alessandro Flaborea
Andrea Sanchietti
G. Farinella
Fabio Galasso
Antonino Furnari
EgoV
LRM
32
1
0
04 Nov 2024
Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs
Lei Sun
Xinchen Wang
Youdi Li
RALM
14
0
0
16 Oct 2024
Autoformalization of Game Descriptions using Large Language Models
Agnieszka Mensfelt
Kostas Stathis
Vince Trencsenyi
OffRL
AI4CE
LRM
26
2
0
18 Sep 2024
Can LLMs Reason in the Wild with Programs?
Yuan Yang
Siheng Xiong
Ali Payani
Ehsan Shareghi
Faramarz Fekri
LRM
19
13
0
19 Jun 2024
A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters
Long Hei Matthew Lam
Ramya Keerthy Thatikonda
Ehsan Shareghi
ELM
LRM
25
1
0
01 Jun 2024
PREGO: online mistake detection in PRocedural EGOcentric videos
Alessandro Flaborea
Guido Maria DÁmely di Melendugno
Leonardo Plini
Luca Scofano
Edoardo De Matteis
Antonino Furnari
G. Farinella
Fabio Galasso
EgoV
40
11
0
02 Apr 2024
Can Language Models Pretend Solvers? Logic Code Simulation with LLMs
Minyu Chen
Guoqiang Li
Ling-I Wu
Ruibang Liu
Yuxin Su
Xi Chang
Jianxin Xue
LLMAG
ELM
LRM
16
0
0
24 Mar 2024
PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Zekai Zhang
Yiduo Guo
Yaobo Liang
Dongyan Zhao
Nan Duan
31
1
0
06 Mar 2024
DiLA: Enhancing LLM Tool Learning with Differential Logic Layer
Yu Zhang
Hui-Ling Zhen
Zehua Pei
Yingzhao Lian
Lihao Yin
M. Yuan
Bei Yu
LRM
21
3
0
19 Feb 2024
Puzzle Solving using Reasoning of Large Language Models: A Survey
Panagiotis Giadikiaroglou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ELM
ReLM
LRM
11
24
0
17 Feb 2024
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
Fuheng Zhao
Lawrence Lim
Ishtiyaque Ahmad
D. Agrawal
A. El Abbadi
Amr El Abbadi
39
9
0
16 Dec 2023
Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models
Man Luo
Shrinidhi Kumbhar
Ming shen
Mihir Parmar
Neeraj Varshney
Pratyay Banerjee
Somak Aditya
Chitta Baral
ReLM
ELM
LRM
31
23
0
02 Oct 2023
Synergistic Interplay between Search and Large Language Models for Information Retrieval
Jiazhan Feng
Chongyang Tao
Xiubo Geng
Tao Shen
Can Xu
Guodong Long
Dongyan Zhao
Daxin Jiang
KELM
47
5
0
12 May 2023
Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning
Oyvind Tafjord
Bhavana Dalvi
Peter Clark
ReLM
KELM
LRM
54
52
0
21 Oct 2022
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought
Abulhair Saparov
He He
ELM
LRM
ReLM
116
270
0
03 Oct 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
2,712
0
24 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
3,054
0
23 Jan 2020
1