Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2002.05867
Cited By

Transformers as Soft Reasoners over Language

v1v2 (latest)

Transformers as Soft Reasoners over Language

International Joint Conference on Artificial Intelligence (IJCAI), 2020

14 February 2020

Oyvind Tafjord

Kyle Richardson

ArXiv (abs)PDF HTML

Papers citing "Transformers as Soft Reasoners over Language"

50 / 258 papers shown

Efficient PRM Training Data Synthesis via Formal Verification

Efficient PRM Training Data Synthesis via Formal Verification

Sarkar Snigdha Sarathi Das

Wenpeng Yin

Rui Zhang

356

2

0

10 Apr 2026

ARCHE: A Novel Task to Evaluate LLMs on Latent Reasoning Chain Extraction

ARCHE: A Novel Task to Evaluate LLMs on Latent Reasoning Chain Extraction

234

1

0

16 Nov 2025

DecompSR: A dataset for decomposed analyses of compositional multihop spatial reasoning

DecompSR: A dataset for decomposed analyses of compositional multihop spatial reasoning

Lachlan McPheat

Robert E Blackwell

Alessandra Russo

Pranava Madhyastha

319

0

0

04 Nov 2025

Normative Reasoning in Large Language Models: A Comparative Benchmark from Logical and Modal Perspectives

Normative Reasoning in Large Language Models: A Comparative Benchmark from Logical and Modal Perspectives

Takanobu Morishita

Mitsuhiro Okada

179

1

0

30 Oct 2025

Are Language Models Efficient Reasoners? A Perspective from Logic Programming

Are Language Models Efficient Reasoners? A Perspective from Logic Programming

Yanick Zengaffinen

Haruki Shirakami

Mrinmaya Sachan

Abulhair Saparov

Bernhard Schölkopf

209

0

0

29 Oct 2025

RiddleBench: A New Generative Reasoning Benchmark for LLMs

RiddleBench: A New Generative Reasoning Benchmark for LLMs

Thanmay Jayakumar

Ratish Puduppully

Anoop Kunchukuttan

311

1

0

28 Oct 2025

Chain of Execution Supervision Promotes General Reasoning in Large Language Models

Chain of Execution Supervision Promotes General Reasoning in Large Language Models

154

1

0

24 Oct 2025

Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations

Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations

Shahin Atakishiyev

Nawshad Farruque

Teruaki Hayashi

...

Osmar R. Zaïane

215

2

0

20 Oct 2025

Which Word Orders Facilitate Length Generalization in LMs? An Investigation with GCG-Based Artificial Languages

Which Word Orders Facilitate Length Generalization in LMs? An Investigation with GCG-Based Artificial Languages

Nadine El-Naggar

Tatsuki Kuribayashi

137

1

0

14 Oct 2025

LogiNumSynth: Synthesizing Joint Logical-Numerical Reasoning Problems for Language Models

LogiNumSynth: Synthesizing Joint Logical-Numerical Reasoning Problems for Language Models

89

1

0

13 Oct 2025

A Layered Intuition -- Method Model with Scope Extension for LLM Reasoning

A Layered Intuition -- Method Model with Scope Extension for LLM Reasoning

107

3

0

12 Oct 2025

Toward Mechanistic Explanation of Deductive Reasoning in Language Models

Toward Mechanistic Explanation of Deductive Reasoning in Language Models

ReLM LRM ELM AI4CE

205

1

0

10 Oct 2025

Two-Stage Voting for Robust and Efficient Suicide Risk Detection on Social Media

Two-Stage Voting for Robust and Efficient Suicide Risk Detection on Social Media

César Escobar-Viera

Candice Biernesser

173

0

0

09 Oct 2025

ExPrESSO: Zero-Knowledge backed Extensive Privacy Preserving Single Sign-on

ExPrESSO: Zero-Knowledge backed Extensive Privacy Preserving Single Sign-on

Kaustabh Barman

Sanjeet Raj Pandey

154

0

0

09 Oct 2025

Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning

Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning

Mubashara Akhtar

Mrinmaya Sachan

182

0

0

28 Sep 2025

DivLogicEval: A Framework for Benchmarking Logical Reasoning Evaluation in Large Language Models

DivLogicEval: A Framework for Benchmarking Logical Reasoning Evaluation in Large Language Models

339

3

0

19 Sep 2025

Rethinking Reasoning Quality in Large Language Models through Enhanced Chain-of-Thought via RL

Rethinking Reasoning Quality in Large Language Models through Enhanced Chain-of-Thought via RL

212

2

0

07 Sep 2025

Perturbing the Derivative: Wild Refitting for Model-Free Evaluation of Machine Learning Models under Bregman Losses

Perturbing the Derivative: Wild Refitting for Model-Free Evaluation of Machine Learning Models under Bregman Losses

David Simchi-Levi

520

0

0

02 Sep 2025

Efficient Graph Understanding with LLMs via Structured Context Injection

Efficient Graph Understanding with LLMs via Structured Context Injection

Govind V Waghmare

Srikanta Bedathur

163

1

0

31 Aug 2025

NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks

NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks

Swapnanil Mukherjee

Deepanway Ghosal

136

0

0

27 Aug 2025

Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models

Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Tharindu Madusanka

Ian Pratt-Hartmann

Riza Batista-Navarro

128

4

0

23 Aug 2025

Reasoning is about giving reasons

Reasoning is about giving reasons

68

0

0

20 Aug 2025

Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules

Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules

Nasim Shirvani-Mahdavi

110

0

0

14 Aug 2025

Punctuation and Predicates in Language Models

Punctuation and Predicates in Language Models

Sonakshi Chauhan

Maheep Chaudhary

Samuel Nellessen

242

4

0

11 Aug 2025

Rule2Text: Natural Language Explanation of Logical Rules in Knowledge Graphs

Rule2Text: Natural Language Explanation of Logical Rules in Knowledge Graphs

Nasim Shirvani-Mahdavi

Devin Wingfield

221

3

0

31 Jul 2025

Can LLMs Solve ASP Problems? Insights from a Benchmarking Study (Extended Version)

Can LLMs Solve ASP Problems? Insights from a Benchmarking Study (Extended Version)

279

1

0

26 Jul 2025

Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?

Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Mohna Chakraborty

Wallapak Tavanapong

256

1

0

21 Jul 2025

Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving

Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving

206

8

0

20 Jun 2025

Theory-Grounded Evaluation of Human-Like Fallacy Patterns in LLM Reasoning

Theory-Grounded Evaluation of Human-Like Fallacy Patterns in LLM Reasoning

Andrew Keenan Richardson

Ryan Othniel Kearns

Vincent Wang-Ma'scianica

Philipp Koralus

192

0

0

10 Jun 2025

Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds

Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds

Ishwar B Balappanawar

Vamshi Krishna Bonagiri

K. Thirunarayan

Ponnurangam Kumaraguru

330

0

0

28 May 2025

INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling

INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling

318

8

0

22 May 2025

TurnaboutLLM: A Deductive Reasoning Benchmark from Detective Games

TurnaboutLLM: A Deductive Reasoning Benchmark from Detective Games

Muhammad Adil Shahid

272

0

0

21 May 2025

SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas

SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas

362

21

0

20 May 2025

Teaching Small Language Models to Learn Logic through Meta-Learning

Teaching Small Language Models to Learn Logic through Meta-Learning

Leonardo Bertolazzi

Manuel Vargas Guzmán

Raffaella Bernardi

345

0

0

20 May 2025

Evaluating the Logical Reasoning Abilities of Large Reasoning Models

Evaluating the Logical Reasoning Abilities of Large Reasoning Models

333

2

0

17 May 2025

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

...

592

28

0

26 Apr 2025

Exploring Compositional Generalization (in COGS/ReCOGS_pos) by Transformers using Restricted Access Sequence Processing (RASP)

Exploring Compositional Generalization (in COGS/ReCOGS_pos) by Transformers using Restricted Access Sequence Processing (RASP)

708

0

0

21 Apr 2025

LogicTree: Structured Proof Exploration for Coherent and Rigorous Logical Reasoning with Large Language Models

LogicTree: Structured Proof Exploration for Coherent and Rigorous Logical Reasoning with Large Language Models

394

0

0

18 Apr 2025

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

529

27

0

13 Apr 2025

Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models

Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models

José P. Pombal

Nuno M. Guerreiro

André F. T. Martins

716

10

0

01 Apr 2025

Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation

Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation

521

0

0

27 Feb 2025

AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models

AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models

332

14

0

24 Feb 2025

Logic Haystacks: Probing LLMs Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding)

Logic Haystacks: Probing LLMs Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding)

377

2

0

24 Feb 2025

Reasoning Bias of Next Token Prediction Training

Reasoning Bias of Next Token Prediction Training

Zhongwang Zhang

Zhi-Qin John Xu

541

3

0

21 Feb 2025

Integrating Expert Knowledge into Logical Programs via LLMs

Integrating Expert Knowledge into Logical Programs via LLMs

Franciszek Górski

Marco Valentino

1.0K

2

0

17 Feb 2025

Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering

Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering

324

1

0

17 Feb 2025

LogiDynamics: Unraveling the Dynamics of Inductive, Abductive and Deductive Logical Inferences in LLM Reasoning

LogiDynamics: Unraveling the Dynamics of Inductive, Abductive and Deductive Logical Inferences in LLM Reasoning

616

11

0

16 Feb 2025

Logical forms complement probability in understanding language model (and human) performance

Logical forms complement probability in understanding language model (and human) performanceAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

455

2

0

13 Feb 2025

Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation

Large Language Models Meet Symbolic Provers for Logical Reasoning EvaluationInternational Conference on Learning Representations (ICLR), 2025

480

29

0

10 Feb 2025

Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning

Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical ReasoningInternational Conference on Neural Information Processing (ICONIP), 2023

Michael Witbrock

679

8

0

20 Jan 2025

Page 1 of 6