ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.16635
21
0

Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models

25 September 2024
Sungjune Park
Daeseon Choi
    LRM
    AILaw
ArXivPDFHTML
Abstract

This paper proposes a novel prompt engineering technique called Judgment of Thought (JoT) that is specifically tailored for binary logical reasoning tasks. JoT employs three roles\unicodex2014\unicode{x2014}\unicodex2014lawyer, prosecutor, and judge\unicodex2014\unicode{x2014}\unicodex2014to facilitate more reliable and accurate reasoning by the model. In this framework, the judge utilizes a high\unicodex2010\unicode{x2010}\unicodex2010level model, while the lawyer and prosecutor utilize low\unicodex2010\unicode{x2010}\unicodex2010level models. This structure helps the judge better understand the responses from both the lawyer and prosecutor, enabling a more accurate judgment. Experimental results on large language model (LLM) benchmark datasets, such as BigBenchHard and Winogrande, demonstrate that JoT outperforms existing methods, including Chain of Thought (CoT) and Self\unicodex2010\unicode{x2010}\unicodex2010Consistency (SC), in binary logical reasoning tasks. Additionally, in real\unicodex2010\unicode{x2010}\unicodex2010world tasks, such as Fake News Detection and SMS Spam Detection, JoT shows comparable or improved performance compared to existing techniques. JoT significantly enhances the accuracy and reliability of models in binary reasoning tasks and show potential for practical applicability across various domains. Future research should aim to further broaden the applicability of JoT and optimize its implementation for real\unicodex2010\unicode{x2010}\unicodex2010world problem\unicodex2010\unicode{x2010}\unicodex2010solving.

View on arXiv
Comments on this paper