ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.09702
  4. Cited By
Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go
  without Hallucination?
v1v2v3 (latest)

Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?

16 November 2023
Bangzheng Li
Ben Zhou
Fei Wang
Xingyu Fu
Dan Roth
Muhao Chen
    HILMLRM
ArXiv (abs)PDFHTML

Papers citing "Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?"

20 / 20 papers shown
Title
Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
Xiao Ye
Jacob Dineen
Zhaonan Li
Zhikun Xu
Weiyu Chen
...
Ji-Eun Irene Yum
Muhammad Ali Khan
Muhammad Umar Afzal
Irbaz B. Riaz
Ben Zhou
LM&MAELM
134
1
0
20 Oct 2025
Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
Yi Liu
Xiangrong Zhu
Xiangyu Liu
Wei Wei
Wei Hu
KELM
72
0
0
09 Sep 2025
MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents
MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents
Tomer Wolfson
H. Trivedi
Mor Geva
Yoav Goldberg
Dan Roth
Tushar Khot
Ashish Sabharwal
Reut Tsarfaty
RALMLRM
249
5
0
15 Aug 2025
CC-LEARN: Cohort-based Consistency Learning
CC-LEARN: Cohort-based Consistency Learning
Xiao Ye
Shaswat Shrivastava
Zhaonan Li
Jacob Dineen
Shijie Lu
Avneet Ahuja
Ming shen
Zhikun Xu
Ben Zhou
OffRLLRM
250
2
0
18 Jun 2025
BOW: Reinforcement Learning for Bottlenecked Next Word Prediction
BOW: Reinforcement Learning for Bottlenecked Next Word Prediction
Ming shen
Zhikun Xu
Xiao Ye
Jacob Dineen
Ben Zhou
OffRLLRM
172
0
0
16 Jun 2025
A Variational Approach for Mitigating Entity Bias in Relation Extraction
A Variational Approach for Mitigating Entity Bias in Relation ExtractionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Samuel Mensah
Elena Kochkina
Jabez Magomere
Joy Prakash Sain
Simerjot Kaur
Charese Smiley
140
1
0
13 Jun 2025
AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents
Fengze Liu
Haoyu Wang
Joonhyuk Cho
Dan Roth
Andrew W. Lo
113
1
0
04 Jun 2025
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning
Wei Liu
Siya Qi
Xinyu Wang
Chen Qian
Yali Du
Petr Slovak
OffRLLRM
244
3
0
21 May 2025
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
Iván Arcuschin
Jett Janiak
Robert Krzyzanowski
Senthooran Rajamanoharan
Neel Nanda
Arthur Conmy
ReLMLRM
499
67
0
11 Mar 2025
DeepSeek vs. ChatGPT vs. Claude: A Comparative Study for Scientific Computing and Scientific Machine Learning Tasks
DeepSeek vs. ChatGPT vs. Claude: A Comparative Study for Scientific Computing and Scientific Machine Learning TasksTheoretical and Applied Mechanics Letters (TAML), 2025
Qile Jiang
Zhiwei Gao
George Em Karniadakis
LRM
257
0
0
25 Feb 2025
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
Yibo Yan
Shen Wang
Jiahao Huo
Jingheng Ye
Zhendong Chu
Xuming Hu
Philip S. Yu
Daniel Schwalbe-Koda
B. Selman
Qingsong Wen
LRM
462
26
0
05 Feb 2025
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem
  Solving with Computational Graph-Based Retrieval
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval
Xiaocong Yang
Jiacheng Lin
Zhenting Wang
Chengxiang Zhai
ReLM
254
0
0
25 Nov 2024
Are Transformers Truly Foundational for Robotics?
Are Transformers Truly Foundational for Robotics?npj Robotics (npj Robotics), 2024
James A. R. Marshall
Andrew B. Barron
AI4CE
263
3
0
25 Nov 2024
Shortcut Learning in In-Context Learning: A Survey
Shortcut Learning in In-Context Learning: A Survey
Rui Song
Yingji Li
Fausto Giunchiglia
Fausto Giunchiglia
Hao Xu
322
3
0
04 Nov 2024
ReasonAgain: Using Extractable Symbolic Programs to Evaluate
  Mathematical Reasoning
ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning
Xiaodong Yu
Ben Zhou
Hao Cheng
Dan Roth
ReLMLRM
133
7
0
24 Oct 2024
ToW: Thoughts of Words Improve Reasoning in Large Language Models
ToW: Thoughts of Words Improve Reasoning in Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zhikun Xu
Ming shen
Jacob Dineen
Zhaonan Li
Xiao Ye
Shijie Lu
Aswin Rrv
Chitta Baral
Ben Zhou
LRM
843
2
0
21 Oct 2024
MARS: A neurosymbolic approach for interpretable drug discovery
MARS: A neurosymbolic approach for interpretable drug discovery
L. Delong
Yojana Gadiya
Paola Galdi
Jacques D. Fleuriot
Daniel Domingo-Fernández
811
5
0
02 Oct 2024
FamiCom: Further Demystifying Prompts for Language Models with
  Task-Agnostic Performance Estimation
FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Bangzheng Li
Ben Zhou
Xingyu Fu
Fei Wang
Dan Roth
Muhao Chen
155
8
0
17 Jun 2024
BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Yu Feng
Ben Zhou
Weidong Lin
Dan Roth
366
12
0
18 Apr 2024
Conceptual and Unbiased Reasoning in Language Models
Conceptual and Unbiased Reasoning in Language Models
Ben Zhou
Hongming Zhang
Sihao Chen
Dian Yu
Hongwei Wang
Baolin Peng
Dan Roth
Dong Yu
ReLMLRMELM
212
19
0
30 Mar 2024
1