ResearchTrend.AI
CLadder: Assessing Causal Reasoning in Language Models

7 December 2023
Zhijing Jin
Yuen Chen
Felix Leeb
Luigi Gresele
Ojasv Kamal
Zhiheng Lyu
Kevin Blin
Fernando Gonzalez Adauto
Max Kleiman-Weiner
Mrinmaya Sachan
Bernhard Schölkopf
    ReLM
    ELM
    LRM

Papers citing "CLadder: Assessing Causal Reasoning in Language Models"

46 / 46 papers shown
Playpen: An Environment for Exploring Learning Through Conversational Interaction
Nicola Horst
Davide Mazzaccara
Antonia Schmidt
Michael Sullivan
Filippo Momentè
...
Alexander Koller
Oliver Lemon
David Schlangen
Mario Giulianelli
Alessandro Suglia
OffRL
32
0
0
11 Apr 2025
BiasCause: Evaluate Socially Biased Causal Reasoning of Large Language Models
Tian Xie
Tongxin Yin
Vaishakh Keshava
Xueru Zhang
Siddhartha Reddy Jonnalagadda
ELM
LRM
52
0
0
08 Apr 2025
Probabilistic Reasoning with LLMs for k-anonymity Estimation
Jonathan Zheng
Sauvik Das
Alan Ritter
Wei-ping Xu
55
0
0
12 Mar 2025
Cyber Defense Reinvented: Large Language Models as Threat Intelligence Copilots
Xiaoqun Liu
Jiacheng Liang
Qiben Yan
Muchao Ye
Jinyuan Jia
Zhaohan Xi
56
0
0
28 Feb 2025
Unveiling and Causalizing CoT: A Causal Pespective
Jiarun Fu
LiZhong Ding
Hao Li
P. Li
Qiuning Wei
Xu Chen
LRM
76
0
0
25 Feb 2025
Testing the limits of fine-tuning to improve reasoning in vision language models
Luca M. Schulze Buschoff
Konstantinos Voudouris
Elif Akata
Matthias Bethge
Joshua B. Tenenbaum
Eric Schulz
LRM
VLM
Presented at ResearchTrend Connect | VLM on 14 Mar 2025
120
0
1
24 Feb 2025
Logic Haystacks: Probing LLMs Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding)
Damien Sileo
RALM
LRM
33
0
0
24 Feb 2025
ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models
Martina Miliani
S. Auriemma
Alessandro Bondielli
Emmanuele Chersoni
Lucia Passaro
Irene Sucameli
Alessandro Lenci
LRM
ELM
41
0
0
21 Feb 2025
Prompting Strategies for Enabling Large Language Models to Infer Causation from Correlation
Eleni Sgouritsa
Virginia Aglietti
Yee Whye Teh
Arnaud Doucet
A. Gretton
Silvia Chiappa
ReLM
LRM
69
0
0
18 Dec 2024
COLD: Causal reasOning in cLosed Daily activities
Abhinav Joshi
A. Ahmad
Ashutosh Modi
LRM
ReLM
66
1
0
29 Nov 2024
LLM-initialized Differentiable Causal Discovery
Shiv Kampani
David Hidary
Constantijn van der Poel
Martin Ganahl
Brenda Miao
11
0
0
28 Oct 2024
CausalGraph2LLM: Evaluating LLMs for Causal Queries
Ivaxi Sheth
Bahare Fatemi
Mario Fritz
17
0
0
21 Oct 2024
Causality for Large Language Models
Anpeng Wu
Kun Kuang
Minqin Zhu
Yingrong Wang
Yujia Zheng
Kairong Han
B. Li
Guangyi Chen
Fei Wu
Kun Zhang
LRM
44
4
0
20 Oct 2024
Are UFOs Driving Innovation? The Illusion of Causality in Large Language Models
María Victoria Carro
Francisca Gauna Selasco
Denise Alejandra Mester
Mario Alejandro Leiva
LRM
18
0
0
15 Oct 2024
Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers
Lorenzo Pacchiardi
Marko Tesic
Lucy G. Cheke
José Hernández Orallo
31
3
0
15 Oct 2024
Do LLMs Have the Generalization Ability in Conducting Causal Inference?
Chen Wang
Dongming Zhao
Bo Wang
Ruifang He
Yuexian Hou
ELM
20
0
0
15 Oct 2024
QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios
Timo Pierre Schrader
Lukas Lange
Simon Razniewski
Annemarie Friedrich
UQLM
23
0
0
14 Oct 2024
Reasoning Elicitation in Language Models via Counterfactual Feedback
Alihan Hüyük
Xinnuo Xu
Jacqueline Maasch
Aditya V. Nori
Javier González
ReLM
LRM
47
1
0
02 Oct 2024
Counterfactual Token Generation in Large Language Models
Ivi Chatzi
N. C. Benz
Eleni Straitouri
Stratis Tsirtsis
Manuel Gomez Rodriguez
LRM
34
3
0
25 Sep 2024
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL
Eduardo Pignatelli
Johan Ferret
Tim Rocktäschel
Edward Grefenstette
Davide Paglieri
Samuel Coward
Laura Toni
30
2
0
19 Sep 2024
Hypothesizing Missing Causal Variables with LLMs
Ivaxi Sheth
Sahar Abdelnabi
Mario Fritz
CML
LRM
30
4
0
04 Sep 2024
CHECKWHY: Causal Fact Verification via Argument Structure
Jiasheng Si
Yibo Zhao
Yingjie Zhu
Haiyang Zhu
Wenpeng Lu
Deyu Zhou
CML
HILM
LRM
27
0
0
20 Aug 2024
Prompt2DeModel: Declarative Neuro-Symbolic Modeling with Natural Language
Hossein Rajaby Faghihi
Aliakbar Nafar
Andrzej Uszok
Hamid Karimian
Parisa Kordjamshidi
27
0
0
30 Jul 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CML
LRM
29
2
0
27 Jun 2024
CELLO: Causal Evaluation of Large Vision-Language Models
Meiqi Chen
Bo Peng
Yan Zhang
Chaochao Lu
LRM
ELM
28
0
0
27 Jun 2024
Do Large Language Models Exhibit Cognitive Dissonance? Studying the Difference Between Revealed Beliefs and Stated Answers
Manuel Mondal
Ljiljana Dolamic
Gérôme Bovet
Philippe Cudré-Mauroux
Julien Audiffren
24
2
0
21 Jun 2024
Quriosity: Analyzing Human Questioning Behavior and Causal Inquiry through Curiosity-Driven Queries
Roberto Ceraolo
Dmitrii Kharlapenko
Amélie Reymond
Rada Mihalcea
Mrinmaya Sachan
Bernhard Schölkopf
Zhijing Jin
CML
14
2
0
30 May 2024
Simulating Policy Impacts: Developing a Generative Scenario Writing Method to Evaluate the Perceived Effects of Regulation
Julia Barnett
Kimon Kieslich
Nicholas Diakopoulos
19
3
0
15 May 2024
Risks and Opportunities of Open-Source Generative AI
Francisco Eiras
Aleksander Petrov
Bertie Vidgen
Christian Schroeder
Fabio Pizzati
...
Matthew Jackson
Phillip H. S. Torr
Trevor Darrell
Y. Lee
Jakob N. Foerster
34
18
0
14 May 2024
EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning
Yinzhu Quan
Zefang Liu
27
2
0
13 May 2024
Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models
Jan-Philipp Fränken
Kanishk Gandhi
Tori Qiu
Ayesha Khawaja
Noah D. Goodman
Tobias Gerstenberg
ELM
27
1
0
17 Apr 2024
From Explainable to Interpretable Deep Learning for Natural Language Processing in Healthcare: How Far from Reality?
Guangming Huang
Yingya Li
Shoaib Jameel
Yunfei Long
G. Papanastasiou
24
16
0
18 Mar 2024
Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment
Congzhi Zhang
Linhai Zhang
Jialong Wu
Deyu Zhou
Guoqiang Xu
CML
AI4CE
LRM
44
15
0
05 Mar 2024
Cause and Effect: Can Large Language Models Truly Understand Causality?
Swagata Ashwani
Kshiteesh Hegde
Nishith Reddy Mannuru
Mayank Jindal
Dushyant Singh Sengar
Krishna Chaitanya Rao Kathala
Dishant Banga
Vinija Jain
Aman Chadha
LRM
19
17
0
28 Feb 2024
Is Knowledge All Large Language Models Needed for Causal Reasoning?
Hengrui Cai
Shengjie Liu
Rui Song
LRM
ELM
12
6
0
30 Dec 2023
CRAB: Assessing the Strength of Causal Relationships Between Real-world Events
Angelika Romanou
Syrielle Montariol
Debjit Paul
Leo Laugier
Karl Aberer
Antoine Bosselut
NAI
13
19
0
07 Nov 2023
The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis
Yuxiang Zhou
Jiazheng Li
Yanzheng Xiang
Hanqi Yan
Lin Gui
Yulan He
22
13
0
01 Nov 2023
Teaching Probabilistic Logical Reasoning to Transformers
Aliakbar Nafar
K. Venable
Parisa Kordjamshidi
ReLM
LRM
11
3
0
22 May 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
203
2,232
0
22 Mar 2023
Emerging Synergies in Causality and Deep Generative Models: A Survey
Guanglin Zhou
Shaoan Xie
Guang-Yuan Hao
Shiming Chen
Biwei Huang
Xiwei Xu
Chen Wang
Liming Zhu
Lina Yao
Kun Zhang
AI4CE
33
11
0
29 Jan 2023
Deep Causal Learning: Representation, Discovery and Inference
Zizhen Deng
Xiaolong Zheng
Hu Tian
D. Zeng
CML
BDL
26
11
0
07 Nov 2022
WikiWhy: Answering and Explaining Cause-and-Effect Questions
Matthew Ho
Aditya Sharma
Justin Chang
Michael Stephen Saxon
Sharon Levy
Yujie Lu
William Yang Wang
ReLM
KELM
LRM
58
16
0
21 Oct 2022
A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
Alessandro Stolfo
Zhijing Jin
Kumar Shridhar
Bernhard Schölkopf
Mrinmaya Sachan
ELM
OOD
LRM
14
61
0
21 Oct 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
273
1,561
0
18 Sep 2019
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktäschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
393
2,216
0
03 Sep 2019