ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.12521
  4. Cited By
Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights

Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights

18 February 2025
Shubham Parashar
Blake Olson
Sambhav Khurana
Eric Li
Hongyi Ling
James Caverlee
Shuiwang Ji
    LRMReLM
ArXiv (abs)PDFHTMLGithub (30★)

Papers citing "Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights"

9 / 9 papers shown
Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol
Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol
Niklas Jobs
Luis Miguel Vieira da Silva
Jayanth Somashekaraiah
Maximilian Weigand
David Kube
Felix Gehlhoff
LLMAGLM&Ro
252
0
0
03 Dec 2025
Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages
Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages
Lechen Zhang
Yusheng Zhou
Tolga Ergen
Lajanugen Logeswaran
Moontae Lee
David Jurgens
LRM
227
2
0
02 Dec 2025
PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models
PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models
Patrick Haller
Fabio Barth
Jonas Golde
Georg Rehm
Alan Akbik
LRM
420
1
0
27 Oct 2025
Reasoning with Preference Constraints: A Benchmark for Language Models in Many-to-One Matching Markets
Reasoning with Preference Constraints: A Benchmark for Language Models in Many-to-One Matching Markets
Marylou Fauchard
Florian Carichon
Margarida Carvalho
G. Farnadi
LRM
186
0
0
16 Sep 2025
OIBench: Benchmarking Strong Reasoning Models with Olympiad in Informatics
OIBench: Benchmarking Strong Reasoning Models with Olympiad in Informatics
Yaoming Zhu
Junxin Wang
Yiyang Li
Lin Qiu
Zongyu Wang
...
Xuezhi Cao
Yuhuai Wei
Mingshi Wang
Xunliang Cai
Rong Ma
LRM
453
5
0
12 Jun 2025
Cost-of-Pass: An Economic Framework for Evaluating Language Models
Cost-of-Pass: An Economic Framework for Evaluating Language Models
Mehmet Hamza Erol
Batu El
Mirac Suzgun
Mert Yuksekgonul
J. Zou
ELM
398
20
0
17 Apr 2025
Efficient Reasoning Models: A Survey
Efficient Reasoning Models: A Survey
Sicheng Feng
Gongfan Fang
Xinyin Ma
Xinchao Wang
ReLMLRM
994
60
0
15 Apr 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Andrew Wen
Shaochen
Zhong
Hanjie Chen
Helen Zhou
OffRLReLMLRM
844
338
0
20 Mar 2025
Complex LLM Planning via Automated Heuristics Discovery
Complex LLM Planning via Automated Heuristics Discovery
Hongyi Ling
Shubham Parashar
Sambhav Khurana
Blake Olson
Anwesha Basu
Gaurangi Sinha
Zhuowen Tu
James Caverlee
Shuiwang Ji
362
9
0
26 Feb 2025
1
Page 1 of 1