ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.05201
  4. Cited By
The Effect of Sampling Temperature on Problem Solving in Large Language
  Models
v1v2 (latest)

The Effect of Sampling Temperature on Problem Solving in Large Language Models

7 February 2024
Matthew Renze
Erhan Guven
ArXiv (abs)PDFHTMLGithub (21★)

Papers citing "The Effect of Sampling Temperature on Problem Solving in Large Language Models"

50 / 60 papers shown
Temperature in SLMs: Impact on Incident Categorization in On-Premises Environments
Temperature in SLMs: Impact on Incident Categorization in On-Premises Environments
Marcio Pohlmann
Alex Severo
Gefté Almeida
Diego Kreutz
Tiago Heinrich
Lourenço Pereira
81
1
0
21 Nov 2025
The Shifting Landscape of Vaccine Discourse: Insights From a Decade of Pre- to Post-COVID-19 Vaccine Posts on Social Media
The Shifting Landscape of Vaccine Discourse: Insights From a Decade of Pre- to Post-COVID-19 Vaccine Posts on Social MediaPLoS ONE (PLoS ONE), 2025
Nikesh Gyawali
Doina Caragea
Cornelia Caragea
Saif M. Mohammad
94
0
0
20 Nov 2025
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Alexis Audran-Reiss
Jordi Armengol-Estapé
Karen Hambardzumyan
Amar Budhiraja
Martin Josifoski
...
Jenny Zhang
Taco Cohen
Yossi Adi
Tatiana Shavrina
Yoram Bachrach
259
4
0
19 Nov 2025
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
Sushant Gautam
Michael A. Riegler
Pål Halvorsen
VLM
251
3
0
16 Nov 2025
PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework
PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework
Sina Montazeri
Yunhe Feng
Kewei Sha
LLMAGAI4TS
227
2
0
04 Nov 2025
Optimal Attention Temperature Enhances In-Context Learning under Distribution Shift
Optimal Attention Temperature Enhances In-Context Learning under Distribution Shift
Samet Demir
Zafer Dogan
157
0
0
03 Nov 2025
G2: Guided Generation for Enhanced Output Diversity in LLMs
G2: Guided Generation for Enhanced Output Diversity in LLMs
Zhiwen Ruan
Yixia Li
Y. Liu
Yun-Nung Chen
Weihua Luo
P. Li
Yang Liu
Guanhua Chen
166
5
0
01 Nov 2025
Stable LLM Ensemble: Interaction between Example Representativeness and Diversity
Stable LLM Ensemble: Interaction between Example Representativeness and Diversity
Junichiro Niimi
217
0
0
15 Oct 2025
Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning
Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning
Chenghao Yang
Lin Gui
Chenxiao Yang
Victor Veitch
Lizhu Zhang
Zhuokai Zhao
OffRL
208
6
0
06 Oct 2025
OptAgent: Optimizing Query Rewriting for E-commerce via Multi-Agent Simulation
OptAgent: Optimizing Query Rewriting for E-commerce via Multi-Agent Simulation
Divij Handa
David Blincoe
Orson Adams
Yinlin Fu
232
2
0
04 Oct 2025
On the Role of Temperature Sampling in Test-Time Scaling
On the Role of Temperature Sampling in Test-Time Scaling
Yuheng Wu
Azalia Mirhoseini
Thierry Tambe
ALMLRM
175
5
1
02 Oct 2025
When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMs
When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMs
Shree Harsha Bokkahalli Satish
G. Henter
Éva Székely
355
2
0
01 Oct 2025
Artificial Phantasia: Evidence for Propositional Reasoning-Based Mental Imagery in Large Language Models
Artificial Phantasia: Evidence for Propositional Reasoning-Based Mental Imagery in Large Language Models
Morgan McCarty
Jorge Morales
LRM
154
1
0
27 Sep 2025
Automated Extraction of Material Properties using LLM-based AI Agents
Automated Extraction of Material Properties using LLM-based AI Agents
Subham Ghosh
Abhishek Tewari
143
0
0
23 Sep 2025
Control the Temperature: Selective Sampling for Diverse and High-Quality LLM Outputs
Control the Temperature: Selective Sampling for Diverse and High-Quality LLM Outputs
S. Troshin
Wafaa Mohammed
Yan Meng
Christof Monz
Antske Fokkens
Vlad Niculae
176
6
0
20 Sep 2025
Benchmarking Contextual and Paralinguistic Reasoning in Speech-LLMs: A Case Study with In-the-Wild Data
Benchmarking Contextual and Paralinguistic Reasoning in Speech-LLMs: A Case Study with In-the-Wild Data
Qiongqiong Wang
Hardik B. Sailor
Tianchi Liu
Wenyu Zhang
Muhammad Huzaifah
Nattadaporn Lertcheva
Shuo Sun
Nancy F. Chen
Jinyang Wu
AiTi Aw
217
2
0
20 Sep 2025
LTA-thinker: Latent Thought-Augmented Training Framework for Large Language Models on Complex Reasoning
LTA-thinker: Latent Thought-Augmented Training Framework for Large Language Models on Complex Reasoning
Jiaqi Wang
Binquan Ji
Haibo Luo
Yiyang Qi
Ruiting Li
Huiyan Wang
Yuantao Han
Cangyi Yang
jiaxu Zhang
Feiliang Ren
BDLLRM
324
2
0
16 Sep 2025
Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety
Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety
Denis Janiak
Julia Moska
Dawid Motyka
Karolina Seweryn
Paweł Walkowiak
Bartosz Żuk
Arkadiusz Janz
ALM
224
0
0
16 Sep 2025
Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization
Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization
Jiahao Yu
Zelei Cheng
Xian Wu
Xinyu Xing
256
2
0
15 Sep 2025
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Xue Yang
Yuxin Zuo
Jiale Yu
Xicheng Zhang
Z. Yang
...
Shanghang Zhang
Y. Wang
Yao Mu
Bowen Zhou
Ning Ding
OffRLLRM
326
69
0
11 Sep 2025
Acquiescence Bias in Large Language Models
Acquiescence Bias in Large Language Models
Daniel Braun
AI4CE
244
1
0
10 Sep 2025
ReCode: Improving LLM-based Code Repair with Fine-Grained Retrieval-Augmented Generation
ReCode: Improving LLM-based Code Repair with Fine-Grained Retrieval-Augmented Generation
Yicong Zhao
Shisong Chen
Jiacheng Zhang
Zhixu Li
204
4
0
02 Sep 2025
Error Notebook-Guided, Training-Free Part Retrieval in 3D CAD Assemblies via Vision-Language Models
Error Notebook-Guided, Training-Free Part Retrieval in 3D CAD Assemblies via Vision-Language Models
Yunqing Liu
Nan Zhang
Zhiming Tan
RALM3DV
280
0
0
01 Sep 2025
Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation
Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation
Jinyi Han
Tingyun Li
Shisong Chen
Jie Shi
X. Wang
...
Jiaqing Liang
Xin Lin
Liqian Wen
Zulong Chen
Yanghua Xiao
149
2
0
16 Aug 2025
Are Large Language Models Dynamic Treatment Planners? An In Silico Study from a Prior Knowledge Injection Angle
Are Large Language Models Dynamic Treatment Planners? An In Silico Study from a Prior Knowledge Injection Angle
Zhiyao Luo
T. Zhu
OffRLLM&MA
171
1
0
06 Aug 2025
Trae Agent: An LLM-based Agent for Software Engineering with Test-time Scaling
Trae Agent: An LLM-based Agent for Software Engineering with Test-time Scaling
Trae Research Team
Pengfei Gao
Zhao Tian
Xiangxin Meng
Xinchen Wang
...
Cuiyun Gao
Yun Lin
Y. Xiong
Chao Peng
Xia Liu
LLMAGAIFin
155
68
0
31 Jul 2025
Mind the Language Gap in Digital Humanities: LLM-Aided Translation of SKOS Thesauri
Mind the Language Gap in Digital Humanities: LLM-Aided Translation of SKOS Thesauri
Felix Kraus
Nicolas Blumenröhr
Danah Tonne
Achim Streit
197
0
0
22 Jul 2025
From Queries to Criteria: Understanding How Astronomers Evaluate LLMs
From Queries to Criteria: Understanding How Astronomers Evaluate LLMs
Alina Hyk
Kiera McCormick
Mian Zhong
I. Ciucă
Sanjib Sharma
John F. Wu
J. E. G. Peek
K. Iyer
Ziang Xiao
Anjalie Field
217
4
0
21 Jul 2025
Self-Correction Bench: Uncovering and Addressing the Self-Correction Blind Spot in Large Language Models
Self-Correction Bench: Uncovering and Addressing the Self-Correction Blind Spot in Large Language Models
Ken Tsui
KELMLRM
294
6
0
03 Jul 2025
Semantic-guided Diverse Decoding for Large Language Model
Semantic-guided Diverse Decoding for Large Language Model
Weijie Shi
Yue Cui
Yaguang Wu
J. Fang
Shibo Zhang
Mengze Li
Sirui Han
Jia Zhu
Jiajie Xu
Xiaofang Zhou
292
3
0
30 Jun 2025
LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
Chenghao Yang
Ari Holtzman
Ari Holtzman
329
3
0
22 Jun 2025
Fragile Preferences: A Deep Dive Into Order Effects in Large Language Models
Fragile Preferences: A Deep Dive Into Order Effects in Large Language Models
Haonan Yin
Shai Vardi
Vidyanand Choudhary
284
3
0
17 Jun 2025
Don't throw the baby out with the bathwater: How and why deep learning for ARC
Don't throw the baby out with the bathwater: How and why deep learning for ARC
Jack Cole
Mohamed Osman
LRM
409
6
0
17 Jun 2025
Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making
Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making
Xiaopeng Yuan
X. R. Zhang
Ke Xu
Yifan Xu
Lijun Yu
Jindong Wang
Yushun Dong
Haohan Wang
LRM
375
4
0
13 Jun 2025
Knowing Before Saying: LLM Representations Encode Information About Chain-of-Thought Success Before Completion
Knowing Before Saying: LLM Representations Encode Information About Chain-of-Thought Success Before CompletionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Anum Afzal
Florian Matthes
Gal Chechik
Yftah Ziser
LRM
319
11
0
30 May 2025
VModA: An Effective Framework for Adaptive NSFW Image Moderation
VModA: An Effective Framework for Adaptive NSFW Image Moderation
Han Bao
Qinying Wang
Zhi Chen
Qingming Li
Xuhong Zhang
Changjiang Li
Zonghui Wang
Shouling Ji
Wenzhi Chen
268
2
0
29 May 2025
CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models
CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models
Arnav Verma
Kushin Mukherjee
Christopher Potts
Elisa Kreiss
Judith E. Fan
246
3
0
22 May 2025
SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning
SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning
Yige Xu
Xu Guo
Zhiwei Zeng
Chunyan Miao
BDLLRM
739
18
0
16 May 2025
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs
Lake Yin
Fan Huang
350
1
0
15 May 2025
Atomic Consistency Preference Optimization for Long-Form Question Answering
Atomic Consistency Preference Optimization for Long-Form Question Answering
Jingfeng Chen
Raghuveer Thirukovalluru
Junlin Wang
Kaiwei Luo
Bhuwan Dhingra
KELMHILM
332
2
0
14 May 2025
Can Large Language Models Predict Parallel Code Performance?
Can Large Language Models Predict Parallel Code Performance?IEEE International Symposium on High-Performance Parallel Distributed Computing (HPDC), 2025
Gregory Bolet
Giorgis Georgakoudis
Harshitha Menon
K. Parasyris
N. Hasabnis
Hayden Estes
Kirk W. Cameron
Gal Oren
298
5
0
06 May 2025
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval
Muhammad Rafsan Kabir
Rafeed Mohammad Sultan
Fuad Rahman
M. R. Amin
Sifat Momen
Nabeel Mohammed
Shafin Rahman
AILaw
285
6
0
19 Apr 2025
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Junxiong Wang
Wen-Ding Li
Daniele Paliotta
Daniel Ritter
Alexander M. Rush
Tri Dao
LRM
458
18
0
14 Apr 2025
Has the Creativity of Large-Language Models peaked? An analysis of inter- and intra-LLM variability
Has the Creativity of Large-Language Models peaked? An analysis of inter- and intra-LLM variability
Jennifer Haase
P. Hanel
Sebastian Pokutta
ALMLRM
329
11
0
10 Apr 2025
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert
Hardik Bhatnagar
Vishaal Udandarao
Samuel Albanie
Christian Schroeder de Witt
Matthias Bethge
ReLMALMLRM
691
79
0
09 Apr 2025
Emotion Recognition Using Convolutional Neural Networks
Emotion Recognition Using Convolutional Neural Networks
Shaoyuan Xu
Yang Cheng
Qian Lin
J. Allebach
428
7
0
03 Apr 2025
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Tong Nie
Jian Sun
Wei Ma
721
44
0
27 Mar 2025
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhicheng Guo
Sijie Cheng
Yuchen Niu
Hao Wang
Sicheng Zhou
Wenbing Huang
Yang Liu
CLLOffRL
514
9
0
26 Mar 2025
Agents in the Sandbox: End-to-End Crash Bug Reproduction for Minecraft
Agents in the Sandbox: End-to-End Crash Bug Reproduction for Minecraft
Eray Yapağcı
Yavuz Alp Sencer Öztürk
Eray Tüzün
235
2
0
25 Mar 2025
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
LEMMA: Learning from Errors for MatheMatical Advancement in LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhuoshi Pan
Yu Li
Honglin Lin
Qizhi Pei
Zinan Tang
Wei Wu
Chenlin Ming
H. Vicky Zhao
Bin Wang
Lijun Wu
LRM
510
24
0
21 Mar 2025
12
Next
Page 1 of 2