Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.05201
Cited By
The Effect of Sampling Temperature on Problem Solving in Large Language Models
7 February 2024
Matthew Renze
Erhan Guven
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Effect of Sampling Temperature on Problem Solving in Large Language Models"
37 / 37 papers shown
Title
Can Large Language Models Predict Parallel Code Performance?
Gregory Bolet
Giorgis Georgakoudis
Harshitha Menon
K. Parasyris
N. Hasabnis
Hayden Estes
Kirk W. Cameron
Gal Oren
28
0
0
06 May 2025
Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection
Myrthe Reuver
Indira Sen
Matteo Melis
Gabriella Lapesa
20
0
0
21 Apr 2025
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval
Muhammad Rafsan Kabir
Rafeed Mohammad Sultan
Fuad Rahman
M. R. Amin
Sifat Momen
Nabeel Mohammed
Shafin Rahman
AILaw
42
0
0
19 Apr 2025
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Junxiong Wang
Wen-Ding Li
Daniele Paliotta
Daniel Ritter
Alexander M. Rush
Tri Dao
LRM
24
0
0
14 Apr 2025
Has the Creativity of Large-Language Models peaked? An analysis of inter- and intra-LLM variability
Jennifer Haase
P. Hanel
Sebastian Pokutta
ALM
LRM
60
0
0
10 Apr 2025
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert
Hardik Bhatnagar
Vishaal Udandarao
Samuel Albanie
Ameya Prabhu
Matthias Bethge
ReLM
ALM
LRM
74
4
0
09 Apr 2025
Can AI Master Construction Management (CM)? Benchmarking State-of-the-Art Large Language Models on CM Certification Exams
Ruoxin Xiong
Yanyu Wang
Suat Gunhan
Yimin Zhu
Charles Berryman
ELM
26
0
0
04 Apr 2025
Emotion Recognition Using Convolutional Neural Networks
Shaoyuan Xu
Yang Cheng
Qian Lin
J. Allebach
29
6
0
03 Apr 2025
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Tong Nie
Jian-jun Sun
Wei Ma
58
1
0
27 Mar 2025
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs
Zhicheng Guo
Sijie Cheng
Yuchen Niu
Hao Wang
Sicheng Zhou
Wenbing Huang
Yang Liu
CLL
OffRL
83
0
0
26 Mar 2025
BugCraft: End-to-End Crash Bug Reproduction Using LLM Agents in Minecraft
Eray Yapağcı
Yavuz Alp Sencer Öztürk
Eray Tüzün
33
0
0
25 Mar 2025
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
Zhuoshi Pan
Yu-Hu Li
Honglin Lin
Qizhi Pei
Zinan Tang
Wei Yu Wu
Chenlin Ming
H. V. Zhao
Conghui He
Lijun Wu
LRM
59
0
0
21 Mar 2025
ProtTeX: Structure-In-Context Reasoning and Editing of Proteins with Large Language Models
Zicheng Ma
Chuanliu Fan
Zhicong Wang
Zhenyu Chen
Xiaohan Lin
Y. Li
Shihao Feng
Jun Zhang
Ziqiang Cao
Y. Gao
43
0
0
11 Mar 2025
Unlocking a New Rust Programming Experience: Fast and Slow Thinking with LLMs to Conquer Undefined Behaviors
Renshuang Jiang
Pan Dong
Zhenling Duan
Yu Shi
Xiaoxiang Fang
Yan Ding
Jun Ma
Shuai Zhao
Zhe Jiang
33
0
0
04 Mar 2025
How Diversely Can Language Models Solve Problems? Exploring the Algorithmic Diversity of Model-Generated Code
Seonghyeon Lee
Heejae Chon
Joonwon Jang
Dongha Lee
Hwanjo Yu
ALM
39
0
0
02 Mar 2025
Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
Yiwei Li
Ji Zhang
Shaoxiong Feng
Peiwen Yuan
X. Wang
...
Y. Zhang
Chuyi Tan
Boyuan Pan
Yao Hu
Kan Li
HILM
39
1
0
27 Feb 2025
Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners
Daniele Paliotta
Junxiong Wang
Matteo Pagliardini
Kevin Y. Li
Aviv Bick
J. Zico Kolter
Albert Gu
F. Fleuret
Tri Dao
ReLM
LRM
43
7
0
27 Feb 2025
AuPair: Golden Example Pairs for Code Repair
Aditi Mavalankar
Hassan Mansoor
Zita Marinho
Masha Samsikova
Tom Schaul
KELM
LRM
67
0
0
12 Feb 2025
Optimizing Temperature for Language Models with Multi-Sample Inference
Weihua Du
Yiming Yang
Sean Welleck
54
2
0
07 Feb 2025
ChatHTTPFuzz: Large Language Model-Assisted IoT HTTP Fuzzing
Zhe Yang
Hao Peng
Yanling Jiang
X. Li
Haohua Du
Shuhai Wang
Jianwei Liu
71
2
0
18 Nov 2024
Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation Models
Yeming Wen
Swarat Chaudhuri
29
0
0
11 Nov 2024
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology
Junior Cedric Tonga
Benjamin Clément
Pierre-Yves Oudeyer
LRM
28
2
0
05 Nov 2024
Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models
Mohammad Beigi
Sijia Wang
Ying Shen
Zihao Lin
Adithya Kulkarni
...
Ming Jin
Jin-Hee Cho
Dawei Zhou
Chang-Tien Lu
Lifu Huang
21
1
0
26 Oct 2024
Cognitive Overload Attack:Prompt Injection for Long Context
Bibek Upadhayay
Vahid Behzadan
Amin Karbasi
AAML
28
2
0
15 Oct 2024
CaLMFlow: Volterra Flow Matching using Causal Language Models
Sizhuang He
Daniel Levine
Ivan Vrkic
Marco Francesco Bressana
David Zhang
S. Rizvi
Yangtian Zhang
E. Zappala
David van Dijk
17
0
0
03 Oct 2024
Evaluating the Performance of Large Language Models in Competitive Programming: A Multi-Year, Multi-Grade Analysis
Adrian Marius Dumitran
Adrian Catalin Badea
Stefan-Gabriel Muscalu
ELM
LRM
18
1
0
31 Aug 2024
Beyond Labels: Aligning Large Language Models with Human-like Reasoning
Muhammad Rafsan Kabir
Rafeed Mohammad Sultan
Ihsanul Haque Asif
Jawad Ibn Ahad
Fuad Rahman
Mohammad Ruhul Amin
Nabeel Mohammed
Shafin Rahman
LRM
22
2
0
20 Aug 2024
FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats
Xuanliang Zhang
Dingzirui Wang
Longxu Dou
Baoxin Wang
Dayong Wu
Qingfu Zhu
Wanxiang Che
LMTD
ReLM
39
2
0
16 Aug 2024
Large Language Models have Intrinsic Self-Correction Ability
Dancheng Liu
Amir Nassereldine
Ziming Yang
Chenhui Xu
Yuting Hu
Jiajie Li
Utkarsh Kumar
Changjae Lee
Jinjun Xiong
KELM
ReLM
LRM
23
9
0
21 Jun 2024
Easy Problems That LLMs Get Wrong
Sean Williams
James Huckle
LRM
19
10
0
30 May 2024
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Matthew Renze
Erhan Guven
LRM
LLMAG
31
34
0
05 May 2024
Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs
Shu Yang
Jiayuan Su
Han Jiang
Mengdi Li
Keyuan Cheng
Muhammad Asif Ali
Lijie Hu
Di Wang
16
5
0
30 Mar 2024
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency
Zhishuai Li
Xiang Wang
Jingjing Zhao
Sun Yang
Guoqing Du
...
Bin Zhang
Yuxiao Ye
Ziyue Li
Rui Zhao
Hangyu Mao
AI4TS
LRM
25
9
0
13 Mar 2024
Recitation-Augmented Language Models
Zhiqing Sun
Xuezhi Wang
Yi Tay
Yiming Yang
Denny Zhou
RALM
192
60
0
04 Oct 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022
A Systematic Evaluation of Large Language Models of Code
Frank F. Xu
Uri Alon
Graham Neubig
Vincent J. Hellendoorn
ELM
ALM
196
624
0
26 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
1