Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.17476
Cited By
Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters
29 December 2023
Manikanta Loya
Divya Sinha
Richard Futrell
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters"
20 / 20 papers shown
Title
An overview of model uncertainty and variability in LLM-based sentiment analysis. Challenges, mitigation strategies and the role of explainability
David Herrera-Poyatos
Carlos Peláez-González
Cristina Zuheros
Andrés Herrera-Poyatos
Virilo Tejedor
F. Herrera
Rosana Montes
23
1
0
06 Apr 2025
Generalization Bias in Large Language Model Summarization of Scientific Research
Uwe Peters
Benjamin Chin-Yee
ELM
29
0
0
28 Mar 2025
Benchmarking Prompt Sensitivity in Large Language Models
Amirhossein Razavi
Mina Soltangheis
Negar Arabzadeh
Sara Salamat
Morteza Zihayat
Ebrahim Bagheri
57
1
0
09 Feb 2025
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents
Fanzeng Xia
Hao Liu
Yisong Yue
Tongxin Li
51
1
0
03 Jan 2025
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Anka Reuel
Amelia F. Hardy
Chandler Smith
Max Lamparth
Malcolm Hardy
Mykel J. Kochenderfer
ELM
62
16
0
20 Nov 2024
CoPrompter: User-Centric Evaluation of LLM Instruction Alignment for Improved Prompt Engineering
Ishika Joshi
Simra Shahid
Shreeya Venneti
Manushree Vasu
Yantao Zheng
Yunyao Li
Balaji Krishnamurthy
Gromit Yeuk-Yin Chan
22
0
0
09 Nov 2024
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Mohamed Salim Aissi
Clément Romac
Thomas Carta
Sylvain Lamprier
Pierre-Yves Oudeyer
Olivier Sigaud
Laure Soulier
Nicolas Thome
14
2
0
25 Oct 2024
MultiTalk: Introspective and Extrospective Dialogue for Human-Environment-LLM Alignment
Venkata Naren Devarakonda
Ali Umut Kaypak
Shuaihang Yuan
P. Krishnamurthy
Yi Fang
Farshad Khorrami
LLMAG
32
0
0
24 Sep 2024
Irrelevant Alternatives Bias Large Language Model Hiring Decisions
Kremena Valkanova
Pencho Yordanov
18
0
0
04 Sep 2024
From Text to Emotion: Unveiling the Emotion Annotation Capabilities of LLMs
Minxue Niu
Mimansa Jaiswal
Emily Mower Provost
23
5
0
30 Aug 2024
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks
Yang You
Jiaqi Han
Yinan Yu
Christian Berger
16
2
0
18 Jul 2024
The Better Angels of Machine Personality: How Personality Relates to LLM Safety
Jie M. Zhang
Dongrui Liu
Chao Qian
Ziyue Gan
Yong-jin Liu
Yu Qiao
Jing Shao
LLMAG
PILM
32
12
0
17 Jul 2024
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
Wen Lai
Mohsen Mesgar
Alexander M. Fraser
LRM
ALM
33
18
0
03 Jun 2024
The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation
Maja Pavlovic
Massimo Poesio
14
17
0
02 May 2024
PATCH -- Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Mathematics Proficiency
Qixiang Fang
Daniel L. Oberski
Dong Nguyen
17
3
0
02 Apr 2024
Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic Hierarchy Process and GPT-4 for Automated Decision Support
Igor Svoboda
D. Lande
6
4
0
12 Feb 2024
GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Model
Xuanchang Zhang
Zhuosheng Zhang
Hai Zhao
LRM
ALM
11
2
0
04 Feb 2024
Do large language models resemble humans in language use?
Zhenguang G. Cai
Xufeng Duan
David A. Haslett
Shuqi Wang
M. Pickering
ALM
67
37
0
10 Mar 2023
Using cognitive psychology to understand GPT-3
Marcel Binz
Eric Schulz
ELM
LLMAG
228
435
0
21 Jun 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,163
0
21 Mar 2022
1