Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters

29 December 2023

Papers citing "Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters"

Title | Authors | Topic tags | Counts | Date
An overview of model uncertainty and variability in LLM-based sentiment analysis. Challenges, mitigation strategies and the role of explainability | David Herrera-Poyatos, Carlos Peláez-González, Cristina Zuheros, Andrés Herrera-Poyatos, Virilo Tejedor, F. Herrera, Rosana Montes | - | 23 1 0 | 06 Apr 2025
Generalization Bias in Large Language Model Summarization of Scientific Research | Uwe Peters, Benjamin Chin-Yee | ELM | 29 0 0 | 28 Mar 2025
Benchmarking Prompt Sensitivity in Large Language Models | Amirhossein Razavi, Mina Soltangheis, Negar Arabzadeh, Sara Salamat, Morteza Zihayat, Ebrahim Bagheri | - | 57 1 0 | 09 Feb 2025
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents | Fanzeng Xia, Hao Liu, Yisong Yue, Tongxin Li | - | 51 1 0 | 03 Jan 2025
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices | Anka Reuel, Amelia F. Hardy, Chandler Smith, Max Lamparth, Malcolm Hardy, Mykel J. Kochenderfer | ELM | 62 16 0 | 20 Nov 2024
CoPrompter: User-Centric Evaluation of LLM Instruction Alignment for Improved Prompt Engineering | Ishika Joshi, Simra Shahid, Shreeya Venneti, Manushree Vasu, Yantao Zheng, Yunyao Li, Balaji Krishnamurthy, Gromit Yeuk-Yin Chan | - | 22 0 0 | 09 Nov 2024
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting | Mohamed Salim Aissi, Clément Romac, Thomas Carta, Sylvain Lamprier, Pierre-Yves Oudeyer, Olivier Sigaud, Laure Soulier, Nicolas Thome | - | 14 2 0 | 25 Oct 2024
MultiTalk: Introspective and Extrospective Dialogue for Human-Environment-LLM Alignment | Venkata Naren Devarakonda, Ali Umut Kaypak, Shuaihang Yuan, P. Krishnamurthy, Yi Fang, Farshad Khorrami | LLMAG | 32 0 0 | 24 Sep 2024
Irrelevant Alternatives Bias Large Language Model Hiring Decisions | Kremena Valkanova, Pencho Yordanov | - | 18 0 0 | 04 Sep 2024
From Text to Emotion: Unveiling the Emotion Annotation Capabilities of LLMs | Minxue Niu, Mimansa Jaiswal, Emily Mower Provost | - | 23 5 0 | 30 Aug 2024
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks | Yang You, Jiaqi Han, Yinan Yu, Christian Berger | - | 16 2 0 | 18 Jul 2024
The Better Angels of Machine Personality: How Personality Relates to LLM Safety | Jie M. Zhang, Dongrui Liu, Chao Qian, Ziyue Gan, Yong-jin Liu, Yu Qiao, Jing Shao | LLMAG, PILM | 32 12 0 | 17 Jul 2024
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback | Wen Lai, Mohsen Mesgar, Alexander M. Fraser | LRM, ALM | 33 18 0 | 03 Jun 2024
The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation | Maja Pavlovic, Massimo Poesio | - | 14 17 0 | 02 May 2024
PATCH -- Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Mathematics Proficiency | Qixiang Fang, Daniel L. Oberski, Dong Nguyen | - | 17 3 0 | 02 Apr 2024
Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic Hierarchy Process and GPT-4 for Automated Decision Support | Igor Svoboda, D. Lande | - | 6 4 0 | 12 Feb 2024
GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Model | Xuanchang Zhang, Zhuosheng Zhang, Hai Zhao | LRM, ALM | 11 2 0 | 04 Feb 2024
Do large language models resemble humans in language use? | Zhenguang G. Cai, Xufeng Duan, David A. Haslett, Shuqi Wang, M. Pickering | ALM | 67 37 0 | 10 Mar 2023
Using cognitive psychology to understand GPT-3 | Marcel Binz, Eric Schulz | ELM, LLMAG | 228 435 0 | 21 Jun 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models | Xuezhi Wang, Jason W. Wei, Dale Schuurmans, Quoc Le, Ed H. Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou | ReLM, BDL, LRM, AI4CE | 297 3,163 0 | 21 Mar 2022