ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.17476
  4. Cited By
Exploring the Sensitivity of LLMs' Decision-Making Capabilities:
  Insights from Prompt Variation and Hyperparameters

Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters

29 December 2023
Manikanta Loya
Divya Sinha
Richard Futrell
ArXivPDFHTML

Papers citing "Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters"

20 / 20 papers shown
Title
An overview of model uncertainty and variability in LLM-based sentiment analysis. Challenges, mitigation strategies and the role of explainability
An overview of model uncertainty and variability in LLM-based sentiment analysis. Challenges, mitigation strategies and the role of explainability
David Herrera-Poyatos
Carlos Peláez-González
Cristina Zuheros
Andrés Herrera-Poyatos
Virilo Tejedor
F. Herrera
Rosana Montes
23
1
0
06 Apr 2025
Generalization Bias in Large Language Model Summarization of Scientific Research
Generalization Bias in Large Language Model Summarization of Scientific Research
Uwe Peters
Benjamin Chin-Yee
ELM
29
0
0
28 Mar 2025
Benchmarking Prompt Sensitivity in Large Language Models
Benchmarking Prompt Sensitivity in Large Language Models
Amirhossein Razavi
Mina Soltangheis
Negar Arabzadeh
Sara Salamat
Morteza Zihayat
Ebrahim Bagheri
57
1
0
09 Feb 2025
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents
Fanzeng Xia
Hao Liu
Yisong Yue
Tongxin Li
51
1
0
03 Jan 2025
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and
  Establishing Best Practices
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Anka Reuel
Amelia F. Hardy
Chandler Smith
Max Lamparth
Malcolm Hardy
Mykel J. Kochenderfer
ELM
62
16
0
20 Nov 2024
CoPrompter: User-Centric Evaluation of LLM Instruction Alignment for
  Improved Prompt Engineering
CoPrompter: User-Centric Evaluation of LLM Instruction Alignment for Improved Prompt Engineering
Ishika Joshi
Simra Shahid
Shreeya Venneti
Manushree Vasu
Yantao Zheng
Yunyao Li
Balaji Krishnamurthy
Gromit Yeuk-Yin Chan
22
0
0
09 Nov 2024
Reinforcement Learning for Aligning Large Language Models Agents with
  Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
Mohamed Salim Aissi
Clément Romac
Thomas Carta
Sylvain Lamprier
Pierre-Yves Oudeyer
Olivier Sigaud
Laure Soulier
Nicolas Thome
14
2
0
25 Oct 2024
MultiTalk: Introspective and Extrospective Dialogue for
  Human-Environment-LLM Alignment
MultiTalk: Introspective and Extrospective Dialogue for Human-Environment-LLM Alignment
Venkata Naren Devarakonda
Ali Umut Kaypak
Shuaihang Yuan
P. Krishnamurthy
Yi Fang
Farshad Khorrami
LLMAG
32
0
0
24 Sep 2024
Irrelevant Alternatives Bias Large Language Model Hiring Decisions
Irrelevant Alternatives Bias Large Language Model Hiring Decisions
Kremena Valkanova
Pencho Yordanov
18
0
0
04 Sep 2024
From Text to Emotion: Unveiling the Emotion Annotation Capabilities of
  LLMs
From Text to Emotion: Unveiling the Emotion Annotation Capabilities of LLMs
Minxue Niu
Mimansa Jaiswal
Emily Mower Provost
23
5
0
30 Aug 2024
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks
Yang You
Jiaqi Han
Yinan Yu
Christian Berger
16
2
0
18 Jul 2024
The Better Angels of Machine Personality: How Personality Relates to LLM
  Safety
The Better Angels of Machine Personality: How Personality Relates to LLM Safety
Jie M. Zhang
Dongrui Liu
Chao Qian
Ziyue Gan
Yong-jin Liu
Yu Qiao
Jing Shao
LLMAG
PILM
32
12
0
17 Jul 2024
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with
  Cross-Lingual Feedback
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
Wen Lai
Mohsen Mesgar
Alexander M. Fraser
LRM
ALM
33
18
0
03 Jun 2024
The Effectiveness of LLMs as Annotators: A Comparative Overview and
  Empirical Analysis of Direct Representation
The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation
Maja Pavlovic
Massimo Poesio
14
17
0
02 May 2024
PATCH -- Psychometrics-AssisTed benCHmarking of Large Language Models: A
  Case Study of Mathematics Proficiency
PATCH -- Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Mathematics Proficiency
Qixiang Fang
Daniel L. Oberski
Dong Nguyen
17
3
0
02 Apr 2024
Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic
  Hierarchy Process and GPT-4 for Automated Decision Support
Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic Hierarchy Process and GPT-4 for Automated Decision Support
Igor Svoboda
D. Lande
6
4
0
12 Feb 2024
GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large
  Language Model
GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Model
Xuanchang Zhang
Zhuosheng Zhang
Hai Zhao
LRM
ALM
11
2
0
04 Feb 2024
Do large language models resemble humans in language use?
Do large language models resemble humans in language use?
Zhenguang G. Cai
Xufeng Duan
David A. Haslett
Shuqi Wang
M. Pickering
ALM
67
37
0
10 Mar 2023
Using cognitive psychology to understand GPT-3
Using cognitive psychology to understand GPT-3
Marcel Binz
Eric Schulz
ELM
LLMAG
228
435
0
21 Jun 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,163
0
21 Mar 2022
1