ResearchTrend.AI
Measuring and Controlling Instruction (In)Stability in Language Model Dialogs

13 February 2024
Kenneth Li
Tianle Liu
Naomi Bashkansky
David Bau
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg

Papers citing "Measuring and Controlling Instruction (In)Stability in Language Model Dialogs"

15 / 15 papers shown
Persistent Instability in LLM's Personality Measurements: Effects of Scale, Reasoning, and Conversation History
Tommaso Tosato
Saskia Helbling
Yorguin-Jose Mantilla-Ramos
Mahmood Hegazy
Alberto Tosato
David John Lemay
Irina Rish
G. Dumas
118
6
0
24 Dec 2025
Drift No More? Context Equilibria in Multi-Turn LLM Interactions
Vardhan Dongre
Ryan Rossi
Viet Dac Lai
David Seunghyun Yoon
Dilek Hakkani-Tur
Trung H. Bui
121
2
0
09 Oct 2025
When Instructions Multiply: Measuring and Estimating LLM Capabilities of Multiple Instructions Following
Keno Harada
Yudai Yamazaki
Masachika Taniguchi
Edison Marrese-Taylor
Takeshi Kojima
Yusuke Iwasawa
Yutaka Matsuo
ALM
147
0
0
25 Sep 2025
Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models
Stephen Fitz
P. Romero
Steven Basart
Sipeng Chen
Jose Hernandez-Orallo
136
1
0
19 Sep 2025
IROTE: Human-like Traits Elicitation of Large Language Model via In-Context Self-Reflective Optimization
Yuzhuo Bai
Shitong Duan
Muhua Huang
Jing Yao
Zhenghao Liu
Peng Zhang
Tun Lu
Xiaoyuan Yi
Maosong Sun
Xing Xie
173
1
0
12 Aug 2025
Beyond the Surface: Enhancing LLM-as-a-Judge Alignment with Human via Internal Representations
Peng Lai
Jianjie Zheng
Sijie Cheng
Yun-Nung Chen
Peng Li
Yang Liu
Guanhua Chen
235
2
0
05 Aug 2025
Goal Alignment in LLM-Based User Simulators for Conversational AI
Shuhaib Mehri
Xiaocheng Yang
Takyoung Kim
Gokhan Tur
Shikib Mehri
Dilek Hakkani-Tur
LLMAG
148
4
0
27 Jul 2025
When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Daniela Occhipinti
Marco Guerini
Malvina Nissim
257
0
0
30 May 2025
Position is Power: System Prompts as a Mechanism of Bias in Large Language Models (LLMs)
Conference on Fairness, Accountability and Transparency (FAccT), 2025
Anna Neumann
Elisabeth Kirsten
Muhammad Bilal Zafar
Jatinder Singh
346
8
0
27 May 2025
Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mengru Wang
Ziwen Xu
Shengyu Mao
Shumin Deng
Zhaopeng Tu
Ningyu Zhang
LLMSV
445
10
0
23 May 2025
Exploiting Fine-Grained Skip Behaviors for Micro-Video Recommendation
AAAI Conference on Artificial Intelligence (AAAI), 2025
Sanghyuck Lee
Sangkeun Park
Jaesung Lee
258
1
0
04 Apr 2025
Focus Directions Make Your Language Models Pay More Attention to Relevant Contexts
Youxiang Zhu
Ruochen Li
Danqing Wang
Daniel Haehn
Xiaohui Liang
LRM
326
8
0
30 Mar 2025
Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents
Haochen Sun
Shuwen Zhang
Lujie Niu
Lei Ren
Hao Xu
Hao Fu
Fangkun Zhao
Caixia Yuan
LLMAG
ELM
496
9
0
27 Feb 2025
An Auditing Test To Detect Behavioral Shift in Language Models
International Conference on Learning Representations (ICLR), 2024
Leo Richter
Xuanli He
Pasquale Minervini
Matt J. Kusner
443
0
0
25 Oct 2024
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation
Tong Chen
Akari Asai
Niloofar Mireshghallah
Sewon Min
James Grimmelmann
Yejin Choi
Hannaneh Hajishirzi
Luke Zettlemoyer
Pang Wei Koh
285
39
0
09 Jul 2024