
Title |
|---|
![]() Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical ApproachEthics: An International Journal of Social, Political, and Legal Philosophy (Ethics), 2025 |
![]() How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to
Challenge AI Safety by Humanizing LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
![]() AI model GPT-3 (dis)informs us better than humansScience Advances (Sci Adv), 2023 |
![]() Human heuristics for AI-generated language are flawedProceedings of the National Academy of Sciences of the United States of America (PNAS), 2022 |
![]() All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated
TextAnnual Meeting of the Association for Computational Linguistics (ACL), 2021 |
![]() Effects of Persuasive Dialogues: Testing Bot Identities and Inquiry
StrategiesInternational Conference on Human Factors in Computing Systems (CHI), 2020 |