v1v2 (latest)

ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023

27 March 2023

Papers citing "ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks"

50 / 297 papers shown

Title
Identify, Isolate, and Purge: Mitigating Hallucinations in LVLMs via Self-Evolving Distillation Wenhao Li Xiu Su Jingyi Wu Feng Yang Yang-Yang Liu Yi-Ling Chen Shan You Chang Xu VLM 207 0 0 07 Jul 2025
Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games David Guzman Piedrahita Yongjin Yang Mrinmaya Sachan Giorgia Ramponi Bernhard Schölkopf Zhijing Jin LLMAG LRM 180 4 0 29 Jun 2025
Advancing Harmful Content Detection in Organizational Research: Integrating Large Language Models with Elo Rating System Mustafa Akben Aaron Satko 174 0 0 19 Jun 2025
VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents Sam Yu-Te Lee Chenyang Ji Shicheng Wen Lifu Huang Dongyu Liu Kwan-Liu Ma 306 0 0 17 Jun 2025
Evaluating LLM-Contaminated Crowdsourcing Data Without Ground Truth Yichi Zhang Jinlong Pang Zhaowei Zhu Yang Liu 193 2 0 08 Jun 2025
PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems Yi Huang Wajih UI Hassan Yao Guo Xiangqun Chen Ding Li 259 0 0 06 Jun 2025
Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data AnnotationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Mingxuan Xia Haobo Wang Shouqing Yang Zewei Yu Yongfeng Zhang Junbo Zhao Runze Wu 342 1 0 04 Jun 2025
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification Shuzhou Yuan Ercong Nie Lukas Kouba Ashish Yashwanth Kangen Helmut Schmid Hinrich Schütze Michael Färber 242 2 0 02 Jun 2025
Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution Meysam Alizadeh Zeynab Samei Daria Stetsenko Fabrizio Gilardi SILM 304 7 0 01 Jun 2025
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge Xin Jing Jiadong Wang Iosif Tsangko Andreas Triantafyllopoulos Björn Schuller 164 0 0 30 May 2025
Redefining Research Crowdsourcing: Incorporating Human Feedback with LLM-Powered Digital Twins Amanda Chan Catherine Di Joseph Rupertus Gary Smith Varun Nagaraj Rao Manoel Horta Ribeiro Andrés Monroy-Hernández 161 2 0 29 May 2025
Be.FM: Open Foundation Models for Human Behavior Yutong Xie Zhuoheng Li Xiyuan Wang Yijun Pan Qijia Liu ... Xingjian Zhang Jin Huang Walter Yuan Matthew O Jackson Qiaozhu Mei AI4CE 142 2 0 29 May 2025
What Has Been Lost with Synthetic Evaluation? Alexander Gill Abhilasha Ravichander Ana Marasović ELM 324 0 0 28 May 2025
Practical estimation of the optimal classification error with soft labels and calibration Ryota Ushio Takashi Ishida Masashi Sugiyama 239 1 0 27 May 2025
Are Language Models Consequentialist or Deontological Moral Reasoners? Keenan Samway Max Kleiman-Weiner David Guzman Piedrahita Amélie Reymond Bernhard Schölkopf Zhijing Jin ELM LRM 180 3 0 27 May 2025
Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement Meghaj Tarte Agam Shah Chao Zhang Sudheer Chava 297 1 0 26 May 2025
Enhancing Visual Reliance in Text Generation: A Bayesian Perspective on Mitigating Hallucination in Large Vision-Language Models Nanxing Hu Xiaoyue Duan Jinchao Zhang Guoliang Kang MLLM 306 2 0 26 May 2025
SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback Peiran Wang Ye Yu Kai Wei Haojing Luo Haohan Wang 238 1 0 26 May 2025
Recalibrating the Compass: Integrating Large Language Models into Classical Research Methods Tai-Quan Peng Xuzhen Yang 264 2 0 26 May 2025
Large Language Models in the Task of Automatic Validation of Text Classifier Predictions Aleksandr Tsymbalov Mikhail Khovrichev 265 0 0 24 May 2025
DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors Tazeek Bin Abdur Rakib Ambuj Mehrish Lay-Ki Soon Wern Han Lim Soujanya Poria OffRL 236 2 0 23 May 2025
Telco-oRAG: Optimizing Retrieval-augmented Generation for Telecom Queries via Hybrid Retrieval and Neural RoutingIEEE Journal on Selected Areas in Communications (JSAC), 2025 Andrei-Laurentiu Bornea Fadhel Ayed Antonio De Domenico Nicola Piovesan Tareq Si Salem Ali Maatouk 219 1 0 17 May 2025
Stepwise Guided Policy Optimization: Coloring your Incorrect Reasoning in GRPO Peter Chen Xiaopeng Li Zhiyu Li Xi Chen Tianyi Lin 503 0 0 16 May 2025
An AI-Powered Research Assistant in the Lab: A Practical Guide for Text Analysis Through Iterative Collaboration with LLMs Gino Carmona-Díaz William Jiménez-Leal María Alejandra Grisales Chandra Sripada Santiago Amaya Michael Inzlicht Juan Pablo Bermúdez 259 1 0 14 May 2025
QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation Mengze Hong Wailing Ng Chen Zhang Chen Zhang ELM 296 7 0 08 May 2025
A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text ClassificationInternational Conference on Applications of Natural Language to Data Bases (NLDB), 2025 Junichiro Niimi 281 5 0 26 Apr 2025
Can Third-parties Read Our Emotions?Annual Meeting of the Association for Computational Linguistics (ACL), 2025 Jiayi Li Yingfan Zhou Pranav Narayanan Venkit Halima Binte Islam Sneha Arya Shomir Wilson Sarah Rajtmajer 676 2 0 25 Apr 2025
Deep literature reviews: an application of fine-tuned language models to migration research Stefano M. Iacus Haodong Qi Jiyoung Han 249 1 0 17 Apr 2025
AI Safety Should Prioritize the Future of Work Sanchaita Hazra Bodhisattwa Prasad Majumder Tuhin Chakrabarty 297 3 0 16 Apr 2025
The Art of Audience Engagement: LLM-Based Thin-Slicing of Scientific TalksFrontiers in Communication (Front. Commun.), 2025 Ralf Schmälzle Sue Lim Yuetong Du Gary Bente 86 0 0 15 Apr 2025
A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future Jialun Zhong Wei Shen Yanzeng Li Songyang Gao Hua Lu Yicheng Chen Yang Zhang Wei Zhou Jinjie Gu Lei Zou LRM 336 27 0 12 Apr 2025
A Fully Automated Pipeline for Conversational Discourse Annotation: Tree Scheme Generation and Labeling with Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Kseniia Petukhova Ekaterina Kochmar 301 2 0 11 Apr 2025
Utility-Focused LLM Annotation for Retrieval and Retrieval-Augmented Generation Hengran Zhang Minghao Tang Keping Bi Jiafeng Guo Shihao Liu Daiting Shi Dawei Yin Xueqi Cheng 512 1 0 07 Apr 2025
ArXivBench: When You Should Avoid Using ChatGPT for Academic Writing Ning Li Jingran Zhang Justin Cui 234 0 0 06 Apr 2025
AI-induced sexual harassment: Investigating Contextual Characteristics and User Reactions of Sexual Harassment by a Companion ChatbotProceedings of the ACM on Human-Computer Interaction (PACMHCI), 2025 Mohammad Namvarpour Harrison Pauwels Afsaneh Razi 250 6 0 05 Apr 2025
The Lyme Disease Controversy: An AI-Driven Discourse Analysis of a Quarter Century of Academic Debate and DividesmedRxiv (medRxiv), 2025 Teo Susnjak Cole Palffy Tatiana Zimina Nazgul Altynbekova Kunal Garg Leona Gilbert 812 0 0 04 Apr 2025
Hide and Seek in Noise Labels: Noise-Robust Collaborative Active Learning with LLM-Powered AssistanceAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Bo Yuan Yulin Chen Yin Zhang Wei Jiang NoLa 392 21 0 03 Apr 2025
Evaluating how LLM annotations represent diverse views on contentious topics Megan A. Brown Shubham Atreja Libby Hemphill Patrick Y. Wu 928 3 0 29 Mar 2025
Elite Political Discourse has Become More Toxic in Western Countries Petter Törnberg Juliana Chueri 150 1 0 28 Mar 2025
Navigating the Risks of Using Large Language Models for Text Annotation in Social Science Research Hao Lin Yongjun Zhang 206 2 0 27 Mar 2025
Synthetic Data Augmentation for Cross-domain Implicit Discourse Relation Recognition Frances Yung Varsha Suresh Zaynab Reza Mansoor Ahmad Vera Demberg 347 1 0 26 Mar 2025
"Whose Side Are You On?" Estimating Ideology of Political and News Content Using Large Language Models and Few-shot Demonstration Selection Muhammad Haroon Magdalena Wojcieszak Anshuman Chhabra 412 0 0 23 Mar 2025
TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning Sheng Wang Pengan Chen Jingqi Zhou Qintong Li Jingwei Dong Lei Li Boyang Xue Jiyue Jiang Dianbo Sui Chuan Wu SyDa 432 0 0 21 Mar 2025
Reassessing Active Learning Adoption in Contemporary NLP: A Community Survey Julia Romberg Christopher Schröder Julius Gonsior Katrin Tomanek Fredrik Olsson 384 2 0 12 Mar 2025
How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale Jeanette Falk Yiyi Chen Janet Rafner Mike Zhang Johannes Bjerva Alexander Nolte 274 2 0 06 Mar 2025
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering DatasetsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025 Preetam Prabhu Srikar Dammu Himanshu Naidu Chirag Shah 459 3 0 06 Mar 2025
Figurative Archive: an open dataset and web-based application for the study of metaphor Maddalena Bressler Veronica Mangiaterra Paolo Canal Federico Frau Fabrizio Luciani ... Chiara Battaglini Chiara Pompei Fortunata Romeo L. Bischetti V. Bambini 251 3 0 01 Mar 2025
Text-to-SQL Domain Adaptation via Human-LLM Collaborative Data AnnotationInternational Conference on Intelligent User Interfaces (IUI), 2025 Yuan Tian Daniel Lee Fei Wu Tung Mai Kun Qian Siddhartha Sahai Tianyi Zhang Yunyao Li SyDa 600 5 0 21 Feb 2025
SEFL: Enhancing Educational Assignment Feedback with LLM Agents Mike Zhang Amalie Pernille Dilling Léon Gondelman Niels Erik Ruan Lyngdorf Euan D Lindsay Johannes Bjerva AI4Ed SyDa 294 1 0 18 Feb 2025
Reasoning on a Spectrum: Aligning LLMs to System 1 and System 2 Thinking Alireza S. Ziabari Nona Ghazizadeh Zhivar Sourati Farzan Karimi-Malekabadi Payam Piray Morteza Dehghani LRM 271 13 0 18 Feb 2025

All Papers

ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Papers citing "ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks"