ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023
27 March 2023
Fabrizio Gilardi
Meysam Alizadeh
M. Kubli
    AI4MH

Papers citing "ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks"

50 / 301 papers shown
Improving Alignment Between Human and Machine Codes: An Empirical Assessment of Prompt Engineering for Construct Identification in Psychology
Kylie L. Anglin
Stephanie Milan
Brittney Hernandez
Claudia Ventura
LLMAG
150
0
0
03 Dec 2025
A Comparison of Human and ChatGPT Classification Performance on Complex Social Media Data
Breanna E. Green
Ashley L. Shea
Pengfei Zhao
Drew Margolin
AI4MH
178
0
0
29 Nov 2025
MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation
Mahdi Rahmani
AmirHossein Saffari
Reyhane Rahmani
125
0
0
28 Nov 2025
Constructing and Benchmarking: a Labeled Email Dataset for Text-Based Phishing and Spam Detection Framework
Rebeka Tóth
Tamás Bisztray
Richard A. Dubniczky
180
0
0
26 Nov 2025
SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition
Peiran Xu
Sudong Wang
Yao Zhu
Jianing Li
Yunjian Zhang
LRM
352
2
0
26 Nov 2025
Generative AI in Sociological Research: State of the Discipline
Sociological Science (Sociol Sci), 2025
AJ Alvero
Dustin S. Stoltz
Oscar Stuhler
Marshall A. Taylor
171
1
0
21 Nov 2025
Applying Large Language Models to Characterize Public Narratives
Elinor Poole-Dayan
Daniel Kessler
Hannah Chiou
Margaret Hughes
Emily S Lin
Marshall Ganz
Deb Roy
145
0
0
17 Nov 2025
Increasing AI Explainability by LLM Driven Standard Processes
Marc Jansen
Marcel Pehlke
175
0
0
10 Nov 2025
Can LLM Annotations Replace User Clicks for Learning to Rank?
Lulu Yu
Keping Bi
Jiafeng Guo
Shihao Liu
Shuaiqiang Wang
D. Yin
Xueqi Cheng
177
0
0
10 Nov 2025
Who Is the Story About? Protagonist Entity Recognition in News
Jorge Gabín
M. E. Ares
Javier Parapar
271
0
0
10 Nov 2025
Computational Turing Test Reveals Systematic Differences Between Human and AI Language
Nicolò Pagan
Petter Törnberg
Christopher Bail
Anikó Hannák
Christopher Barrie
192
0
0
06 Nov 2025
Black Box Absorption: LLMs Undermining Innovative Ideas
Wenjun Cao
140
0
0
23 Oct 2025
Algorithmic Fairness in NLP: Persona-Infused LLMs for Human-Centric Hate Speech Detection
Ewelina Gajewska
Arda Derbent
Jaroslaw A Chudziak
K. Budzynska
102
1
0
22 Oct 2025
Online In-Context Distillation for Low-Resource Vision Language Models
Zhiqi Kang
Rahaf Aljundi
Vaggelis Dorovatas
Karteek Alahari
VLM
111
0
0
20 Oct 2025
SHIELD: Suppressing Hallucinations In LVLM Encoders via Bias and Vulnerability Defense
Y. Huang
Liang Shi
Yitian Zhang
Yi Tian Xu
Yun Fu
AAML
92
0
0
18 Oct 2025
Reliability of Large Language Model Generated Clinical Reasoning in Assisted Reproductive Technology: Blinded Comparative Evaluation Study
Dou Liu
Ying Long
Sophia Zuoqiu
Di Liu
Kang Li
Yiting Lin
Hanyi Liu
Rong Yin
Tian Tang
ELM
196
1
0
17 Oct 2025
DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans
Bingsheng Yao
Bo Sun
Yuanzhe Dong
Yuxuan Lu
Dakuo Wang
337
0
0
16 Oct 2025
Stable LLM Ensemble: Interaction between Example Representativeness and Diversity
Junichiro Niimi
147
0
0
15 Oct 2025
FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models
Shengming Yuan
Xinyu Lyu
Shuailong Wang
Beitao Chen
Jingkuan Song
Lianli Gao
LRM
287
0
0
13 Oct 2025
Repurposing Annotation Guidelines to Instruct LLM Annotators: A Case Study
International Conference on Applications of Natural Language to Data Bases (NLDB), 2025
Kon Woo Kim
Rezarta Islamaj
Jin-Dong Kim
Florian Boudin
Akiko Aizawa
126
0
0
13 Oct 2025
D-CoDe: Scaling Image-Pretrained VLMs to Video via Dynamic Compression and Question Decomposition
Y. Huang
Yizhou Wang
Yun Fu
VLM
103
0
0
09 Oct 2025
Populism Meets AI: Advancing Populism Research with LLMs
Eduardo Ryô Tamaki
Julia Chatterley
Grant Mitchell
Semir Dzebo
Cristóbal Sandoval
Levente Littvay
Kirk Hawkins
220
0
0
08 Oct 2025
What is a protest anyway? Codebook conceptualization is still a first-order concern in LLM-era classification
Andrew Halterman
Katherine A. Keith
143
0
0
03 Oct 2025
Unspoken Hints: Accuracy Without Acknowledgement in LLM Reasoning
Arash Marioriyad
Shaygan Adim
Nima Alighardashi
Mahdieh Soleymani Baghshah
M. Rohban
LRM
95
1
0
30 Sep 2025
Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis
Leitian Tao
Xuefeng Du
Shouqing Yang
SyDa
215
1
0
30 Sep 2025
Building Benchmarks from the Ground Up: Community-Centered Evaluation of LLMs in Healthcare Chatbot Settings
Hamna
G. Bhat
Sourabrata Mukherjee
Faisal Lalani
Evan Hadfield
Divya Siddarth
Kalika Bali
Sunayana Sitaram
LM&MA, AI4MH
186
1
0
29 Sep 2025
Building Data-Driven Occupation Taxonomies: A Bottom-Up Multi-Stage Approach via Semantic Clustering and Multi-Agent Collaboration
Nan Li
Bo Kang
T. D. Bie
106
0
0
19 Sep 2025
We Argue to Agree: Towards Personality-Driven Argumentation-Based Negotiation Dialogue Systems for Tourism
Priyanshu Priya
Saurav Dudhate
Desai Vishesh Yasheshbhai
Asif Ekbal
155
1
0
14 Sep 2025
Emulating Public Opinion: A Proof-of-Concept of AI-Generated Synthetic Survey Responses for the Chilean Case
Bastián González-Bustamante
Nando Verelst
Carla Cisternas
SyDa
134
0
0
11 Sep 2025
Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals
Cheng Chen
Haiyan Yin
Ivor Tsang
158
1
0
10 Sep 2025
Timing the Message: Language-Based Notifications for Time-Critical Assistive Settings
Ya-Chuan Hsu
Jonathan A. DeCastro
Andrew Silva
Guy Rosman
120
1
0
09 Sep 2025
PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions
Yixuan Tang
Yi Yang
Ahmed Abbasi
212
1
0
09 Sep 2025
CURE: Controlled Unlearning for Robust Embeddings - Mitigating Conceptual Shortcuts in Pre-Trained Language Models
Aysenur Kocak
Shuo Yang
Bardh Prenkaj
Gjergji Kasneci
108
0
0
05 Sep 2025
Evaluating the Robustness of Retrieval-Augmented Generation to Adversarial Evidence in the Health Domain
Shakiba Amirshahi
Amin Bigdeli
Charles L. A. Clarke
Amira Ghenai
AAML
145
2
0
04 Sep 2025
Leveraging Media Frames to Improve Normative Diversity in News Recommendations
Sourabh Dattawad
Agnese Daffara
Tanise Ceron
86
1
0
02 Sep 2025
PromptSleuth: Detecting Prompt Injection via Semantic Intent Invariance
Mengxiao Wang
Yuxuan Zhang
Guofei Gu
AAML, SILM
160
0
0
28 Aug 2025
Neither Valid nor Reliable? Investigating the Use of LLMs as Judges
Khaoula Chehbouni
Mohammed Haddou
Jackie CK Cheung
G. Farnadi
LLMAG
340
8
0
25 Aug 2025
The Impact of Annotator Personas on LLM Behavior Across the Perspectivism Spectrum
O. O. Sarumi
Charles F Welch
Daniel Braun
Jorg Schlotterer
104
0
0
23 Aug 2025
The Promise of Large Language Models in Digital Health: Evidence from Sentiment Analysis in Online Health Communities
Xiancheng Li
Georgios D. Karampatakis
Helen E. Wood
Chris J. Griffiths
Borislava Mihaylova
Neil S. Coulson
Alessio Pasinato
P. Panzarasa
Marco Viviani
Anna De Simoni
AI4MH, LM&MA
131
0
0
19 Aug 2025
"Not in My Backyard": LLMs Uncover Online and Offline Social Biases Against Homelessness
Jonathan A. Karr Jr.
Benjamin F. Herbst
Ting Hua
Matthew Hauenstein
Georgina Curto
Nitesh Chawla
93
0
0
14 Aug 2025
SYNAPSE-G: Bridging Large Language Models and Graph Learning for Rare Event Classification
S. Tavakkol
Lin Chen
Max Springer
Abigail Schantz
Blaž Bratanič
Vincent Cohen-Addad
M. Bateni
157
0
0
13 Aug 2025
Evaluating Large Language Models as Expert Annotators
Yu-Min Tseng
Wei-Lin Chen
Chung-Chi Chen
Hsin-Hsi Chen
156
2
0
11 Aug 2025
A Confidence-Diversity Framework for Calibrating AI Judgement in Accessible Qualitative Coding Tasks
Zhilong Zhao
Yindi Liu
151
0
0
04 Aug 2025
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
Ping Yu
Jack Lanchantin
Tianlu Wang
Weizhe Yuan
O. Yu. Golovneva
I. Kulikov
Sainbayar Sukhbaatar
Jason Weston
Jing Xu
SyDa, ReLM, LRM
294
12
0
31 Jul 2025
Doubling Your Data in Minutes: Ultra-fast Tabular Data Generation via LLM-Induced Dependency Graphs
Shuo Yang
Zheyu Zhang
Bardh Prenkaj
Gjergji Kasneci
201
4
0
25 Jul 2025
Hybrid Annotation for Propaganda Detection: Integrating LLM Pre-Annotations with Human Intelligence
Ariana Sahitaj
Premtim Sahitaj
Veronika Solopova
Jiaao Li
Sebastian Möller
Vera Schmitt
99
1
0
24 Jul 2025
AQuilt: Weaving Logic and Self-Inspection into Low-Cost, High-Relevance Data Synthesis for Specialist LLMs
Xiaopeng Ke
Hexuan Deng
Xuebo Liu
Jun Rao
Zhenxi Song
Jun-chen Yu
Min Zhang
SyDa
234
1
0
24 Jul 2025
VeriMinder: Mitigating Analytical Vulnerabilities in NL2SQL
Shubham Mohole
Sainyam Galhotra
AAML
149
1
0
23 Jul 2025
Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries
Victor Hartman
Petter Törnberg
127
0
0
23 Jul 2025
Backtranslation and paraphrasing in the LLM era? Comparing data augmentation methods for emotion classification
International Conference on Conceptual Structures (ICCS), 2025
Łukasz Radliński
Mateusz Guściora
Jan Kocoń
132
2
0
19 Jul 2025