ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023
27 March 2023
Fabrizio Gilardi
Meysam Alizadeh
M. Kubli
    AI4MH

Papers citing "ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks"

50 / 285 papers shown
Title
Constructing and Benchmarking: a Labeled Email Dataset for Text-Based Phishing and Spam Detection Framework
Rebeka Tóth
Tamás Bisztray
Richard A. Dubniczky
125
0
0
26 Nov 2025
SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition
Peiran Xu
Sudong Wang
Yao Zhu
Jianing Li
Yunjian Zhang
LRM
234
0
0
26 Nov 2025
Applying Large Language Models to Characterize Public Narratives
Elinor Poole-Dayan
Daniel Kessler
Hannah Chiou
Margaret Hughes
Emily S Lin
Marshall Ganz
Deb Roy
77
0
0
17 Nov 2025
Increasing AI Explainability by LLM Driven Standard Processes
Marc Jansen
Marcel Pehlke
126
0
0
10 Nov 2025
Who Is the Story About? Protagonist Entity Recognition in News
Jorge Gabín
M. E. Ares
Javier Parapar
202
0
0
10 Nov 2025
Can LLM Annotations Replace User Clicks for Learning to Rank?
Lulu Yu
Keping Bi
Jiafeng Guo
Shihao Liu
Shuaiqiang Wang
D. Yin
Xueqi Cheng
108
0
0
10 Nov 2025
Computational Turing Test Reveals Systematic Differences Between Human and AI Language
Nicolò Pagan
Petter Törnberg
Christopher Bail
Anikó Hannák
Christopher Barrie
128
0
0
06 Nov 2025
Black Box Absorption: LLMs Undermining Innovative Ideas
Wenjun Cao
108
0
0
23 Oct 2025
Algorithmic Fairness in NLP: Persona-Infused LLMs for Human-Centric Hate Speech Detection
Ewelina Gajewska
Arda Derbent
Jaroslaw A Chudziak
K. Budzynska
56
0
0
22 Oct 2025
Online In-Context Distillation for Low-Resource Vision Language Models
Zhiqi Kang
Rahaf Aljundi
Vaggelis Dorovatas
Karteek Alahari
VLM
68
0
0
20 Oct 2025
SHIELD: Suppressing Hallucinations In LVLM Encoders via Bias and Vulnerability Defense
Y. Huang
Liang Shi
Yitian Zhang
Yi Tian Xu
Yun Fu
AAML
76
0
0
18 Oct 2025
Reliability of Large Language Model Generated Clinical Reasoning in Assisted Reproductive Technology: Blinded Comparative Evaluation Study
Dou Liu
Ying Long
Sophia Zuoqiu
Di Liu
Kang Li
Yiting Lin
Hanyi Liu
Rong Yin
Tian Tang
ELM
117
1
0
17 Oct 2025
DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans
Bingsheng Yao
Bo Sun
Yuanzhe Dong
Yuxuan Lu
Dakuo Wang
271
0
0
16 Oct 2025
Stable LLM Ensemble: Interaction between Example Representativeness and Diversity
Junichiro Niimi
100
0
0
15 Oct 2025
FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models
Shengming Yuan
Xinyu Lyu
Shuailong Wang
Beitao Chen
Jingkuan Song
Lianli Gao
LRM
146
0
0
13 Oct 2025
Repurposing Annotation Guidelines to Instruct LLM Annotators: A Case Study
International Conference on Applications of Natural Language to Data Bases (NLDB), 2025
Kon Woo Kim
Rezarta Islamaj
Jin-Dong Kim
Florian Boudin
Akiko Aizawa
72
0
0
13 Oct 2025
D-CoDe: Scaling Image-Pretrained VLMs to Video via Dynamic Compression and Question Decomposition
Y. Huang
Yizhou Wang
Yun Fu
VLM
66
0
0
09 Oct 2025
Populism Meets AI: Advancing Populism Research with LLMs
Eduardo Ryô Tamaki
Julia Chatterley
Grant Mitchell
Semir Dzebo
Cristóbal Sandoval
Levente Littvay
Kirk Hawkins
170
0
0
08 Oct 2025
What is a protest anyway? Codebook conceptualization is still a first-order concern in LLM-era classification
Andrew Halterman
Katherine A. Keith
90
0
0
03 Oct 2025
Unspoken Hints: Accuracy Without Acknowledgement in LLM Reasoning
Arash Marioriyad
Shaygan Adim
Nima Alighardashi
Mahdieh Soleymani Baghshah
M. Rohban
LRM
55
1
0
30 Sep 2025
Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis
Leitian Tao
Xuefeng Du
Shouqing Yang
SyDa
160
0
0
30 Sep 2025
Building Benchmarks from the Ground Up: Community-Centered Evaluation of LLMs in Healthcare Chatbot Settings
Hamna
G. Bhat
Sourabrata Mukherjee
Faisal Lalani
Evan Hadfield
Divya Siddarth
Kalika Bali
Sunayana Sitaram
LM&MA AI4MH
124
0
0
29 Sep 2025
Building Data-Driven Occupation Taxonomies: A Bottom-Up Multi-Stage Approach via Semantic Clustering and Multi-Agent Collaboration
Nan Li
Bo Kang
T. D. Bie
72
0
0
19 Sep 2025
We Argue to Agree: Towards Personality-Driven Argumentation-Based Negotiation Dialogue Systems for Tourism
Priyanshu Priya
Saurav Dudhate
Desai Vishesh Yasheshbhai
Asif Ekbal
92
0
0
14 Sep 2025
Emulating Public Opinion: A Proof-of-Concept of AI-Generated Synthetic Survey Responses for the Chilean Case
Bastián González-Bustamante
Nando Verelst
Carla Cisternas
SyDa
72
0
0
11 Sep 2025
Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals
Cheng Chen
Haiyan Yin
Ivor Tsang
108
1
0
10 Sep 2025
PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions
Yixuan Tang
Yi Yang
Ahmed Abbasi
164
0
0
09 Sep 2025
Timing the Message: Language-Based Notifications for Time-Critical Assistive Settings
Ya-Chuan Hsu
Jonathan A. DeCastro
Andrew Silva
Guy Rosman
52
1
0
09 Sep 2025
CURE: Controlled Unlearning for Robust Embeddings - Mitigating Conceptual Shortcuts in Pre-Trained Language Models
Aysenur Kocak
Shuo Yang
Bardh Prenkaj
Gjergji Kasneci
77
0
0
05 Sep 2025
Evaluating the Robustness of Retrieval-Augmented Generation to Adversarial Evidence in the Health Domain
Shakiba Amirshahi
Amin Bigdeli
Charles L. A. Clarke
Amira Ghenai
AAML
84
1
0
04 Sep 2025
Leveraging Media Frames to Improve Normative Diversity in News Recommendations
Sourabh Dattawad
Agnese Daffara
Tanise Ceron
48
1
0
02 Sep 2025
PromptSleuth: Detecting Prompt Injection via Semantic Intent Invariance
Mengxiao Wang
Yuxuan Zhang
Guofei Gu
AAML SILM
120
0
0
28 Aug 2025
Neither Valid nor Reliable? Investigating the Use of LLMs as Judges
Khaoula Chehbouni
Mohammed Haddou
Jackie CK Cheung
G. Farnadi
LLMAG
277
5
0
25 Aug 2025
The Impact of Annotator Personas on LLM Behavior Across the Perspectivism Spectrum
O. O. Sarumi
Charles F Welch
Daniel Braun
Jorg Schlotterer
72
0
0
23 Aug 2025
The Promise of Large Language Models in Digital Health: Evidence from Sentiment Analysis in Online Health Communities
Xiancheng Li
Georgios D. Karampatakis
Helen E. Wood
Chris J. Griffiths
Borislava Mihaylova
Neil S. Coulson
Alessio Pasinato
P. Panzarasa
Marco Viviani
Anna De Simoni
AI4MH LM&MA
91
0
0
19 Aug 2025
Combating Homelessness Stigma with LLMs: A New Multi-Modal Dataset for Bias Detection
Jonathan A. Karr Jr.
Benjamin F. Herbst
Ting Hua
Matthew Hauenstein
Georgina Curto
Nitesh Chawla
60
0
0
14 Aug 2025
SYNAPSE-G: Bridging Large Language Models and Graph Learning for Rare Event Classification
S. Tavakkol
Lin Chen
Max Springer
Abigail Schantz
Blaž Bratanič
Vincent Cohen-Addad
M. Bateni
120
0
0
13 Aug 2025
Evaluating Large Language Models as Expert Annotators
Yu-Min Tseng
Wei-Lin Chen
Chung-Chi Chen
Hsin-Hsi Chen
92
1
0
11 Aug 2025
A Confidence-Diversity Framework for Calibrating AI Judgement in Accessible Qualitative Coding Tasks
Zhilong Zhao
Yindi Liu
114
0
0
04 Aug 2025
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
Ping Yu
Jack Lanchantin
Tianlu Wang
Weizhe Yuan
O. Yu. Golovneva
I. Kulikov
Sainbayar Sukhbaatar
Jason Weston
Jing Xu
SyDa ReLM LRM
223
10
0
31 Jul 2025
Doubling Your Data in Minutes: Ultra-fast Tabular Data Generation via LLM-Induced Dependency Graphs
Shuo Yang
Zheyu Zhang
Bardh Prenkaj
Gjergji Kasneci
145
4
0
25 Jul 2025
AQuilt: Weaving Logic and Self-Inspection into Low-Cost, High-Relevance Data Synthesis for Specialist LLMs
Xiaopeng Ke
Hexuan Deng
Xuebo Liu
Jun Rao
Zhenxi Song
Jun-chen Yu
Min Zhang
SyDa
178
1
0
24 Jul 2025
Hybrid Annotation for Propaganda Detection: Integrating LLM Pre-Annotations with Human Intelligence
Ariana Sahitaj
Premtim Sahitaj
Veronika Solopova
Jiaao Li
Sebastian Möller
Vera Schmitt
53
1
0
24 Jul 2025
Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries
Victor Hartman
Petter Törnberg
66
0
0
23 Jul 2025
VeriMinder: Mitigating Analytical Vulnerabilities in NL2SQL
Shubham Mohole
Sainyam Galhotra
AAML
96
1
0
23 Jul 2025
Backtranslation and paraphrasing in the LLM era? Comparing data augmentation methods for emotion classification
International Conference on Conceptual Structures (ICCS), 2025
Łukasz Radliński
Mateusz Guściora
Jan Kocoń
55
1
0
19 Jul 2025
Identify, Isolate, and Purge: Mitigating Hallucinations in LVLMs via Self-Evolving Distillation
Wenhao Li
Xiu Su
Jingyi Wu
Feng Yang
Yang-Yang Liu
Yi-Ling Chen
Shan You
Chang Xu
VLM
143
0
0
07 Jul 2025
Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games
David Guzman Piedrahita
Yongjin Yang
Mrinmaya Sachan
Giorgia Ramponi
Bernhard Schölkopf
Zhijing Jin
LLMAG LRM
156
4
0
29 Jun 2025
Advancing Harmful Content Detection in Organizational Research: Integrating Large Language Models with Elo Rating System
Mustafa Akben
Aaron Satko
142
0
0
19 Jun 2025
VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents
Sam Yu-Te Lee
Chenyang Ji
Shicheng Wen
Lifu Huang
Dongyu Liu
Kwan-Liu Ma
230
0
0
17 Jun 2025