ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.15056
  4. Cited By
ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks
v1v2 (latest)

ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023
27 March 2023
Fabrizio Gilardi
Meysam Alizadeh
M. Kubli
    AI4MH
ArXiv (abs)PDFHTML

Papers citing "ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks"

50 / 301 papers shown
SEFL: Enhancing Educational Assignment Feedback with LLM Agents
SEFL: Enhancing Educational Assignment Feedback with LLM Agents
Mike Zhang
Amalie Pernille Dilling
Léon Gondelman
Niels Erik Ruan Lyngdorf
Euan D Lindsay
Johannes Bjerva
AI4EdSyDa
330
1
0
18 Feb 2025
Escaping Collapse: The Strength of Weak Data for Large Language Model Training
Escaping Collapse: The Strength of Weak Data for Large Language Model Training
Kareem Amin
Sara Babakniya
Alex Bie
Weiwei Kong
Umar Syed
Sergei Vassilvitskii
418
5
0
13 Feb 2025
Measuring Diversity in Synthetic Datasets
Measuring Diversity in Synthetic Datasets
Yuchang Zhu
Huizhe Zhang
Bingzhe Wu
Jintang Li
Zibin Zheng
Peilin Zhao
Liang Chen
Yatao Bian
460
0
0
12 Feb 2025
AI Alignment at Your Discretion
AI Alignment at Your DiscretionConference on Fairness, Accountability and Transparency (FAccT), 2025
Maarten Buyl
Hadi Khalaf
C. M. Verdun
Lucas Monteiro Paes
Caio Vieira Machado
Flavio du Pin Calmon
337
10
0
10 Feb 2025
Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy
Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy
Kamyar Kazari
Yong Chen
Zahra Shakeri
AI4MH
298
3
0
10 Feb 2025
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data
Thomas Zeng
Shuibai Zhang
Shutong Wu
Christian Classen
Daewon Chae
...
Jungtaek Kim
H. Koo
Kannan Ramchandran
Dimitris Papailiopoulos
Kangwook Lee
LRM
336
18
0
10 Feb 2025
Few-shot LLM Synthetic Data with Distribution Matching
Few-shot LLM Synthetic Data with Distribution MatchingThe Web Conference (WWW), 2025
Jiyuan Ren
Zhaocheng Du
Zhihao Wen
Qinglin Jia
Sunhao Dai
Chuhan Wu
Zhenhua Dong
SyDa
445
5
0
09 Feb 2025
Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion GenerationInternational Journal of Computer Vision (IJCV), 2025
Yin Wang
Mu Li
Jiapeng Liu
Zhiying Leng
Frederick W. B. Li
Ziyao Zhang
Xiaohui Liang
258
35
0
08 Feb 2025
Aligning Black-box Language Models with Human Judgments
Aligning Black-box Language Models with Human JudgmentsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Gerrit J. J. van den Burg
Gen Suzuki
Wei Liu
Murat Sensoy
ALM
313
2
0
07 Feb 2025
Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet
Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet
Berk Atil
Vipul Gupta
Sarkar Snigdha Sarathi Das
R. Passonneau
850
0
0
07 Feb 2025
Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy
Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop StrategyNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Tunazzina Islam
Dan Goldwasser
438
9
0
28 Jan 2025
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Thomas Walshe
S. Moon
Chunyang Xiao
Yawwani Gunawardana
Fran Silavong
266
5
0
21 Jan 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
566
45
0
10 Jan 2025
Predictable Artificial Intelligence
Predictable Artificial Intelligence
Lexin Zhou
Pablo Antonio Moreno Casares
Fernando Martínez-Plumed
John Burden
Ryan Burnell
...
Seán Ó hÉigeartaigh
Danaja Rutar
Wout Schellaert
Konstantinos Voudouris
José Hernández-Orallo
513
8
0
08 Jan 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
481
44
0
03 Jan 2025
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking AgentsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Sudipta Singha Roy
Zhumin Chen
D. Yin
Zhaochun Ren
RALMALMELMLRMLM&MA
688
437
0
31 Dec 2024
STAYKATE: Hybrid In-Context Example Selection Combining Representativeness Sampling and Retrieval-based Approach -- A Case Study on Science Domains
STAYKATE: Hybrid In-Context Example Selection Combining Representativeness Sampling and Retrieval-based Approach -- A Case Study on Science Domains
Chencheng Zhu
Kazutaka Shimada
Tomoki Taniguchi
Tomoko Ohkuma
214
1
0
31 Dec 2024
Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
Lynn Greschner
Roman Klinger
390
10
0
20 Dec 2024
Empowering LLMs to Understand and Generate Complex Vector Graphics
Empowering LLMs to Understand and Generate Complex Vector GraphicsComputer Vision and Pattern Recognition (CVPR), 2024
Ximing Xing
Juncheng Hu
Guotao Liang
Jing Zhang
Dong Xu
Qian Yu
556
30
0
15 Dec 2024
A Scoping Review of ChatGPT Research in Accounting and Finance
A Scoping Review of ChatGPT Research in Accounting and FinanceInternational Journal of Accounting Information Systems (IJAIS), 2024
Mengming Michael Dong
Theophanis C. Stratopoulos
Victor Xiaoqi Wang
306
57
0
07 Dec 2024
Large corpora and large language models: a replicable method for automating grammatical annotationLinguistics Vanguard (LV), 2024
Cameron Morin
Matti Marttinen Larsson
349
4
0
18 Nov 2024
The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias DetectionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Tomas Horych
Christoph Mandl
Terry Ruas
André Greiner-Petter
Bela Gipp
Akiko Aizawa
Timo Spinde
552
15
0
17 Nov 2024
Performance-Guided LLM Knowledge Distillation for Efficient Text
  Classification at Scale
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at ScaleConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Flavio Di Palo
Prateek Singhi
Bilal Fadlallah
135
17
0
07 Nov 2024
Evaluating Creative Short Story Generation in Humans and Large Language Models
Evaluating Creative Short Story Generation in Humans and Large Language Models
Mete Ismayilzada
Claire Stevenson
Lonneke van der Plas
LM&MALRM
552
13
0
04 Nov 2024
A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why?
A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why?
QiHong Chen
Jiawei Li
Jiecheng Deng
Jiachen Yu
Justin Tian Jin Chen
Iftekhar Ahmed
350
6
0
03 Nov 2024
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
John Wu
David Wu
Jimeng Sun
459
1
0
31 Oct 2024
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with
  Annual Updates
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual UpdatesNeural Information Processing Systems (NeurIPS), 2024
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
258
7
0
28 Oct 2024
PRISM: A Methodology for Auditing Biases in Large Language Models
PRISM: A Methodology for Auditing Biases in Large Language Models
Leif Azzopardi
Yashar Moshfeghi
222
3
0
24 Oct 2024
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance
Omer Nahum
Nitay Calderon
Orgad Keller
Idan Szpektor
Roi Reichart
280
11
0
24 Oct 2024
PAPILLON: Privacy Preservation from Internet-based and Local Language Model Ensembles
PAPILLON: Privacy Preservation from Internet-based and Local Language Model EnsemblesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Li Siyan
Vethavikashini Chithrra Raghuram
Omar Khattab
Julia Hirschberg
Zhou Yu
450
30
0
22 Oct 2024
De-mark: Watermark Removal in Large Language Models
De-mark: Watermark Removal in Large Language Models
Ruibo Chen
Yihan Wu
Junfeng Guo
Heng Huang
WaLMVLM
325
8
0
17 Oct 2024
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison FeedbackNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zonghai Yao
Aditya Parashar
Huixue Zhou
Won Seok Jang
Feiyun Ouyang
Zhichao Yang
Hong-ye Yu
ELM
428
16
0
17 Oct 2024
Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the dataInternational Conference on Learning Representations (ICLR), 2024
Florian E. Dorner
Vivian Y. Nastl
Moritz Hardt
ELMALM
417
23
0
17 Oct 2024
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of
  LLMs
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMsInternational Conference on Computational Linguistics (COLING), 2024
Yijie Li
Yuan Sun
ELM
156
1
0
13 Oct 2024
JurEE not Judges: safeguarding llm interactions with small, specialised
  Encoder Ensembles
JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles
Dom Nasrabadi
346
2
0
11 Oct 2024
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
RevisEval: Improving LLM-as-a-Judge via Response-Adapted ReferencesInternational Conference on Learning Representations (ICLR), 2024
Qiyuan Zhang
Yufei Wang
Tiezheng YU
Yuxin Jiang
Chuhan Wu
...
Xin Jiang
Lifeng Shang
Ruiming Tang
Fuyuan Lyu
Chen Ma
395
14
0
07 Oct 2024
Post-hoc Study of Climate Microtargeting on Social Media Ads with LLMs: Thematic Insights and Fairness Evaluation
Post-hoc Study of Climate Microtargeting on Social Media Ads with LLMs: Thematic Insights and Fairness Evaluation
Tunazzina Islam
Dan Goldwasser
597
8
0
07 Oct 2024
Hate Personified: Investigating the role of LLMs in content moderation
Hate Personified: Investigating the role of LLMs in content moderationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sarah Masud
Sahajpreet Singh
Viktor Hangya
Kangyang Luo
Tanmoy Chakraborty
213
20
0
03 Oct 2024
'Simulacrum of Stories': Examining Large Language Models as Qualitative
  Research Participants
'Simulacrum of Stories': Examining Large Language Models as Qualitative Research ParticipantsInternational Conference on Human Factors in Computing Systems (CHI), 2024
Shivani Kapania
William Agnew
Motahhare Eslami
Hoda Heidari
Sarah E Fox
226
19
0
28 Sep 2024
Learning to Localize Actions in Instructional Videos with LLM-Based
  Multi-Pathway Text-Video Alignment
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video AlignmentEuropean Conference on Computer Vision (ECCV), 2024
Yuxiao Chen
Keqin Li
Wentao Bao
Deep Patel
Yu Kong
Martin Renqiang Min
Dimitris N. Metaxas
DiffM
297
5
0
22 Sep 2024
Human Interest or Conflict? Leveraging LLMs for Automated Framing
  Analysis in TV Shows
Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows
David Alonso del Barrio
Max Tiel
D. Gática-Pérez
269
6
0
19 Sep 2024
What Would You Ask When You First Saw $a^2+b^2=c^2$? Evaluating LLM on Curiosity-Driven Questioning
What Would You Ask When You First Saw a2+b2=c2a^2+b^2=c^2a2+b2=c2? Evaluating LLM on Curiosity-Driven Questioning
Shashidhar Reddy Javaji
Zining Zhu
ELMALM
350
1
0
19 Sep 2024
Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation
  with LLMs
Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Yifan Wang
David Stevens
Pranay Shah
Wenwen Jiang
Miao Liu
...
Boying Gong
Daniel Lee
Jiabo Hu
Ning Zhang
Bob Kamma
247
5
0
16 Sep 2024
Keeping Humans in the Loop: Human-Centered Automated Annotation with
  Generative AI
Keeping Humans in the Loop: Human-Centered Automated Annotation with Generative AIInternational Conference on Web and Social Media (ICWSM), 2024
Nicholas Pangakis
Samuel Wolken
308
13
0
14 Sep 2024
Safeguarding Decentralized Social Media: LLM Agents for Automating
  Community Rule Compliance
Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance
Lucio La Cava
Andrea Tagarelli
LLMAG
169
2
0
13 Sep 2024
Your Weak LLM is Secretly a Strong Teacher for Alignment
Your Weak LLM is Secretly a Strong Teacher for AlignmentInternational Conference on Learning Representations (ICLR), 2024
Leitian Tao
Yixuan Li
583
15
0
13 Sep 2024
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
A. Lupidi
Carlos Gemmell
Nicola Cancedda
Jane Dwivedi-Yu
Jason Weston
Jakob Foerster
Roberta Raileanu
Maria Lomeli
SyDa
443
22
0
12 Sep 2024
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training
  Data
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data
Hossein Hajipour
Lea Schönherr
Thorsten Holz
Mario Fritz
AAMLSyDa
157
8
0
10 Sep 2024
Content Moderation by LLM: From Accuracy to Legitimacy
Content Moderation by LLM: From Accuracy to LegitimacyArtificial Intelligence Review (Artif Intell Rev), 2024
Tao Huang
AILaw
375
23
0
05 Sep 2024
Can Unconfident LLM Annotations Be Used for Confident Conclusions?
Can Unconfident LLM Annotations Be Used for Confident Conclusions?North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kristina Gligorić
Tijana Zrnic
Cinoo Lee
Emmanuel J. Candès
Dan Jurafsky
400
33
0
27 Aug 2024
Previous
1234567
Next
Page 3 of 7