ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023

27 March 2023

Papers citing "ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks"

50 / 311 papers shown

SEFL: A Framework for Generating Synthetic Educational Assignment Feedback with LLM Agents

Mike Zhang

Amalie Pernille Dilling

Léon Gondelman

Niels Erik Ruan Lyngdorf

Euan D Lindsay

Johannes Bjerva

AI4Ed SyDa

355

18 Feb 2025

Reasoning on a Spectrum: Aligning LLMs to System 1 and System 2 Thinking

Alireza S. Ziabari

Nona Ghazizadeh

Zhivar Sourati

Farzan Karimi-Malekabadi

Payam Piray

Morteza Dehghani

LRM

330

18 Feb 2025

Escaping Collapse: The Strength of Weak Data for Large Language Model Training

461

13 Feb 2025

Measuring Diversity in Synthetic Datasets

482

12 Feb 2025

AI Alignment at Your Discretion

Conference on Fairness, Accountability and Transparency (FAccT), 2025

351

10 Feb 2025

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

...

Dimitris Papailiopoulos

Kangwook Lee

LRM

395

10 Feb 2025

Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy

351

10 Feb 2025

Few-shot LLM Synthetic Data with Distribution Matching

The Web Conference (WWW), 2025

502

09 Feb 2025

Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation

International Journal of Computer Vision (IJCV), 2025

314

08 Feb 2025

Aligning Black-box Language Models with Human Judgments

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

Gerrit J. J. van den Burg

332

07 Feb 2025

Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet

Berk Atil

Vipul Gupta

Sarkar Snigdha Sarathi Das

R. Passonneau

859

07 Feb 2025

Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Tunazzina Islam

Dan Goldwasser

450

28 Jan 2025

Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration

269

21 Jan 2025

LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs

...

605

10 Jan 2025

Predictable Artificial Intelligence

Lexin Zhou

Pablo Antonio Moreno Casares

Fernando Martínez-Plumed

...

Konstantinos Voudouris

José Hernández-Orallo

638

08 Jan 2025

LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

556

03 Jan 2025

Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

RALM ALM ELM LRM LM&MA

799

459

31 Dec 2024

STAYKATE: Hybrid In-Context Example Selection Combining Representativeness Sampling and Retrieval-based Approach -- A Case Study on Science Domains

228

31 Dec 2024

Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs

Lynn Greschner

Roman Klinger

403

20 Dec 2024

Empowering LLMs to Understand and Generate Complex Vector Graphics

Computer Vision and Pattern Recognition (CVPR), 2024

619

15 Dec 2024

A Scoping Review of ChatGPT Research in Accounting and Finance

International Journal of Accounting Information Systems (IJAIS), 2024

Mengming Michael Dong

Theophanis C. Stratopoulos

Victor Xiaoqi Wang

360

07 Dec 2024

Large corpora and large language models: a replicable method for automating grammatical annotation

Linguistics Vanguard (LV), 2024

Cameron Morin

Matti Marttinen Larsson

370

18 Nov 2024

The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

618

17 Nov 2024

Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Flavio Di Palo

Prateek Singhi

Bilal Fadlallah

154

07 Nov 2024

One fish, two fish, but not the whole sea: Alignment reduces language models' conceptual diversity

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

490

07 Nov 2024

Evaluating Creative Short Story Generation in Humans and Large Language Models

596

04 Nov 2024

A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why?

413

03 Nov 2024

Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

John Wu

David Wu

Jimeng Sun

515

31 Oct 2024

NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates

Neural Information Processing Systems (NeurIPS), 2024

306

28 Oct 2024

PRISM: A Methodology for Auditing Biases in Large Language Models

Leif Azzopardi

Yashar Moshfeghi

248

24 Oct 2024

Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance

316

24 Oct 2024

PAPILLON: Privacy Preservation from Internet-based and Local Language Model Ensembles

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Li Siyan

Vethavikashini Chithrra Raghuram

Omar Khattab

Julia Hirschberg

Zhou Yu

463

22 Oct 2024

De-mark: Watermark Removal in Large Language Models

342

17 Oct 2024

MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

460

17 Oct 2024

Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data

International Conference on Learning Representations (ICLR), 2024

455

17 Oct 2024

EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs

International Conference on Computational Linguistics (COLING), 2024

Yijie Li

Yuan Sun

ELM

165

13 Oct 2024

JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles

Dom Nasrabadi

381

11 Oct 2024

Post-hoc Study of Climate Microtargeting on Social Media Ads with LLMs: Thematic Insights and Fairness Evaluation

Tunazzina Islam

Dan Goldwasser

688

07 Oct 2024

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

International Conference on Learning Representations (ICLR), 2024

Yufei Wang

...

Lifeng Shang

Chen Ma

462

07 Oct 2024

Hate Personified: Investigating the role of LLMs in content moderation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Tanmoy Chakraborty

272

03 Oct 2024

'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants

International Conference on Human Factors in Computing Systems (CHI), 2024

263

28 Sep 2024

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

European Conference on Computer Vision (ECCV), 2024

Yu Kong

Martin Renqiang Min

Dimitris N. Metaxas

DiffM

314

22 Sep 2024

Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows

David Alonso del Barrio

Max Tiel

D. Gática-Pérez

277

19 Sep 2024

What Would You Ask When You First Saw a^2+b^2=c^2? Evaluating LLM on Curiosity-Driven Questioning

Shashidhar Reddy Javaji

Zining Zhu

ELM ALM

426

19 Sep 2024

Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs

Yifan Wang

...

Jiabo Hu

Ning Zhang

Bob Kamma

262

16 Sep 2024

Keeping Humans in the Loop: Human-Centered Automated Annotation with Generative AI

International Conference on Web and Social Media (ICWSM), 2024

Nicholas Pangakis

Samuel Wolken

320

14 Sep 2024

Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance

Lucio La Cava

Andrea Tagarelli

LLMAG

190

13 Sep 2024

Your Weak LLM is Secretly a Strong Teacher for Alignment

International Conference on Learning Representations (ICLR), 2024

Leitian Tao

Yixuan Li

595

13 Sep 2024

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

515

12 Sep 2024

HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data

Lea Schönherr

165

10 Sep 2024