2303.15056
Cited By
v1
v2 (latest)
ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023
27 March 2023
Fabrizio Gilardi
Meysam Alizadeh
M. Kubli
AI4MH
Papers citing "ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks"
50 / 311 papers shown
SEFL: A Framework for Generating Synthetic Educational Assignment Feedback with LLM Agents
Mike Zhang
Amalie Pernille Dilling
Léon Gondelman
Niels Erik Ruan Lyngdorf
Euan D Lindsay
Johannes Bjerva
AI4Ed
SyDa
355
1
0
18 Feb 2025
Reasoning on a Spectrum: Aligning LLMs to System 1 and System 2 Thinking
Alireza S. Ziabari
Nona Ghazizadeh
Zhivar Sourati
Farzan Karimi-Malekabadi
Payam Piray
Morteza Dehghani
LRM
330
16
0
18 Feb 2025
Escaping Collapse: The Strength of Weak Data for Large Language Model Training
Kareem Amin
Sara Babakniya
Alex Bie
Weiwei Kong
Umar Syed
Sergei Vassilvitskii
461
5
0
13 Feb 2025
Measuring Diversity in Synthetic Datasets
Yuchang Zhu
Huizhe Zhang
Bingzhe Wu
Jintang Li
Zibin Zheng
Peilin Zhao
Liang Chen
Yatao Bian
482
0
0
12 Feb 2025
AI Alignment at Your Discretion
Conference on Fairness, Accountability and Transparency (FAccT), 2025
Maarten Buyl
Hadi Khalaf
C. M. Verdun
Lucas Monteiro Paes
Caio Vieira Machado
Flavio du Pin Calmon
351
11
0
10 Feb 2025
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data
Thomas Zeng
Shuibai Zhang
Shutong Wu
Christian Classen
Daewon Chae
...
Jungtaek Kim
H. Koo
Kannan Ramchandran
Dimitris Papailiopoulos
Kangwook Lee
LRM
395
23
0
10 Feb 2025
Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy
Kamyar Kazari
Yong Chen
Zahra Shakeri
AI4MH
351
3
0
10 Feb 2025
Few-shot LLM Synthetic Data with Distribution Matching
The Web Conference (WWW), 2025
Jiyuan Ren
Zhaocheng Du
Zhihao Wen
Qinglin Jia
Sunhao Dai
Chuhan Wu
Zhenhua Dong
SyDa
502
6
0
09 Feb 2025
Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation
International Journal of Computer Vision (IJCV), 2025
Yin Wang
Mu Li
Jiapeng Liu
Zhiying Leng
Frederick W. B. Li
Ziyao Zhang
Xiaohui Liang
314
38
0
08 Feb 2025
Aligning Black-box Language Models with Human Judgments
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Gerrit J. J. van den Burg
Gen Suzuki
Wei Liu
Murat Sensoy
ALM
332
2
0
07 Feb 2025
Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet
Berk Atil
Vipul Gupta
Sarkar Snigdha Sarathi Das
R. Passonneau
859
0
0
07 Feb 2025
Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Tunazzina Islam
Dan Goldwasser
450
9
0
28 Jan 2025
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Thomas Walshe
S. Moon
Chunyang Xiao
Yawwani Gunawardana
Fran Silavong
269
6
0
21 Jan 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
605
46
0
10 Jan 2025
Predictable Artificial Intelligence
Lexin Zhou
Pablo Antonio Moreno Casares
Fernando Martínez-Plumed
John Burden
Ryan Burnell
...
Seán Ó hÉigeartaigh
Danaja Rutar
Wout Schellaert
Konstantinos Voudouris
José Hernández-Orallo
638
8
0
08 Jan 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
556
57
0
03 Jan 2025
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Sudipta Singha Roy
Zhumin Chen
D. Yin
Zhaochun Ren
RALM
ALM
ELM
LRM
LM&MA
799
459
0
31 Dec 2024
STAYKATE: Hybrid In-Context Example Selection Combining Representativeness Sampling and Retrieval-based Approach -- A Case Study on Science Domains
Chencheng Zhu
Kazutaka Shimada
Tomoki Taniguchi
Tomoko Ohkuma
228
1
0
31 Dec 2024
Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
Lynn Greschner
Roman Klinger
403
10
0
20 Dec 2024
Empowering LLMs to Understand and Generate Complex Vector Graphics
Computer Vision and Pattern Recognition (CVPR), 2024
Ximing Xing
Juncheng Hu
Guotao Liang
Jing Zhang
Dong Xu
Qian Yu
619
38
0
15 Dec 2024
A Scoping Review of ChatGPT Research in Accounting and Finance
International Journal of Accounting Information Systems (IJAIS), 2024
Mengming Michael Dong
Theophanis C. Stratopoulos
Victor Xiaoqi Wang
360
61
0
07 Dec 2024
Large corpora and large language models: a replicable method for automating grammatical annotation
Linguistics Vanguard (LV), 2024
Cameron Morin
Matti Marttinen Larsson
370
8
0
18 Nov 2024
The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Tomas Horych
Christoph Mandl
Terry Ruas
André Greiner-Petter
Bela Gipp
Akiko Aizawa
Timo Spinde
618
16
0
17 Nov 2024
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Flavio Di Palo
Prateek Singhi
Bilal Fadlallah
154
19
0
07 Nov 2024
One fish, two fish, but not the whole sea: Alignment reduces language models' conceptual diversity
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Sonia K. Murthy
Tomer Ullman
Jennifer Hu
ALM
490
38
0
07 Nov 2024
Evaluating Creative Short Story Generation in Humans and Large Language Models
Mete Ismayilzada
Claire Stevenson
Lonneke van der Plas
LM&MA
LRM
596
17
0
04 Nov 2024
A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why?
QiHong Chen
Jiawei Li
Jiecheng Deng
Jiachen Yu
Justin Tian Jin Chen
Iftekhar Ahmed
413
8
0
03 Nov 2024
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
John Wu
David Wu
Jimeng Sun
515
2
0
31 Oct 2024
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Neural Information Processing Systems (NeurIPS), 2024
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
306
8
0
28 Oct 2024
PRISM: A Methodology for Auditing Biases in Large Language Models
Leif Azzopardi
Yashar Moshfeghi
248
4
0
24 Oct 2024
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance
Omer Nahum
Nitay Calderon
Orgad Keller
Idan Szpektor
Roi Reichart
316
17
0
24 Oct 2024
PAPILLON: Privacy Preservation from Internet-based and Local Language Model Ensembles
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Li Siyan
Vethavikashini Chithrra Raghuram
Omar Khattab
Julia Hirschberg
Zhou Yu
463
37
0
22 Oct 2024
De-mark: Watermark Removal in Large Language Models
Ruibo Chen
Yihan Wu
Junfeng Guo
Heng Huang
WaLM
VLM
342
8
0
17 Oct 2024
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zonghai Yao
Aditya Parashar
Huixue Zhou
Won Seok Jang
Feiyun Ouyang
Zhichao Yang
Hong-ye Yu
ELM
460
19
0
17 Oct 2024
Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
International Conference on Learning Representations (ICLR), 2024
Florian E. Dorner
Vivian Y. Nastl
Moritz Hardt
ELM
ALM
455
27
0
17 Oct 2024
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs
International Conference on Computational Linguistics (COLING), 2024
Yijie Li
Yuan Sun
ELM
165
3
0
13 Oct 2024
JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles
Dom Nasrabadi
381
2
0
11 Oct 2024
Post-hoc Study of Climate Microtargeting on Social Media Ads with LLMs: Thematic Insights and Fairness Evaluation
Tunazzina Islam
Dan Goldwasser
688
8
0
07 Oct 2024
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
International Conference on Learning Representations (ICLR), 2024
Qiyuan Zhang
Yufei Wang
Tiezheng YU
Yuxin Jiang
Chuhan Wu
...
Xin Jiang
Lifeng Shang
Ruiming Tang
Fuyuan Lyu
Chen Ma
462
18
0
07 Oct 2024
Hate Personified: Investigating the role of LLMs in content moderation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sarah Masud
Sahajpreet Singh
Viktor Hangya
Kangyang Luo
Tanmoy Chakraborty
272
21
0
03 Oct 2024
'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants
International Conference on Human Factors in Computing Systems (CHI), 2024
Shivani Kapania
William Agnew
Motahhare Eslami
Hoda Heidari
Sarah E Fox
263
26
0
28 Sep 2024
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
European Conference on Computer Vision (ECCV), 2024
Yuxiao Chen
Keqin Li
Wentao Bao
Deep Patel
Yu Kong
Martin Renqiang Min
Dimitris N. Metaxas
DiffM
314
8
0
22 Sep 2024
Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows
David Alonso del Barrio
Max Tiel
D. Gática-Pérez
277
7
0
19 Sep 2024
What Would You Ask When You First Saw $a^2+b^2=c^2$? Evaluating LLM on Curiosity-Driven Questioning
Shashidhar Reddy Javaji
Zining Zhu
ELM
ALM
426
1
0
19 Sep 2024
Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs
Yifan Wang
David Stevens
Pranay Shah
Wenwen Jiang
Miao Liu
...
Boying Gong
Daniel Lee
Jiabo Hu
Ning Zhang
Bob Kamma
262
5
0
16 Sep 2024
Keeping Humans in the Loop: Human-Centered Automated Annotation with Generative AI
International Conference on Web and Social Media (ICWSM), 2024
Nicholas Pangakis
Samuel Wolken
320
20
0
14 Sep 2024
Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance
Lucio La Cava
Andrea Tagarelli
LLMAG
190
3
0
13 Sep 2024
Your Weak LLM is Secretly a Strong Teacher for Alignment
International Conference on Learning Representations (ICLR), 2024
Leitian Tao
Yixuan Li
595
16
0
13 Sep 2024
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
A. Lupidi
Carlos Gemmell
Nicola Cancedda
Jane Dwivedi-Yu
Jason Weston
Jakob Foerster
Roberta Raileanu
Maria Lomeli
SyDa
515
22
0
12 Sep 2024
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data
Hossein Hajipour
Lea Schönherr
Thorsten Holz
Mario Fritz
AAML
SyDa
165
13
0
10 Sep 2024