Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2202.04538
Cited By
v1
v2 (latest)
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
9 February 2022
Yu Meng
Jiaxin Huang
Yu Zhang
Jiawei Han
SyDa
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generating Training Data with Language Models: Towards Zero-Shot Language Understanding"
50 / 175 papers shown
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
Ruida Wang
Jipeng Zhang
Yizhen Jia
Boyao Wang
Shizhe Diao
Renjie Pi
Tong Zhang
LRM
280
45
0
03 Jul 2024
Fairness and Bias in Multimodal AI: A Survey
Tosin Adewumi
Lama Alkhaled
Namrata Gurung
G. V. Boven
Irene Pagliai
330
23
0
27 Jun 2024
Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization
Sungbin Shin
Wonpyo Park
Jaeho Lee
Namhoon Lee
172
6
0
21 Jun 2024
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics
Seungbeen Lee
Seungwon Lim
Seungju Han
Giyeong Oh
Hyungjoo Chae
...
Beong-woo Kwak
Yeonsoo Lee
Dongha Lee
Jinyoung Yeo
Youngjae Yu
333
39
0
20 Jun 2024
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tianyuan Zou
Yang Liu
Ziwei Sun
Jianqing Zhang
Jingjing Liu
Ya-Qin Zhang
248
3
0
18 Jun 2024
Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
Reza Akbarian Bafghi
Nidhin Harilal
C. Monteleoni
M. Raissi
DiffM
281
1
0
18 Jun 2024
Self-training Large Language Models through Knowledge Detection
Wei Jie Yeo
Teddy Ferdinan
Przemyslaw Kazienko
Frank Xing
Erik Cambria
235
15
0
17 Jun 2024
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Lin Long
Rui Wang
Ruixuan Xiao
Junbo Zhao
Xiao Ding
Gang Chen
Haobo Wang
SyDa
294
259
0
14 Jun 2024
Is Programming by Example solved by LLMs?
Wen-Ding Li
Kevin Ellis
281
26
0
12 Jun 2024
Contrastive Learning from Synthetic Audio Doppelgängers
International Conference on Learning Representations (ICLR), 2024
Manuel Cherep
Nikhil Singh
358
1
0
09 Jun 2024
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
LRM
276
6
0
06 Jun 2024
On The Persona-based Summarization of Domain-Specific Documents
Ankan Mullick
Sombit Bose
Rounak Saha
Ayan Kumar Bhowmick
Pawan Goyal
Niloy Ganguly
Prasenjit Dey
Ravi Kokku
203
6
0
06 Jun 2024
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Ryo Kamoi
Yusen Zhang
Nan Zhang
Jiawei Han
Rui Zhang
LRM
384
150
0
03 Jun 2024
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
523
511
0
01 Jun 2024
PGA-SciRE: Harnessing LLM on Data Augmentation for Enhancing Scientific Relation Extraction
Yang Zhou
Shimin Shan
Hongkui Wei
Zhehuan Zhao
Wenshuo Feng
225
3
0
30 May 2024
Automated Real-World Sustainability Data Generation from Images of Buildings
Peter J Bentley
Soo Ling Lim
Rajat Mathur
Siddhart Narang
237
4
0
28 May 2024
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yiming Chen
Chen Zhang
Danqing Luo
L. F. D’Haro
R. Tan
Haizhou Li
AAML
ELM
224
3
0
23 May 2024
SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Abhishek Divekar
Greg Durrett
400
13
0
16 May 2024
Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the Video
ACM Multimedia Asia (MMAsia), 2024
Tomoya Sugihara
Shuntaro Masuda
Ling Xiao
Toshihiko Yamasaki
160
4
0
14 May 2024
Selective Fine-tuning on LLM-labeled Data May Reduce Reliance on Human Annotation: A Case Study Using Schedule-of-Event Table Detection
Machine Learning in Health Care (MLHC), 2024
Bhawesh Kumar
Jonathan Amar
Eric Yang
Nan Li
Yugang Jia
242
4
0
09 May 2024
UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
Juhwan Choi
Yeonghwa Kim
Seunguk Yu
Jungmin Yun
Youngbin Kim
191
8
0
02 May 2024
Empowering Large Language Models for Textual Data Augmentation
Yichuan Li
Kaize Ding
Jianling Wang
Kyumin Lee
260
20
0
26 Apr 2024
Forcing Diffuse Distributions out of Language Models
Yiming Zhang
Avi Schwarzschild
Nicholas Carlini
Zico Kolter
Daphne Ippolito
ALM
DiffM
315
32
0
16 Apr 2024
Generative Text Steganography with Large Language Model
Jiaxuan Wu
Zhengxian Wu
Yiming Xue
Juan Wen
Wanli Peng
228
18
0
16 Apr 2024
Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation
Haozhe Zhao
Zefan Cai
Shuzheng Si
Liang Chen
Yufeng He
Kaikai An
Baobao Chang
175
1
0
12 Apr 2024
Best Practices and Lessons Learned on Synthetic Data for Language Models
Ruibo Liu
Jerry W. Wei
Fangyu Liu
Chenglei Si
Yanzhe Zhang
...
Steven Zheng
Daiyi Peng
Diyi Yang
Denny Zhou
Andrew M. Dai
SyDa
EgoV
304
112
0
11 Apr 2024
LLM-Augmented Retrieval: Enhancing Retrieval Models Through Language Models and Doc-Level Embedding
Mingrui Wu
Sheng Cao
KELM
RALM
173
6
0
08 Apr 2024
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent
Knowledge Discovery and Data Mining (KDD), 2024
Hanyu Lai
Xiao Liu
Iat Long Iong
Shuntian Yao
Yuxuan Chen
...
Hao Yu
Hanchen Zhang
Xiaohan Zhang
Yuxiao Dong
Jie Tang
LM&Ro
LLMAG
207
19
0
04 Apr 2024
Edisum: Summarizing and Explaining Wikipedia Edits at Scale
Marija Sakota
Isaac Johnson
Guosheng Feng
Robert West
SyDa
KELM
174
3
0
04 Apr 2024
GPT-DETOX: An In-Context Learning-Based Paraphraser for Text Detoxification
International Conference on Machine Learning and Applications (ICMLA), 2023
Ali Pesaranghader
Nikhil Verma
Manasa Bharadwaj
253
7
0
03 Apr 2024
Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation
Rohan Chaudhury
Mihir Godbole
Aakash Garg
Jinsil Hwaryoung Seo
146
1
0
31 Mar 2024
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models
Yuzhao Heng
Chun-Ying Deng
Yitong Li
Yue Yu
Yinghao Li
Rongzhi Zhang
Chao Zhang
252
10
0
17 Mar 2024
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation
Minhyuk Seo
Diganta Misra
Seongwon Cho
Minjae Lee
Jonghyun Choi
CLL
348
12
0
16 Mar 2024
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
European Conference on Computer Vision (ECCV), 2024
Renjie Pi
Tianyang Han
Wei Xiong
Jipeng Zhang
Runtao Liu
Boyao Wang
Tong Zhang
MLLM
374
75
0
13 Mar 2024
MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions
Conference on Information Technology for Social Good (ITSG), 2024
Vjosa Preniqi
Iacopo Ghinassi
Julia Ive
C. Saitis
Kyriaki Kalimeri
244
15
0
12 Mar 2024
Generative AI for Synthetic Data Generation: Methods, Challenges and the Future
Xu Guo
Yiqiang Chen
SyDa
197
55
0
07 Mar 2024
Improving Event Definition Following For Zero-Shot Event Detection
Zefan Cai
Po-Nien Kung
Ashima Suvarna
Mingyu Derek Ma
Hritik Bansal
Baobao Chang
P. Brantingham
Wei Wang
Nanyun Peng
233
14
0
05 Mar 2024
LUCID: LLM-Generated Utterances for Complex and Interesting Dialogues
Joe Stacey
Jianpeng Cheng
John Torr
Tristan Guigue
Joris Driesen
Alexandru Coca
Mark Gaynor
Anders Johannsen
430
8
0
01 Mar 2024
TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision
Yunyi Zhang
Ruozhen Yang
Xueqiang Xu
Rui Li
Jinfeng Xiao
Jiaming Shen
Jiawei Han
508
40
0
29 Feb 2024
LLM-Assisted Content Conditional Debiasing for Fair Text Embedding
Wenlong Deng
Blair Chen
Beidi Zhao
Chiyu Zhang
Xiaoxiao Li
Christos Thrampoulidis
503
0
0
22 Feb 2024
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Wenlin Yao
Lu Cheng
Huan Liu
SyDa
394
87
0
21 Feb 2024
A Survey on Knowledge Distillation of Large Language Models
Xiaohan Xu
Ming Li
Chongyang Tao
Tao Shen
Reynold Cheng
Jinyang Li
Can Xu
Dacheng Tao
Wanrong Zhu
KELM
VLM
454
229
0
20 Feb 2024
Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction
Sizhe Zhou
Yu Meng
Sara Szymkuć
Jiawei Han
153
16
0
17 Feb 2024
Scaling laws for learning with real and surrogate data
Ayush Jain
Andrea Montanari
Eren Sasoglu
307
22
0
06 Feb 2024
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Pardis Sadat Zahraei
Ali Emami
158
7
0
31 Jan 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
474
49
0
27 Jan 2024
Distilling Vision-Language Models on Millions of Videos
Computer Vision and Pattern Recognition (CVPR), 2024
Yue Zhao
Long Zhao
Xingyi Zhou
Jialin Wu
Chun-Te Chu
...
Hartwig Adam
Ting Liu
Boqing Gong
Philipp Krahenbuhl
Liangzhe Yuan
VLM
279
20
0
11 Jan 2024
Learning Vision from Models Rivals Learning Vision from Data
Computer Vision and Pattern Recognition (CVPR), 2023
Yonglong Tian
Lijie Fan
Kaifeng Chen
Dina Katabi
Dilip Krishnan
Phillip Isola
274
73
0
28 Dec 2023
Large Language Models for Conducting Advanced Text Analytics Information Systems Research
Benjamin Ampel
Chi-Heng Yang
Junjie Hu
Hsinchun Chen
349
12
0
27 Dec 2023
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
406
74
0
18 Dec 2023
Previous
1
2
3
4
Next