ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.09689
  4. Cited By
Unnatural Instructions: Tuning Language Models with (Almost) No Human
  Labor

Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor

19 December 2022
Or Honovich
Thomas Scialom
Omer Levy
Timo Schick
    ALM
ArXivPDFHTML

Papers citing "Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor"

50 / 291 papers shown
Title
A Survey on Self-Evolution of Large Language Models
A Survey on Self-Evolution of Large Language Models
Zhengwei Tao
Ting-En Lin
Xiancai Chen
Hangyu Li
Yuchuan Wu
Yongbin Li
Zhi Jin
Fei Huang
Dacheng Tao
Jingren Zhou
LRM
LM&Ro
46
21
0
22 Apr 2024
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional
  Dialogues
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Jiao Ou
Jiayu Wu
Che Liu
Fuzheng Zhang
Di Zhang
Kun Gai
19
2
0
17 Apr 2024
Forcing Diffuse Distributions out of Language Models
Forcing Diffuse Distributions out of Language Models
Yiming Zhang
Avi Schwarzschild
Nicholas Carlini
Zico Kolter
Daphne Ippolito
ALM
DiffM
36
15
0
16 Apr 2024
Behavior Trees Enable Structured Programming of Language Model Agents
Behavior Trees Enable Structured Programming of Language Model Agents
Richard Kelley
AI4CE
LM&Ro
LLMAG
19
0
0
11 Apr 2024
High-Dimension Human Value Representation in Large Language Models
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
60
5
0
11 Apr 2024
Transferable and Efficient Non-Factual Content Detection via Probe
  Training with Offline Consistency Checking
Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking
Xiaokang Zhang
Zijun Yao
Jing Zhang
Kaifeng Yun
Jifan Yu
Juan-Zi Li
Jie Tang
HILM
24
2
0
10 Apr 2024
CodecLM: Aligning Language Models with Tailored Synthetic Data
CodecLM: Aligning Language Models with Tailored Synthetic Data
Zifeng Wang
Chun-Liang Li
Vincent Perot
Long T. Le
Jin Miao
Zizhao Zhang
Chen-Yu Lee
Tomas Pfister
SyDa
ALM
16
17
0
08 Apr 2024
CantTalkAboutThis: Aligning Language Models to Stay on Topic in
  Dialogues
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
Makesh Narsimhan Sreedhar
Traian Rebedea
Shaona Ghosh
Jiaqi Zeng
Christopher Parisien
ALM
27
4
0
04 Apr 2024
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web
  Navigating Agent
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent
Hanyu Lai
Xiao Liu
Iat Long Iong
Shuntian Yao
Yuxuan Chen
...
Hao Yu
Hanchen Zhang
Xiaohan Zhang
Yuxiao Dong
Jie Tang
LM&Ro
LLMAG
36
44
0
04 Apr 2024
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for
  Large Language Models
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models
Taiqiang Wu
Chaofan Tao
Jiahao Wang
Zhe Zhao
Ngai Wong
ALM
38
14
0
03 Apr 2024
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
Yuelin Bai
Xinrun Du
Yiming Liang
Yonggang Jin
Ziqiang Liu
...
Chenghua Lin
Jie Fu
Min Yang
Shiwen Ni
Ge Zhang
ALM
35
32
0
26 Mar 2024
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for
  Large Language Models
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Zehui Chen
Kuikun Liu
Qiuchen Wang
Wenwei Zhang
Jiangning Liu
Dahua Lin
Kai-xiang Chen
Feng Zhao
LLMAG
ALM
AIFin
54
27
0
19 Mar 2024
Automated Data Curation for Robust Language Model Fine-Tuning
Automated Data Curation for Robust Language Model Fine-Tuning
Jiuhai Chen
Jonas W. Mueller
ALM
32
19
0
19 Mar 2024
Third-Party Language Model Performance Prediction from Instruction
Third-Party Language Model Performance Prediction from Instruction
Rahul Nadkarni
Yizhong Wang
Noah A. Smith
ELM
LRM
34
0
0
19 Mar 2024
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
Ahmed Masry
Mehrad Shahmohammadi
Md. Rizwan Parvez
Enamul Hoque
Shafiq R. Joty
26
31
0
14 Mar 2024
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large
  Language Model
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
Cheng Chen
Junchen Zhu
Xu Luo
Hengtao Shen
Lianli Gao
Jingkuan Song
CLL
27
12
0
13 Mar 2024
Alignment Studio: Aligning Large Language Models to Particular
  Contextual Regulations
Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations
Swapnaja Achintalwar
Ioana Baldini
Djallel Bouneffouf
Joan Byamugisha
Maria Chang
...
P. Sattigeri
Moninder Singh
S. Thwala
Rosario A. Uceda-Sosa
Kush R. Varshney
37
4
0
08 Mar 2024
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Boshi Wang
Hao Fang
Jason Eisner
Benjamin Van Durme
Yu-Chuan Su
CLL
27
7
0
07 Mar 2024
LUCID: LLM-Generated Utterances for Complex and Interesting Dialogues
LUCID: LLM-Generated Utterances for Complex and Interesting Dialogues
Joe Stacey
Jianpeng Cheng
John Torr
Tristan Guigue
Joris Driesen
Alexandru Coca
Mark Gaynor
Anders Johannsen
22
3
0
01 Mar 2024
Standardizing the Measurement of Text Diversity: A Tool and a Comparative Analysis of Scores
Standardizing the Measurement of Text Diversity: A Tool and a Comparative Analysis of Scores
Chantal Shaib
Joe Barrow
Jiuding Sun
Alexa F. Siu
Byron C. Wallace
A. Nenkova
66
31
0
01 Mar 2024
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period
  of Large Language Models
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
Chao Qian
Jie M. Zhang
Wei Yao
Dongrui Liu
Zhen-fei Yin
Yu Qiao
Yong Liu
Jing Shao
LLMSV
LRM
42
13
0
29 Feb 2024
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task
  Adaptation
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation
Nihal V. Nayak
Yiyang Nan
Avi Trost
Stephen H. Bach
SyDa
30
13
0
28 Feb 2024
SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection
SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection
Liangxin Liu
Xuebo Liu
Derek F. Wong
Dongfang Li
Ziyi Wang
Baotian Hu
Min Zhang
45
15
0
26 Feb 2024
CodeS: Towards Building Open-source Language Models for Text-to-SQL
CodeS: Towards Building Open-source Language Models for Text-to-SQL
Haoyang Li
Jing Zhang
Hanbing Liu
Ju Fan
Xiaokang Zhang
Jun Zhu
Renjie Wei
Hongyan Pan
Cuiping Li
Hong Chen
ELM
AI4TS
35
91
0
26 Feb 2024
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Xiangdi Meng
Damai Dai
Weiyao Luo
Zhe Yang
Shaoxiang Wu
Xiaochen Wang
Peiyi Wang
Qingxiu Dong
Liang Chen
Zhifang Sui
109
11
0
25 Feb 2024
Watermarking Makes Language Models Radioactive
Watermarking Makes Language Models Radioactive
Tom Sander
Pierre Fernandez
Alain Durmus
Matthijs Douze
Teddy Furon
WaLM
29
11
0
22 Feb 2024
Towards Robust Instruction Tuning on Multimodal Large Language Models
Towards Robust Instruction Tuning on Multimodal Large Language Models
Wei Han
Hui Chen
Soujanya Poria
MLLM
44
0
0
22 Feb 2024
LexC-Gen: Generating Data for Extremely Low-Resource Languages with
  Large Language Models and Bilingual Lexicons
LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons
Zheng-Xin Yong
Cristina Menghini
Stephen H. Bach
28
3
0
21 Feb 2024
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity
  within Large Language Models
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models
Chenyang Song
Xu Han
Zhengyan Zhang
Shengding Hu
Xiyu Shi
...
Chen Chen
Zhiyuan Liu
Guanglin Li
Tao Yang
Maosong Sun
40
24
0
21 Feb 2024
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks
Minju Seo
Jinheon Baek
James Thorne
Sung Ju Hwang
RALM
29
8
0
21 Feb 2024
Large Language Models for Data Annotation: A Survey
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Jundong Li
Lu Cheng
Huan Liu
SyDa
42
44
0
21 Feb 2024
Explaining Relationships Among Research Papers
Explaining Relationships Among Research Papers
Xiangci Li
Jessica Ouyang
19
0
0
20 Feb 2024
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for
  Language Models
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Haoran Li
Qingxiu Dong
Zhengyang Tang
Chaojun Wang
Xingxing Zhang
...
Wei Lu
Zhifang Sui
Benyou Wang
Wai Lam
Furu Wei
SyDa
56
50
0
20 Feb 2024
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Demin Song
Honglin Guo
Yunhua Zhou
Shuhao Xing
Yudong Wang
...
Wenwei Zhang
Qipeng Guo
Hang Yan
Xipeng Qiu
Dahua Lin
SyDa
47
6
0
20 Feb 2024
PromptKD: Distilling Student-Friendly Knowledge for Generative Language
  Models via Prompt Tuning
PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning
Gyeongman Kim
Doohyuk Jang
Eunho Yang
VLM
30
13
0
20 Feb 2024
Reformatted Alignment
Reformatted Alignment
Run-Ze Fan
Xuefeng Li
Haoyang Zou
Junlong Li
Shwai He
Ethan Chern
Jiewen Hu
Pengfei Liu
54
8
0
19 Feb 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated
  Text Detectors Under Attacks
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
28
17
0
18 Feb 2024
Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
S. Hayati
Taehee Jung
Tristan Bodding-Long
Sudipta Kar
A. Sethy
Joo-Kyung Kim
Dongyeop Kang
ALM
LRM
30
6
0
18 Feb 2024
Learning to Learn Faster from Human Feedback with Language Model
  Predictive Control
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Jacky Liang
Fei Xia
Wenhao Yu
Andy Zeng
Montse Gonzalez Arenas
...
N. Heess
Kanishka Rao
Nik Stewart
Jie Tan
Carolina Parada
LM&Ro
49
32
0
18 Feb 2024
Instruction Diversity Drives Generalization To Unseen Tasks
Instruction Diversity Drives Generalization To Unseen Tasks
Dylan Zhang
Justin Wang
Francois Charton
ALM
23
6
0
16 Feb 2024
How Reliable Are Automatic Evaluation Methods for Instruction-Tuned
  LLMs?
How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?
Ehsan Doostmohammadi
Oskar Holmstrom
Marco Kuhlmann
27
8
0
16 Feb 2024
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate
  Controllable Controversial Statements
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li
Jiuhai Chen
Lichang Chen
Tianyi Zhou
66
17
0
16 Feb 2024
Smaller Language Models are capable of selecting Instruction-Tuning
  Training Data for Larger Language Models
Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models
Dheeraj Mekala
Alex Nguyen
Jingbo Shang
ALM
12
18
0
16 Feb 2024
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM
  Workflows
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Ajay Patel
Colin Raffel
Chris Callison-Burch
SyDa
AI4CE
17
25
0
16 Feb 2024
Towards Faithful and Robust LLM Specialists for Evidence-Based
  Question-Answering
Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering
Tobias Schimanski
Jingwei Ni
Mathias Kraus
Elliott Ash
Markus Leippold
19
3
0
13 Feb 2024
Grounding Data Science Code Generation with Input-Output Specifications
Grounding Data Science Code Generation with Input-Output Specifications
Yeming Wen
Pengcheng Yin
Kensen Shi
Henryk Michalewski
Swarat Chaudhuri
A. Polozov
SyDa
22
10
0
12 Feb 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction
  Tuning
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Shivalika Singh
Freddie Vargus
Daniel D'souza
Börje F. Karlsson
Abinaya Mahendiran
...
Max Bartolo
Julia Kreutzer
A. Ustun
Marzieh Fadaee
Sara Hooker
115
115
0
09 Feb 2024
Rethinking Data Selection for Supervised Fine-Tuning
Rethinking Data Selection for Supervised Fine-Tuning
Ming Shen
15
16
0
08 Feb 2024
Large Language Model Meets Graph Neural Network in Knowledge
  Distillation
Large Language Model Meets Graph Neural Network in Knowledge Distillation
Shengxiang Hu
Guobing Zou
Song Yang
Yanglan Gan
Bofeng Zhang
Yixin Chen
32
7
0
08 Feb 2024
DistiLLM: Towards Streamlined Distillation for Large Language Models
DistiLLM: Towards Streamlined Distillation for Large Language Models
Jongwoo Ko
Sungnyun Kim
Tianyi Chen
SeYoung Yun
61
25
0
06 Feb 2024
Previous
123456
Next