Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.02318
Cited By
Diversity Measurement and Subset Selection for Instruction Tuning Datasets
4 February 2024
Peiqi Wang
Yikang Shen
Zhen Guo
Matt Stallone
Yoon Kim
Polina Golland
Rameswar Panda
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Diversity Measurement and Subset Selection for Instruction Tuning Datasets"
7 / 7 papers shown
Title
WildChat: 1M ChatGPT Interaction Logs in the Wild
Wenting Zhao
Xiang Ren
Jack Hessel
Claire Cardie
Yejin Choi
Yuntian Deng
40
171
0
02 May 2024
Data Diversity Matters for Robust Instruction Tuning
Alexander Bukharin
Tuo Zhao
72
35
0
21 Nov 2023
The Vendi Score: A Diversity Evaluation Metric for Machine Learning
Dan Friedman
Adji Bousso Dieng
EGVM
76
107
0
05 Oct 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
205
1,651
0
15 Oct 2021
Deduplicating Training Data Makes Language Models Better
Katherine Lee
Daphne Ippolito
A. Nystrom
Chiyuan Zhang
Douglas Eck
Chris Callison-Burch
Nicholas Carlini
SyDa
237
588
0
14 Jul 2021
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
157
1,123
0
25 Jul 2012
1