Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.03122
Cited By
Outlier Detection for Improved Data Quality and Diversity in Dialog Systems
5 April 2019
Stefan Larson
Anish Mahendran
Andrew Lee
Jonathan K. Kummerfeld
Parker Hill
M. Laurenzano
Johann Hauswald
Lingjia Tang
Jason Mars
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Outlier Detection for Improved Data Quality and Diversity in Dialog Systems"
5 / 5 papers shown
Title
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
88
11
0
31 Dec 2024
LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity
Anjana Arunkumar
Shubham Sharma
Rakhi Agrawal
Sriramakrishnan Chandrasekaran
Chris Bryan
18
0
0
12 Apr 2023
ShabbyPages: A Reproducible Document Denoising and Binarization Dataset
Alexander Groleau
Kok Wei Chee
Stefan Larson
Samay Maini
Jonathan Boarman
14
2
0
16 Mar 2023
Redwood: Using Collision Detection to Grow a Large-Scale Intent Classification Dataset
Stefan Larson
Kevin Leach
11
9
0
12 Apr 2022
Measuring Data Quality for Dataset Selection in Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
14
6
0
26 Nov 2021
1