Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.19759
Cited By
Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs
29 September 2024
Yung-Chieh Chan
George Pu
Apaar Shanker
Parth Suresh
Penn Jenks
John Heyer
Sam Denton
SyDa
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs"
4 / 4 papers shown
Title
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking
Yi-Ling Chung
Aurora Cobo
Pablo Serna
SyDa
HILM
53
0
0
24 Feb 2025
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
111
3
0
06 Feb 2025
MDCure: A Scalable Pipeline for Multi-Document Instruction-Following
Gabrielle Kaili-May Liu
Bowen Shi
Avi Caciularu
Idan Szpektor
Arman Cohan
58
3
0
30 Oct 2024
Hybrid Training Approaches for LLMs: Leveraging Real and Synthetic Data to Enhance Model Performance in Domain-Specific Applications
Alexey Zhezherau
Alexei Yanockin
SyDa
16
3
0
11 Oct 2024
1