arXiv:2412.13337
Unveiling the Secret Recipe: A Guide For Supervised Fine-Tuning Small LLMs
17 December 2024
Aldo Pareja
Nikhil Shivakumar Nayak
Hao Wang
Krishnateja Killamsetty
Shivchander Sudalairaj
Wenlong Zhao
Seungwook Han
Abhishek Bhandwaldar
Guangxuan Xu
Kai Xu
Ligong Han
Luke Inglis
Akash Srivastava
Papers citing "Unveiling the Secret Recipe: A Guide For Supervised Fine-Tuning Small LLMs"
3 / 3 papers shown
A Scaling Law for Token Efficiency in LLM Fine-Tuning Under Fixed Compute Budgets
Ryan Lagasse
Aidan Kiernans
Avijit Ghosh
Shiri Dori-Hacohen
19
0
0
09 May 2025
R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation
Meng-Hao Guo
Jiajun Xu
Yi Zhang
Jiaxi Song
Haoyang Peng
...
Yongming Rao
Houwen Peng
Han Hu
Gordon Wetzstein
Shi-Min Hu
ELM
LRM
57
0
0
04 May 2025
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods
Isha Puri
Shivchander Sudalairaj
Guangxuan Xu
Kai Xu
Akash Srivastava
LRM
76
3
0
03 Feb 2025