Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.06857
Cited By
What is the Role of Small Models in the LLM Era: A Survey
10 September 2024
Lihu Chen
Gaël Varoquaux
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"What is the Role of Small Models in the LLM Era: A Survey"
20 / 20 papers shown
Title
LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device Collaboration
Y. Zhang
Pengyue Jia
X. Li
Derong Xu
Maolin Wang
...
Zhaocheng Du
Huifeng Guo
Y. Liu
Ruiming Tang
Xiangyu Zhao
30
0
0
08 May 2025
The Rise of Small Language Models in Healthcare: A Comprehensive Survey
Muskan Garg
Shaina Raza
Shebuti Rayana
Xingyi Liu
Sunghwan Sohn
LM&MA
AILaw
87
0
0
23 Apr 2025
Training Small Reasoning LLMs with Cognitive Preference Alignment
Wenrui Cai
Chengyu Wang
Junbing Yan
Jun Huang
Xiangzhong Fang
LRM
23
0
0
14 Apr 2025
From Punchlines to Predictions: A Metric to Assess LLM Performance in Identifying Humor in Stand-Up Comedy
Adrianna Romanowski
Pedro Valois
Kazuhiro Fukui
31
0
0
12 Apr 2025
A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis
Xin Gao
Qizhi Pei
Zinan Tang
Y. Li
Honglin Lin
Jiang Wu
C. He
Lijun Wu
SyDa
28
0
0
11 Apr 2025
Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models
Hongcheng Guo
Juntao Yao
Boyang Wang
Junjia Du
Shaosheng Cao
Donglin Di
Shun Zhang
Z. Li
MoE
32
0
0
10 Apr 2025
HoarePrompt: Structural Reasoning About Program Correctness in Natural Language
Dimitrios Stamatios Bouras
Yihan Dai
Tairan Wang
Yingfei Xiong
Sergey Mechtaev
LRM
43
0
0
25 Mar 2025
Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent
Humza Nusrat
Bing Luo
Ryan Hall
Joshua Kim
H. Bagher-Ebadian
Anthony Doemer
B. Movsas
Kundan Thind
AI4CE
31
0
0
21 Mar 2025
Investigating Retrieval-Augmented Generation in Quranic Studies: A Study of 13 Open-Source Large Language Models
Zahra Khalila
Arbi Haza Nasution
Winda Monika
Aytug Onan
Yohei Murakami
Yasir Bin Ismail Radi
Noor Mohammad Osmani
RALM
60
0
0
20 Mar 2025
Knowledge Distillation: Enhancing Neural Network Compression with Integrated Gradients
David E. Hernandez
J. Chang
Torbjörn E. M. Nordling
51
0
0
17 Mar 2025
FlexInfer: Breaking Memory Constraint via Flexible and Efficient Offloading for On-Device LLM Inference
Hongchao Du
Shangyu Wu
Arina Kharlamova
Nan Guan
Chun Jason Xue
46
1
0
04 Mar 2025
Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations
Lihu Chen
Shuojie Fu
Gabriel Freedman
Cemre Zor
Guy Martin
James Kinross
Uddhav Vaghela
Ovidiu Serban
Francesca Toni
DeLMO
61
0
0
21 Feb 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
33
0
0
28 Jan 2025
BioAgents: Democratizing Bioinformatics Analysis with Multi-Agent Systems
Nikita Mehandru
Amanda K. Hall
Olesya Melnichenko
Yulia Dubinina
Daniel Tsirulnikov
David Bamman
Ahmed Alaa
Scott Saponas
Venkat S. Malladi
33
2
0
10 Jan 2025
GraphRouter: A Graph-based Router for LLM Selections
Tao Feng
Yanzhen Shen
Jiaxuan You
35
9
0
04 Oct 2024
Small Language Models: Survey, Measurements, and Insights
Zhenyan Lu
Xiang Li
Dongqi Cai
Rongjie Yi
Fangming Liu
Xiwen Zhang
Nicholas D. Lane
Mengwei Xu
ObjD
LRM
37
31
0
24 Sep 2024
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Guillermo Marco
Luz Rello
Julio Gonzalo
LM&MA
ALM
31
6
0
17 Sep 2024
LESS: Selecting Influential Data for Targeted Instruction Tuning
Mengzhou Xia
Sadhika Malladi
Suchin Gururangan
Sanjeev Arora
Danqi Chen
65
180
0
06 Feb 2024
Model Interpretability through the Lens of Computational Complexity
Pablo Barceló
Mikaël Monet
Jorge A. Pérez
Bernardo Subercaseaux
111
93
0
23 Oct 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
3,054
0
23 Jan 2020
1