Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward


2 February 2024
Arnav Chavan
Raghav Magazine
Shubham Kushwaha
M. Debbah
Deepak Gupta
ArXiv (abs) | PDF | HTML | HuggingFace (2 upvotes) | GitHub (42★)

Papers citing "Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward"

14 / 14 papers shown
FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic
Kanghyun Choi
Hyeyoon Lee
S. Park
Dain Kwon
Jinho Lee
MQ
148
0
0
28 Oct 2025
HALO: Memory-Centric Heterogeneous Accelerator with 2.5D Integration for Low-Batch LLM Inference
Shubham Negi
Kaushik Roy
109
0
0
03 Oct 2025
Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models
Xudong Han
Junjie Yang
Pohsun Feng
Ziqian Bi
Xinyuan Song
Junfeng Hao
Junhao Song
LM&MA
ALM
362
4
0
24 Aug 2025
Consiglieres in the Shadow: Understanding the Use of Uncensored Large Language Models in Cybercrimes
Zilong Lin
Zichuan Li
Xiaojing Liao
XiaoFeng Wang
92
1
0
18 Aug 2025
Beyond Benchmarks: A Novel Framework for Domain-Specific LLM Evaluation and Knowledge Mapping
Nitin Sharma
Thomas Wolfers
Çağatay Yıldız
ALM
141
0
0
09 Jun 2025
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
Adarsh Prasad Behera
J. Champati
Roberto Morabito
Sasu Tarkoma
J. Gross
180
5
0
06 Jun 2025
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models
Nicolas Baumann
Cheng Hu
Paviththiren Sivasothilingam
Haotong Qin
Lei Xie
Michele Magno
Luca Benini
300
5
0
15 Apr 2025
Large Language Models for Code Generation: A Comprehensive Survey of Challenges, Techniques, Evaluation, and Applications
Nam Huynh
Beiyu Lin
LM&MA
341
27
0
03 Mar 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
288
3
0
28 Jan 2025
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2024
Elias Frantar
Roberto L. Castro
Jiale Chen
Torsten Hoefler
Dan Alistarh
MQ
198
26
0
21 Aug 2024
Grammar-based Game Description Generation using Large Language Models
Tsunehiko Tanaka
Edgar Simo-Serra
304
6
0
24 Jul 2024
Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications
Charith Chandra Sai Balne
S. Bhaduri
Tamoghna Roy
Vinija Jain
Vasu Sharma
309
33
0
21 Apr 2024
Language-Grounded Dynamic Scene Graphs for Interactive Object Search with Mobile Manipulation
IEEE Robotics and Automation Letters (RA-L), 2024
Daniel Honerkamp
Martin Buchner
Fabien Despinoy
Tim Welschehold
Abhinav Valada
LM&Ro
272
75
0
13 Mar 2024
Towards Message Brokers for Generative AI: Survey, Challenges, and Opportunities
Alaa Saleh
Roberto Morabito
Sasu Tarkoma
Susanna Pirttikangas
Lauri Lovén
307
8
0
22 Dec 2023