ResearchTrend.AI
arXiv: 2401.02038
Understanding LLMs: A Comprehensive Overview from Training to Inference

4 January 2024
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
Jiaming Tian
Yutong Zhang
Jiaqi Wang
Xiaohui Gao
Tianyang Zhong
Yi Pan
Shaochen Xu
Zihao Wu
Zheng Liu
Xin Zhang
Shu Zhang
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
    SyDa

Papers citing "Understanding LLMs: A Comprehensive Overview from Training to Inference"

50 / 58 papers shown
Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents
Schaun Wheeler
Olivier Jeunen
LLMAG
28
0
0
06 May 2025
A Deep User Interface for Exploring LLaMa
Divya Perumal
Swaroop Panda
36
0
0
28 Feb 2025
LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM
Yehonathan Refael
Iftach Arbel
Ofir Lindenbaum
Tom Tirer
64
0
0
26 Feb 2025
Reasoning about Affordances: Causal and Compositional Reasoning in LLMs
Magnus F. Gjerde
Vanessa Cheung
David Lagnado
ReLM
LRM
48
0
0
23 Feb 2025
Position: Standard Benchmarks Fail -- LLM Agents Present Overlooked Risks for Financial Applications
Zichen Chen
Jiaao Chen
Jianda Chen
Misha Sra
ELM
34
1
0
21 Feb 2025
Brain-Inspired Exploration of Functional Networks and Key Neurons in Large Language Models
Yiheng Liu
Xiaohui Gao
Haiyang Sun
Bao Ge
Tianming Liu
Junwei Han
X. Hu
36
0
0
13 Feb 2025
Harnessing the Potential of Large Language Models in Modern Marketing Management: Applications, Future Directions, and Strategic Recommendations
Raha Aghaei
Ali A. Kiaei
Mahnaz Boush
Javad Vahidi
Mohammad Zavvar
Zeynab Barzegar
Mahan Rofoosheh
OffRL
31
0
0
18 Jan 2025
AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
Yehonathan Refael
Jonathan Svirsky
Boris Shustin
Wasim Huleihel
Ofir Lindenbaum
32
3
0
31 Dec 2024
Unified Parameter-Efficient Unlearning for LLMs
Chenlu Ding
Jiancan Wu
Yancheng Yuan
Jinda Lu
Kai Zhang
Alex Su
Xiang Wang
Xiangnan He
MU
KELM
97
6
0
30 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
J. Chen
Z. Liu
H. Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
33
6
0
18 Nov 2024
Software Performance Engineering for Foundation Model-Powered Software (FMware)
Haoxiang Zhang
Shi Chang
Arthur Leung
Kishanthan Thangarajah
Boyuan Chen
Hanan Lutfiyya
Ahmed E. Hassan
54
0
0
14 Nov 2024
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare
Souren Pashangpour
Goldie Nejat
LM&MA
42
7
0
05 Nov 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Aaron C. Courville
OffRL
77
4
0
23 Oct 2024
What Do Speech Foundation Models Not Learn About Speech?
Abdul Waheed
Hanin Atwany
Bhiksha Raj
Rita Singh
SSL
27
1
0
16 Oct 2024
ESPACE: Dimensionality Reduction of Activations for Model Compression
Charbel Sakr
Brucek Khailany
15
2
0
07 Oct 2024
ERASMO: Leveraging Large Language Models for Enhanced Clustering Segmentation
Fillipe dos Santos Silva
Gabriel Kenzo Kakimoto
Julio Cesar dos Reis
Marcelo S. Reis
22
0
0
01 Oct 2024
GP-GPT: Large Language Model for Gene-Phenotype Mapping
Yanjun Lyu
Zihao Wu
Lu Zhang
Jing Zhang
Yiwei Li
...
Rongjie Liu
Chao Huang
Wentao Li
Tianming Liu
Dajiang Zhu
LM&MA
22
3
0
15 Sep 2024
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System
Ningyu Zhang
Zekun Xi
Yujie Luo
Peng Wang
Bozhong Tian
...
Lei Liang
Zhiqiang Zhang
Xiaowei Zhu
Jun Zhou
Huajun Chen
KELM
36
6
0
09 Sep 2024
FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats
Xuanliang Zhang
Dingzirui Wang
Longxu Dou
Baoxin Wang
Dayong Wu
Qingfu Zhu
Wanxiang Che
LMTD
ReLM
39
2
0
16 Aug 2024
Recognizing Emotion Regulation Strategies from Human Behavior with Large Language Models
Philipp Müller
Alexander Heimerl
Sayed Muddashir Hossain
Lea Siegel
Jan Alexandersson
Patrick Gebhard
Elisabeth André
T. Schneeberger
17
0
0
08 Aug 2024
In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models
Ayrton San Joaquin
Bin Wang
Zhengyuan Liu
Nicholas Asher
Brian Lim
Philippe Muller
Nancy Chen
22
0
0
07 Aug 2024
A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks
Jiaqi Wang
Hanqi Jiang
Yi-Hsueh Liu
Chong Ma
Xu-Yao Zhang
...
Xin Zhang
Wei Zhang
Dinggang Shen
Tianming Liu
Shu Zhang
VLM
AI4TS
42
30
0
02 Aug 2024
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan
Shuo Zhang
Zerui Wang
Lijuan Jiang
Wenwen Qu
...
Dahua Lin
Yonggang Wen
Xin Jin
Tianwei Zhang
Peng Sun
69
7
0
29 Jul 2024
Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference
Joyjit Kundu
Wenzhe Guo
Ali BanaGozar
Udari De Alwis
Sourav Sengupta
Puneet Gupta
Arindam Mallik
24
3
0
19 Jul 2024
Revolutionizing Bridge Operation and maintenance with LLM-based Agents: An Overview of Applications and Insights
Xinyu-Chen
Lianzhen-Zhang
LLMAG
AI4CE
37
1
0
14 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
33
37
0
09 Jul 2024
FoldGPT: Simple and Effective Large Language Model Compression Scheme
Songwei Liu
Chao Zeng
Lianqiang Li
Chenqian Yan
Lean Fu
Xing Mei
Fangmin Chen
37
4
0
01 Jul 2024
Human-AI Collaborative Taxonomy Construction: A Case Study in Profession-Specific Writing Assistants
Minhwa Lee
Zae Myung Kim
Vivek A. Khetan
Dongyeop Kang
39
3
0
26 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Junnan Dong
Feiran Huang
Xiao Huang
69
49
0
12 Jun 2024
Technical Language Processing for Telecommunications Specifications
Felipe A. Rodriguez Y.
16
0
0
04 Jun 2024
Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange
Ankit Satpute
Noah Giessing
André Greiner-Petter
M. Schubotz
O. Teschke
Akiko Aizawa
Bela Gipp
ELM
LRM
26
18
0
30 Mar 2024
A Moral Imperative: The Need for Continual Superalignment of Large Language Models
Gokul Puthumanaillam
Manav Vora
Pranay Thangeda
Melkior Ornik
29
7
0
13 Mar 2024
Revolutionizing Finance with LLMs: An Overview of Applications and Insights
Huaqin Zhao
Zheng Liu
Zihao Wu
Yiwei Li
Tianze Yang
...
Haixing Dai
Lin Zhao
Gengchen Mai
Ninghao Liu
Tianming Liu
AIFin
34
78
0
22 Jan 2024
Large Language Models for Robotics: Opportunities, Challenges, and Perspectives
Jiaqi Wang
Zihao Wu
Yiwei Li
Hanqi Jiang
Peng Shu
...
Lin Zhao
Bao Ge
Xiang Li
Tianming Liu
Shu Zhang
LM&Ro
27
60
0
09 Jan 2024
Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities
Zheng Liu
Yiwei Li
Qian Cao
Junwen Chen
Tianze Yang
...
John Gibbs
Khaled Rasheed
Ninghao Liu
Gengchen Mai
Tianming Liu
AI4CE
36
10
0
30 Oct 2023
AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology
Haixing Dai
Yiwei Li
Zheng Liu
Lin Zhao
Zihao Wu
...
Quanzheng Li
Zhuo Chen
D. Zhang
Gengchen Mai
Tianming Liu
LM&MA
39
28
0
16 Jun 2023
Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT
Zhe Xiao
Yuzhong Chen
Lu Zhang
Jun Yao
Zihao Wu
...
Yixuan Yuan
Dinggang Shen
Dajiang Zhu
Tianming Liu
Xi Jiang
VLM
MLLM
55
17
0
29 Apr 2023
Prompt Engineering for Healthcare: Methodologies and Applications
Jiaqi Wang
Enze Shi
Sigang Yu
Zihao Wu
Chong Ma
...
Dajiang Zhu
Yixuan Yuan
Dinggang Shen
Tianming Liu
Shu Zhang
LM&MA
42
106
0
28 Apr 2023
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Percy Liang
Christopher Ré
Ion Stoica
Ce Zhang
144
365
0
13 Mar 2023
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
240
1,070
0
05 Oct 2022
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trębacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
225
495
0
28 Sep 2022
MVP: Multi-task Supervised Pre-training for Natural Language Generation
Tianyi Tang
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
35
24
0
24 Jun 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
2,712
0
24 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Stephen H. Bach
Victor Sanh
Zheng-Xin Yong
Albert Webson
Colin Raffel
...
Khalid Almubarak
Xiangru Tang
Dragomir R. Radev
Mike Tian-Jian Jiang
Alexander M. Rush
VLM
215
335
0
02 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
203
1,651
0
15 Oct 2021
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Xiao Liu
Kaixuan Ji
Yicheng Fu
Weng Lam Tam
Zhengxiao Du
Zhilin Yang
Jie Tang
VLM
236
780
0
14 Oct 2021
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
237
690
0
27 Aug 2021
Deduplicating Training Data Makes Language Models Better
Katherine Lee
Daphne Ippolito
A. Nystrom
Chiyuan Zhang
Douglas Eck
Chris Callison-Burch
Nicholas Carlini
SyDa
237
588
0
14 Jul 2021