Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.04511
Cited By
An LLM Compiler for Parallel Function Calling
7 December 2023
Sehoon Kim
Suhong Moon
Ryan Tabrizi
Nicholas Lee
Michael W. Mahoney
Kurt Keutzer
A. Gholami
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An LLM Compiler for Parallel Function Calling"
44 / 44 papers shown
Title
TrialMatchAI: An End-to-End AI-powered Clinical Trial Recommendation System to Streamline Patient-to-Trial Matching
Majd Abdallah
Sigve Nakken
Mariska Bierkens
Johanna Galvis
Alexis Groppi
...
Rodrigo Dienstmann
Remond Fijneman
Eivind Hovig
Gerrit Meijer
Macha Nikolski
14
0
0
13 May 2025
VTS-LLM: Domain-Adaptive LLM Agent for Enhancing Awareness in Vessel Traffic Services through Natural Language
Sijin Sun
Liangbin Zhao
Ming Deng
Xiuju Fu
19
0
0
02 May 2025
Template-Based Financial Report Generation in Agentic and Decomposed Information Retrieval
Yong-En Tian
Yu-Chien Tang
Kuang-Da Wang
An-Zi Yen
Wen-Chih Peng
AIFin
44
0
0
19 Apr 2025
Orchestrating Agents and Data for Enterprise: A Blueprint Architecture for Compound AI
Eser Kandogan
Nikita Bhutani
Dan Zhang
Rafael Li Chen
Sairam Gurajada
Estevam R. Hruschka
AIFin
34
0
0
10 Apr 2025
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
Gleb Rodionov
Roman Garipov
Alina Shutova
George Yakushev
Vage Egiazarian
Anton Sinitsin
Denis Kuznedelev
Dan Alistarh
LRM
27
1
0
08 Apr 2025
Affordable AI Assistants with Knowledge Graph of Thoughts
Maciej Besta
Lorenzo Paleari
Jia Hao Andrea Jiang
Robert Gerstenberger
You Wu
...
Jón Gunnar Hannesson
Grzegorz Kwa'sniewski
Marcin Copik
H. Niewiadomski
Torsten Hoefler
LLMAG
RALM
70
0
0
03 Apr 2025
A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks
Hongjia Liu
Jinlong Li
LRM
47
0
0
29 Mar 2025
Benchmarking Failures in Tool-Augmented Language Models
Eduardo Treviño
Hugo Contant
James Ngai
Graham Neubig
Zora Zhiruo Wang
61
0
0
18 Mar 2025
ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning
Xinyi Wang
Jiashui Wang
Peng Chen
Jinbo Su
Yanming Liu
Long Liu
Yangdong Wang
Qiyuan Chen
Kai Yun
Chunfu Jia
42
0
0
14 Mar 2025
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks
Lutfi Eren Erdogan
Nicholas Lee
Sehoon Kim
Suhong Moon
Hiroki Furuta
Gopala Anumanchipalli
K. K.
Amir Gholami
LLMAG
LM&Ro
AIFin
76
2
0
12 Mar 2025
Multi-Agent Image Restoration
Xu Jiang
G. Li
Bin Chen
Jian Andrew Zhang
50
0
0
12 Mar 2025
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
Chien-Yu Lin
Keisuke Kamahori
Yiyu Liu
Xiaoxiang Shi
Madhav Kashyap
...
Stephanie Wang
Arvind Krishnamurthy
Rohan Kadekodi
Luis Ceze
Baris Kasikci
3DV
VLM
61
1
0
28 Feb 2025
RAG-Optimized Tibetan Tourism LLMs: Enhancing Accuracy and Personalization
Jinhu Qi
Shuai Yan
Yibo Zhang
Wentao Zhang
R. L. Jin
Y. Hu
Ke Wang
3DV
47
1
0
21 Feb 2025
AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution
Zhiqiang Xie
Hao Kang
Ying Sheng
Tushar Krishna
Kayvon Fatahalian
Christos Kozyrakis
LRM
AI4CE
LLMAG
LM&Ro
35
1
0
05 Nov 2024
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh
Pradyot Prakash
Alexander Radovic
Akshay Shekher
Denis Savenkov
LRM
48
1
0
30 Oct 2024
Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks
Graziano A. Manduzio
Federico A. Galatolo
M. G. Cimino
Enzo Pasquale Scilingo
Lorenzo Cominelli
LRM
19
1
0
24 Oct 2024
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning
Arijit Das
16
1
0
21 Oct 2024
Smart Audit System Empowered by LLM
Xu Yao
Xiaoxu Wu
Xi Li
Huan Xu
Chenlei Li
Ping-Chia Huang
Si Li
Xiaoning Ma
Jiulong Shan
16
0
0
10 Oct 2024
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Qiqiang Lin
Muning Wen
Qiuying Peng
Guanyu Nie
Junwei Liao
...
Jiamu Zhou
Cheng Cheng
Yin Zhao
Jun Wang
Weinan Zhang
38
15
0
06 Oct 2024
TinyAgent: Function Calling at the Edge
Lutfi Eren Erdogan
Nicholas Lee
Siddharth Jha
Sehoon Kim
Ryan Tabrizi
Suhong Moon
Coleman Hooper
Gopala Anumanchipalli
Kurt Keutzer
Amir Gholami
LLMAG
39
10
0
01 Sep 2024
A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered Applications are Vulnerable to PromptWares
Stav Cohen
Ron Bitton
Ben Nassi
SILM
33
5
0
09 Aug 2024
Dynamic Fog Computing for Enhanced LLM Execution in Medical Applications
Philipp Zagar
Vishnu Ravi
Lauren Aalami
Stephan Krusche
Oliver Aalami
Paul Schmiedmayer
28
2
0
08 Aug 2024
Revolutionizing Bridge Operation and maintenance with LLM-based Agents: An Overview of Applications and Insights
Xinyu-Chen
Lianzhen-Zhang
LLMAG
AI4CE
37
1
0
14 Jul 2024
Teola: Towards End-to-End Optimization of LLM-based Applications
Xin Tan
Yimin Jiang
Yitao Yang
Hong-Yu Xu
57
5
0
29 Jun 2024
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts
Honghua Dong
Qidong Su
Yubo Gao
Zhaoyu Li
Yangjun Ruan
Gennady Pekhimenko
Chris J. Maddison
Xujie Si
LLMAG
26
1
0
19 Jun 2024
LLM-dCache: Improving Tool-Augmented LLMs with GPT-Driven Localized Data Caching
Simranjit Singh
Michael Fore
Andreas Karatzas
Chaehong Lee
Yanan Jian
Longfei Shangguan
Fuxun Yu
Iraklis Anagnostopoulos
Dimitrios Stamoulis
RALM
22
2
0
10 Jun 2024
RATT: A Thought Structure for Coherent and Correct LLM Reasoning
Jinghan Zhang
Xiting Wang
Weijieying Ren
Lu Jiang
Dongjie Wang
Kunpeng Liu
LRM
32
8
0
04 Jun 2024
CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems
Yanlin Feng
Sajjadur Rahman
Aaron Feng
Vincent Chen
Eser Kandogan
38
4
0
02 Jun 2024
Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution
Yechen Xu
Xinhao Kong
Tingjun Chen
Danyang Zhuo
LLMAG
22
2
0
29 May 2024
Can Github issues be solved with Tree Of Thoughts?
Ricardo La Rosa
Corey Hulse
Bangdi Liu
LRM
LLMAG
30
4
0
20 May 2024
An LLM-Tool Compiler for Fused Parallel Function Calling
Simranjit Singh
Andreas Karatzas
Michael Fore
Iraklis Anagnostopoulos
Dimitrios Stamoulis
LLMAG
27
6
0
07 May 2024
GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots
Simranjit Singh
Michael Fore
Dimitrios Stamoulis
LLMAG
22
11
0
23 Apr 2024
LLMs as Compiler for Arabic Programming Language
Serry Sibaee
Omar Najar
Lahouri Ghouti
Anis Koubaa
24
0
0
24 Mar 2024
ALTO: An Efficient Network Orchestrator for Compound AI Systems
Keshav Santhanam
Deepti Raghavan
Muhammad Shahir Rahman
Thejas Venkatesh
Neha Kunjal
Pratiksha Thaker
Philip Levis
Matei A. Zaharia
24
1
0
07 Mar 2024
Budget-Constrained Tool Learning with Planning
Yuanhang Zheng
Peng Li
Mingshi Yan
Ji Zhang
Fei Huang
Yang Janet Liu
24
3
0
25 Feb 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
63
26
0
25 Jan 2024
SGLang: Efficient Execution of Structured Language Model Programs
Lianmin Zheng
Liangsheng Yin
Zhiqiang Xie
Chuyue Sun
Jeff Huang
...
Christos Kozyrakis
Ion Stoica
Joseph E. Gonzalez
Clark W. Barrett
Ying Sheng
LRM
29
102
0
12 Dec 2023
TypeFly: Flying Drones with Large Language Model
Guojun Chen
Xiaojing Yu
Lin Zhong
35
8
0
08 Dec 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
229
2,413
0
06 Oct 2022
Is a Question Decomposition Unit All We Need?
Pruthvi H. Patel
Swaroop Mishra
Mihir Parmar
Chitta Baral
ReLM
136
50
0
25 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,163
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
194
614
0
20 May 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
278
3,784
0
18 Apr 2021
1