Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.06619
Cited By
RT-LM: Uncertainty-Aware Resource Management for Real-Time Inference of Language Models
12 September 2023
Yufei Li
Zexin Li
Wei Yang
Cong Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RT-LM: Uncertainty-Aware Resource Management for Real-Time Inference of Language Models"
11 / 11 papers shown
Title
FogROS2-FT: Fault Tolerant Cloud Robotics
Kaiyuan Chen
Kush Hari
Trinity Chung
Michael Wang
Nan Tian
...
Jeffrey Ichnowski
Liu Ren
John Kubiatowicz
Ion Stoica
Ken Goldberg
64
0
0
06 Dec 2024
Real-Time Human Action Recognition on Embedded Platforms
Ruiqi Wang
Zichen Wang
Peiqi Gao
Mingzhen Li
Jaehwan Jeong
Yihang Xu
Yejin Lee
Carolyn M. Baum
Lisa Connor
Chenyang Lu
28
0
0
09 Sep 2024
Beyond the Bridge: Contention-Based Covert and Side Channel Attacks on Multi-GPU Interconnect
Yicheng Zhang
Ravan Nazaraliyev
S. B. Dutta
Nael B. Abu-Ghazaleh
Andres Marquez
Kevin Barker
GNN
17
4
0
05 Apr 2024
Genie: Smart ROS-based Caching for Connected Autonomous Robots
Zexin Li
Soroush Bateni
Cong Liu
27
1
0
29 Feb 2024
Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack
Yu Fu
Yufei Li
Wen Xiao
Cong Liu
Yue Dong
AAML
29
5
0
12 Dec 2023
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework
Yiming Chen
Yan Zhang
Bin Wang
Zuozhu Liu
Haizhou Li
27
24
0
30 Oct 2022
DeepPerform: An Efficient Approach for Performance Testing of Resource-Constrained Neural Networks
Simin Chen
Mirazul Haque
Cong Liu
Wei Yang
39
21
0
10 Oct 2022
Demand Layering for Real-Time DNN Inference with Minimized Memory Usage
Min-Zhi Ji
Saehanseul Yi
Chang-Mo Koo
Sol Ahn
Dongjoo Seo
N. Dutt
Jong-Chan Kim
29
16
0
08 Oct 2022
LLMEffiChecker: Understanding and Testing Efficiency Degradation of Large Language Models
Simin Chen
Cong Liu
Mirazul Haque
Wei Yang
34
21
0
07 Oct 2022
NVRadarNet: Real-Time Radar Obstacle and Free Space Detection for Autonomous Driving
A. Popov
Patrik Gebhardt
Ke Chen
Ryan Oldja
Heeseok Lee
S. Murray
Ruchita Bhargava
Nikolai Smolyanskiy
39
25
0
29 Sep 2022
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
247
9,042
0
06 Jun 2015
1