RT-LM: Uncertainty-Aware Resource Management for Real-Time Inference of Language Models

12 September 2023
Yufei Li
Zexin Li
Wei Yang
Cong Liu

Papers citing "RT-LM: Uncertainty-Aware Resource Management for Real-Time Inference of Language Models"

11 / 11 papers shown
FogROS2-FT: Fault Tolerant Cloud Robotics
Kaiyuan Chen
Kush Hari
Trinity Chung
Michael Wang
Nan Tian
...
Jeffrey Ichnowski
Liu Ren
John Kubiatowicz
Ion Stoica
Ken Goldberg
64
0
0
06 Dec 2024
Real-Time Human Action Recognition on Embedded Platforms
Ruiqi Wang
Zichen Wang
Peiqi Gao
Mingzhen Li
Jaehwan Jeong
Yihang Xu
Yejin Lee
Carolyn M. Baum
Lisa Connor
Chenyang Lu
28
0
0
09 Sep 2024
Beyond the Bridge: Contention-Based Covert and Side Channel Attacks on Multi-GPU Interconnect
Yicheng Zhang
Ravan Nazaraliyev
S. B. Dutta
Nael B. Abu-Ghazaleh
Andres Marquez
Kevin Barker
GNN
17
4
0
05 Apr 2024
Genie: Smart ROS-based Caching for Connected Autonomous Robots
Zexin Li
Soroush Bateni
Cong Liu
27
1
0
29 Feb 2024
Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack
Yu Fu
Yufei Li
Wen Xiao
Cong Liu
Yue Dong
AAML
29
5
0
12 Dec 2023
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework
Yiming Chen
Yan Zhang
Bin Wang
Zuozhu Liu
Haizhou Li
27
24
0
30 Oct 2022
DeepPerform: An Efficient Approach for Performance Testing of Resource-Constrained Neural Networks
Simin Chen
Mirazul Haque
Cong Liu
Wei Yang
39
21
0
10 Oct 2022
Demand Layering for Real-Time DNN Inference with Minimized Memory Usage
Min-Zhi Ji
Saehanseul Yi
Chang-Mo Koo
Sol Ahn
Dongjoo Seo
N. Dutt
Jong-Chan Kim
29
16
0
08 Oct 2022
LLMEffiChecker: Understanding and Testing Efficiency Degradation of Large Language Models
Simin Chen
Cong Liu
Mirazul Haque
Wei Yang
34
21
0
07 Oct 2022
NVRadarNet: Real-Time Radar Obstacle and Free Space Detection for Autonomous Driving
A. Popov
Patrik Gebhardt
Ke Chen
Ryan Oldja
Heeseok Lee
S. Murray
Ruchita Bhargava
Nikolai Smolyanskiy
39
25
0
29 Sep 2022
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
247
9,042
0
06 Jun 2015