Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.02413
Cited By
Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks
IEEE Transactions on Mobile Computing (IEEE TMC), 2025
5 May 2025
Baoxia Du
H. Du
Dusit Niyato
Ruidong Li
Re-assign community
ArXiv (abs)
PDF
HTML
Github (54811★)
Papers citing
"Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks"
42 / 42 papers shown
Semantic Edge-Cloud Communication for Real-Time Urban Traffic Surveillance with ViT and LLMs over Mobile Networks
IEEE Transactions on Network Science and Engineering (IEEE TNS&E), 2025
Murat Arda Onsu
Poonam Lohan
Burak Kantarci
Aisha Syed
Matthew Andrews
Sean Kennedy
216
4
0
25 Sep 2025
Edge-Cloud Collaborative Motion Planning for Autonomous Driving with Large Language Models
International Conference on Speech Technology and Human-Computer Dialogue (ICSTHD), 2024
Jiao Chen
Suyan Dai
Fangfang Chen
Zuohong Lv
Jianhua Tang
249
10
0
19 Aug 2024
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Wenhao Shi
Zhiqiang Hu
Yi Bin
Junhua Liu
Yang Yang
See-Kiong Ng
Lidong Bing
Roy Ka-Wei Lee
SyDa
MLLM
LRM
436
123
0
25 Jun 2024
A Superalignment Framework in Autonomous Driving with Large Language Models
Xiangrui Kong
Thomas Braunl
Marco Fahmi
Yue Wang
241
17
0
09 Jun 2024
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models
Chunjiang Ge
Sijie Cheng
Xiangqi Jin
Jiale Yuan
Yuan Gao
Jun Song
Shiji Song
Gao Huang
Bo Zheng
MLLM
VLM
222
25
0
24 May 2024
PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services
Zheming Yang
Yuanhao Yang
Chang Zhao
Qi Guo
Wenkai He
Wen Ji
272
38
0
23 May 2024
Imp: Highly Capable Large Multimodal Models for Mobile Devices
Zhenwei Shao
Zhou Yu
Jun Yu
Xuecheng Ouyang
Lihao Zheng
Zhenbiao Gai
Mingyang Wang
Jiajun Ding
361
24
0
20 May 2024
Efficient Multimodal Large Language Models: A Survey
Yizhang Jin
Jian Li
Yexin Liu
Tianjun Gu
Kai Wu
...
Xin Tan
Zhenye Gan
Yabiao Wang
Chengjie Wang
Lizhuang Ma
LRM
352
97
0
17 May 2024
Resource Allocation in Large Language Model Integrated 6G Vehicular Networks
Chang Liu
Jun Zhao
211
19
0
27 Mar 2024
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
Yuzhang Shang
Mu Cai
Bingxin Xu
Yong Jae Lee
Yan Yan
VLM
671
275
0
22 Mar 2024
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
European Conference on Computer Vision (ECCV), 2024
Ruyi Xu
Yuan Yao
Zonghao Guo
Junbo Cui
Zanlin Ni
Chunjiang Ge
Tat-Seng Chua
Zhiyuan Liu
Maosong Sun
Gao Huang
VLM
MLLM
487
190
0
18 Mar 2024
SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
Guohao Sun
Can Qin
Jiamian Wang
Zeyuan Chen
Ran Xu
Zhiqiang Tao
MLLM
VLM
LRM
340
25
0
17 Mar 2024
TinyLLaVA: A Framework of Small-scale Large Multimodal Models
Baichuan Zhou
Ying Hu
Xi Weng
Junlong Jia
Jie Luo
Xien Liu
Ji Wu
Lei Huang
MLLM
213
174
0
22 Feb 2024
LMaaS: Exploring Pricing Strategy of Large Model as a Service for Communication
Panlong Wu
Qi Liu
Yanjie Dong
Fangxin Wang
319
7
0
05 Jan 2024
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Xiangxiang Chu
Limeng Qiao
Xinyang Lin
Shuang Xu
Yang Yang
...
Fei Wei
Xinyu Zhang
Bo Zhang
Xiaolin Wei
Chunhua Shen
MLLM
346
81
0
28 Dec 2023
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Zhengqing Yuan
Zhaoxu Li
Weiran Huang
Yanfang Ye
Lichao Sun
371
79
0
28 Dec 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
395
473
0
21 Nov 2023
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
An Yan
Zhengyuan Yang
Wanrong Zhu
Kevin Qinghong Lin
Linjie Li
...
Yiwu Zhong
Julian McAuley
Jianfeng Gao
Zicheng Liu
Lijuan Wang
LLMAG
LM&Ro
446
150
0
13 Nov 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
347
160
0
22 Oct 2023
Mistral 7B
Albert Q. Jiang
Alexandre Sablayrolles
A. Mensch
Chris Bamford
Devendra Singh Chaplot
...
Teven Le Scao
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE
LRM
504
3,229
0
10 Oct 2023
Improved Baselines with Visual Instruction Tuning
Computer Vision and Pattern Recognition (CVPR), 2023
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLM
MLLM
719
4,630
0
05 Oct 2023
PB-LLM: Partially Binarized Large Language Models
International Conference on Learning Representations (ICLR), 2023
Yuzhang Shang
Zhihang Yuan
Qiang Wu
Zhen Dong
MQ
418
86
0
29 Sep 2023
Drive as You Speak: Enabling Human-Like Interaction with Large Language Models in Autonomous Vehicles
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Ziran Wang
269
162
0
19 Sep 2023
Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Foundations and Trends in Computer Graphics and Vision (FTCGV), 2023
Chunyuan Li
Zhe Gan
Zhengyuan Yang
Jianwei Yang
Linjie Li
Lijuan Wang
Jianfeng Gao
MLLM
460
358
0
18 Sep 2023
A Survey on Model Compression for Large Language Models
Transactions of the Association for Computational Linguistics (TACL), 2023
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
451
396
0
15 Aug 2023
A Survey on Multimodal Large Language Models
National Science Review (NSR), 2023
Xinglong Mao
Chaoyou Fu
Zhengye Zhang
Ke Li
Xing Sun
Tong Xu
Enhong Chen
MLLM
LRM
576
1,182
0
23 Jun 2023
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Neural Information Processing Systems (NeurIPS), 2023
Chunyuan Li
Cliff Wong
Sheng Zhang
Naoto Usuyama
Haotian Liu
Jianwei Yang
Tristan Naumann
Hoifung Poon
Jianfeng Gao
LM&MA
MedIm
409
1,495
0
01 Jun 2023
On the Hidden Mystery of OCR in Large Multimodal Models
Science China Information Sciences (Sci China Inf Sci), 2023
Yuliang Liu
Zhang Li
Mingxin Huang
Chunyuan Li
Dezhi Peng
Mingyu Liu
Lianwen Jin
Xiang Bai
VLM
MLLM
485
117
0
13 May 2023
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
International Conference on Learning Representations (ICLR), 2023
Deyao Zhu
Jun Chen
Xiaoqian Shen
Xiang Li
Mohamed Elhoseiny
VLM
MLLM
568
2,947
0
20 Apr 2023
Visual Instruction Tuning
Neural Information Processing Systems (NeurIPS), 2023
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
1.3K
8,387
0
17 Apr 2023
Sigmoid Loss for Language Image Pre-Training
IEEE International Conference on Computer Vision (ICCV), 2023
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
2.3K
2,650
0
27 Mar 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
5.2K
22,870
0
15 Mar 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
19.7K
18,945
0
27 Feb 2023
Joint Task and Data Oriented Semantic Communications: A Deep Separate Source-channel Coding Scheme
IEEE Internet of Things Journal (IEEE IoT J.), 2023
Jianhao Huang
Dongxu Li
Chenyu Huang
Xiaoqi Qin
Wei Zhang
301
43
0
27 Feb 2023
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers
M. Lewis
Younes Belkada
Luke Zettlemoyer
MQ
595
907
0
15 Aug 2022
Exploring Attention-Aware Network Resource Allocation for Customized Metaverse Services
IEEE Network (IEEE Netw.), 2022
H. Du
Jiacheng Wang
Dusit Niyato
Jiawen Kang
Zehui Xiong
Xuemin
X. Shen
Dong In Kim
EgoV
286
55
0
31 Jul 2022
A ConvNet for the 2020s
Computer Vision and Pattern Recognition (CVPR), 2022
Zhuang Liu
Hanzi Mao
Chaozheng Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
ViT
797
7,672
0
10 Jan 2022
Learning Transferable Visual Models From Natural Language Supervision
International Conference on Machine Learning (ICML), 2021
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
2.1K
45,649
0
26 Feb 2021
Utilising Visual Attention Cues for Vehicle Detection and Tracking
International Conference on Pattern Recognition (ICPR), 2020
Feiyan Hu
M. VenkateshG
Noel E. O'Connor
Alan F. Smeaton
Suzanne Little
168
8
0
31 Jul 2020
Personalized Saliency and its Prediction
Yanyu Xu
Shenghua Gao
Junru Wu
Nianyi Li
Jingyi Yu
379
54
0
09 Oct 2017
SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
Junting Pan
Cristian Canton Ferrer
Kevin McGuinness
Noel E. O'Connor
Jordi Torres
E. Sayrol
Xavier Giró-i-Nieto
GAN
380
425
0
04 Jan 2017
DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations
S. Kruthiventi
Kumar Ayush
R. Venkatesh Babu
253
504
0
10 Oct 2015
1
Page 1 of 1