Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.06051
Cited By
Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation
12 April 2023
Yifeng Shi
Feng Lv
Xinliang Wang
Chunlong Xia
Shaojie Li
Shu-Zhen Yang
Teng Xi
Gang Zhang
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation"
10 / 10 papers shown
Title
EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection
Pengyu Li
Chenhe Liu
Tengfei Li
Xinyu Wang
Shihui Zhang
Dongyang Yu
26
1
0
26 Aug 2024
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
Zhen Qin
Daoyuan Chen
Wenhao Zhang
Liuyi Yao
Yilun Huang
Bolin Ding
Yaliang Li
Shuiguang Deng
48
5
0
11 Jul 2024
Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene
Ruiyang Zhang
Hu Zhang
Hang Yu
Zhedong Zheng
3DPC
33
1
0
11 Jul 2024
UniRGB-IR: A Unified Framework for Visible-Infrared Semantic Tasks via Adapter Tuning
Maoxun Yuan
Bo Cui
Tianyi Zhao
Xingxing Wei
Shan Fu
Xue Yang
Xingxing Wei
35
0
0
26 Apr 2024
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions
Chunlong Xia
Xinliang Wang
Feng Lv
Xin Hao
Yifeng Shi
ViT
26
45
0
12 Mar 2024
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models
Weijiao Zhang
Jindong Han
Zhao Xu
Hang Ni
Hao Liu
Hui Xiong
Hui Xiong
AI4CE
77
15
0
30 Jan 2024
C
2
\mathbf{C}^2
C
2
Former: Calibrated and Complementary Transformer for RGB-Infrared Object Detection
Maoxun Yuan
Xingxing Wei
ViT
23
38
0
28 Jun 2023
Visual Tuning
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
39
38
0
10 May 2023
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,412
0
11 Nov 2021
Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification
Ardhendu Behera
Zachary Wharton
Pradeep Ruwan Padmasiri Galbokka Hewage
Asish Bera
59
108
0
17 Jan 2021
1