Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.10125
Cited By
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
14 July 2024
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"
10 / 10 papers shown
Title
UniHCP: A Unified Model for Human-Centric Perceptions
Yuanzheng Ci
Yizhou Wang
Meilin Chen
Shixiang Tang
Lei Bai
Feng Zhu
Rui Zhao
F. Yu
Donglian Qi
Wanli Ouyang
77
50
0
06 Mar 2023
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
131
75
0
11 Jul 2022
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection
Tim Broedermann
Christos Sakaridis
Dengxin Dai
Luc Van Gool
48
28
0
30 Jun 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Whole-Body Human Pose Estimation in the Wild
Sheng Jin
Lumin Xu
Jin Xu
Can Wang
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
3DH
130
235
0
23 Jul 2020
Detection in Crowded Scenes: One Proposal, Multiple Predictions
Xuangeng Chu
Anlin Zheng
X. Zhang
Jian-jun Sun
ObjD
77
170
0
20 Mar 2020
CrowdHuman: A Benchmark for Detecting Human in a Crowd
Shuai Shao
Zijian Zhao
Boxun Li
Tete Xiao
Gang Yu
Xiangyu Zhang
Jian-jun Sun
205
670
0
30 Apr 2018
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
163
21,643
0
09 Dec 2016
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
281
35,677
0
08 Jun 2015
1