When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
arXiv: 2407.10125

14 July 2024
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu

Papers cited by "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"

10 / 10 papers shown
UniHCP: A Unified Model for Human-Centric Perceptions
Yuanzheng Ci
Yizhou Wang
Meilin Chen
Shixiang Tang
Lei Bai
Feng Zhu
Rui Zhao
F. Yu
Donglian Qi
Wanli Ouyang
74
50
0
06 Mar 2023
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
131
75
0
11 Jul 2022
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection
Tim Broedermann
Christos Sakaridis
Dengxin Dai
Luc Van Gool
48
28
0
30 Jun 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Whole-Body Human Pose Estimation in the Wild
Sheng Jin
Lumin Xu
Jin Xu
Can Wang
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
3DH
130
235
0
23 Jul 2020
Detection in Crowded Scenes: One Proposal, Multiple Predictions
Xuangeng Chu
Anlin Zheng
X. Zhang
Jian-jun Sun
ObjD
77
170
0
20 Mar 2020
CrowdHuman: A Benchmark for Detecting Human in a Crowd
Shuai Shao
Zijian Zhao
Boxun Li
Tete Xiao
Gang Yu
Xiangyu Zhang
Jian-jun Sun
205
670
0
30 Apr 2018
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
154
3,574
0
09 Dec 2016
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
269
35,677
0
08 Jun 2015