Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.00273
Cited By
Cross-Modality Fusion Transformer for Multispectral Object Detection
30 October 2021
Q. Fang
D. Han
Zhaokui Wang
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cross-Modality Fusion Transformer for Multispectral Object Detection"
33 / 33 papers shown
Title
Transformer-Based Dual-Optical Attention Fusion Crowd Head Point Counting and Localization Network
Fei Zhou
Yi Li
Mingqing Zhu
21
0
0
11 May 2025
Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection
Qishun Wang
Zhengzheng Tu
Chenglong Li
Bo Jiang
VOS
44
0
0
16 Apr 2025
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
Jiaqing Zhang
Mingxiang Cao
Weiying Xie
Jie Lei
Daixun Li
Wenbo Huang
Yunsong Li
Xue Yang
48
4
0
28 Jan 2025
Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection
Ayush Ghadiya
P. Kar
Vishal M. Chudasama
Pankaj Wasnik
41
1
0
31 Dec 2024
IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks
Yaming Zhang
Chenqiang Gao
Fangcen Liu
Junjie Guo
Lan Wang
Xinggan Peng
Deyu Meng
87
0
0
21 Dec 2024
Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks
Chen Zhou
Peng Cheng
Junfeng Fang
Y. Zhang
Yibo Yan
X. Jia
Yanyan Xu
K. Wang
Xiaochun Cao
71
0
0
27 Nov 2024
SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection
Shuhan Dong
Yunsong Li
Weiying Xie
Jiaqing Zhang
Jiayuan Tian
Danian Yang
Jie Lei
26
0
0
15 Oct 2024
IVGF: The Fusion-Guided Infrared and Visible General Framework
Fangcen Liu
Chenqiang Gao
Fang Chen
Pengcheng Li
Junjie Guo
Deyu Meng
29
0
0
02 Sep 2024
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection
Junjie Guo
Chenqiang Gao
Fangcen Liu
Deyu Meng
ViT
32
1
0
12 Aug 2024
MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection
Xiangbo Gao
A. Kanu-Asiegbu
Xiaoxiao Du
Mamba
26
0
0
02 Aug 2024
FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network
Weiying Xie
Yusi Zhang
Tianlin Hui
Jiaqing Zhang
Jie Lei
Yunsong Li
27
1
0
23 Jul 2024
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
Zeyu Wang
Jingyu Lin
Yifei Qian
Yi Huang
Shicen Tian
...
Qu Yang
Lan Du
Cunjian Chen
Yufei Guo
Kejie Huang
DiffM
VLM
28
2
0
22 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
32
4
0
14 Jul 2024
MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection
H. R. Medeiros
David Latortue
Fidel Alejandro Guerrero Peña
Eric Granger
M. Pedersoli
19
0
0
29 Apr 2024
Fusion-Mamba for Cross-modality Object Detection
Wenhao Dong
Haodong Zhu
Shaohui Lin
Xiaoyan Luo
Yunhang Shen
Xuhui Liu
Juan Zhang
Guodong Guo
Baochang Zhang
Mamba
35
26
0
14 Apr 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
38
2
0
22 Mar 2024
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Taeheon Kim
Sebin Shin
Youngjoon Yu
Hak Gu Kim
Y. Ro
33
5
0
02 Mar 2024
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion
Junjie Guo
Chenqiang Gao
Fangcen Liu
Deyu Meng
Xinbo Gao
30
8
0
01 Mar 2024
Pedestrian Detection in Low-Light Conditions: A Comprehensive Survey
Bahareh Ghari
Ali Tourani
A. Shahbahrami
Georgi Gaydadjiev
24
15
0
15 Jan 2024
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation
Aniruddh Sikdar
Jayant Teotia
Suresh Sundaram
24
2
0
04 Dec 2023
Fuse It or Lose It: Deep Fusion for Multimodal Simulation-Based Inference
Marvin Schmitt
Stefan T. Radev
Paul-Christian Burkner
40
5
0
17 Nov 2023
RGB-X Object Detection via Scene-Specific Fusion Modules
Sri Aditya Deevi
Connor T. Lee
Lu Gan
Sushruth Nagesh
Gaurav Pandey
Soon-Jo Chung
3DPC
15
11
0
30 Oct 2023
Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images
Bissmella Bahaduri
Zuheng Ming
Fangchen Feng
Anissa Mokraou
16
1
0
21 Oct 2023
ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object Detection
Jifeng Shen
Yifei Chen
Yue Liu
Xin Zuo
Heng Fan
Wankou Yang
ViT
13
87
0
15 Aug 2023
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
Sixun Dong
Huazhang Hu
Dongze Lian
Weixin Luo
Yichen Qian
Shenghua Gao
ViT
AI4TS
21
11
0
22 Mar 2023
MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields
Jiaying Lu
Yongchen Qian
Shifan Zhao
Yuanzhe Xi
Carl Yang
VLM
19
3
0
06 Feb 2023
MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization
Yinghui Xing
Song Wang
Shizhou Zhang
Guoqiang Liang
Xiuwei Zhang
Yanning Zhang
ViT
20
7
0
01 Feb 2023
Guided Hybrid Quantization for Object detection in Multimodal Remote Sensing Imagery via One-to-one Self-teaching
Jiaqing Zhang
Jie Lei
Weiying Xie
Yunsong Li
X. Jia
MQ
19
18
0
31 Dec 2022
Unsupervised RGB-to-Thermal Domain Adaptation via Multi-Domain Attention Network
L. Gan
Connor T. Lee
Soon-Jo Chung
23
15
0
09 Oct 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
41
522
0
13 Jun 2022
Emergent Visual Sensors for Autonomous Vehicles
You Li
Julien Moreau
J. Ibañez-Guzmán
40
27
0
19 May 2022
Cross-Modality High-Frequency Transformer for MR Image Super-Resolution
Chaowei Fang
Di Zhang
Liang Wang
Yulun Zhang
Lechao Cheng
Junwei Han
ViT
MedIm
13
46
0
29 Mar 2022
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
410
594
0
21 Jul 2020
1