PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K
Text-to-Image GenerationEuropean Conference on Computer Vision (ECCV), 2024 |
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with
Competitive Query Selection and Adaptive Feature Fusion Junjie Guo Chenqiang Gao Fangcen Liu Deyu Meng Xinbo Gao |
Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D
Object DetectionEuropean Conference on Computer Vision (ECCV), 2024 |
SHViT: Single-Head Vision Transformer with Memory Efficient Macro DesignComputer Vision and Pattern Recognition (CVPR), 2024 |