YOLOX: Exceeding YOLO Series in 2021

18 July 2021
Zheng Ge
Songtao Liu
Feng Wang
Zeming Li
Jian Sun
    ObjD
arXiv:2107.08430 (abs) | PDF | HTML | GitHub (9857★)

Papers citing "YOLOX: Exceeding YOLO Series in 2021"

50 / 869 papers shown
Concept-based Explainable Data Mining with VLM for 3D Detection
Mai Tsujimoto
3DPC
222
0
0
05 Dec 2025
From Detection to Association: Learning Discriminative Object Embeddings for Multi-Object Tracking
Yuqing Shao
Yuchen Yang
Rui Yu
Weilong Li
Xu Guo
HuaiCheng Yan
Wei Wang
Xiao Sun
VOT
360
0
0
02 Dec 2025
SelfMOTR: Revisiting MOTR with Self-Generating Detection Priors
Fabian Gülhan
Emil Mededovic
Yuli Wu
Johannes Stegmaier
VOT, MQ
447
0
0
25 Nov 2025
StableTrack: Stabilizing Multi-Object Tracking on Low-Frequency Detections
Matvei Shelukhan
Timur Mamedov
Karina Kvanchiani
VOT
462
0
0
25 Nov 2025
A Tri-Modal Dataset and a Baseline System for Tracking Unmanned Aerial Vehicles
Tianyang Xu
Jinjie Gu
Xuefeng Zhu
Xiaojun Wu
J. Kittler
136
1
0
23 Nov 2025
OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding
Teng Fu
Mengyang Zhao
Ke Niu
Kaixin Peng
Bin Li
110
0
0
21 Nov 2025
MobileOcc: A Human-Aware Semantic Occupancy Dataset for Mobile Robots
Junseo Kim
Guido Dumont
Xinyu Gao
Gang Chen
Holger Caesar
Javier Alonso-Mora
170
0
0
21 Nov 2025
Real-Time 3D Object Detection with Inference-Aligned Learning
Chenyu Zhao
Xianwei Zheng
Zimin Xia
Linwei Yue
Nan Xue
3DPC
266
0
0
20 Nov 2025
PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation
Information Fusion (Inf. Fusion), 2025
Ting Pan
Ye Wang
Peiguang Jing
Rui Ma
Zili Yi
Y. Liu
321
0
0
20 Nov 2025
Enhancing Multi-Camera Gymnast Tracking Through Domain Knowledge Integration
Fan Yang
S. Odashima
S. Masui
Ikuo Kusajima
Sosuke Yamao
Shan Jiang
137
7
0
20 Nov 2025
Fast Post-Hoc Confidence Fusion for 3-Class Open-Set Aerial Object Detection
Spyridon Loukovitis
Vasileios Karampinis
Athanasios Voulodimos
115
0
0
19 Nov 2025
PlugTrack: Multi-Perceptive Motion Analysis for Adaptive Fusion in Multi-Object Tracking
Seungjae Kim
SeungJoon Lee
MyeongAh Cho
149
0
0
17 Nov 2025
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
Jinfu Li
Yuqi Huang
Hong Song
Ting Wang
Jianghan Xia
Yucong Lin
Jingfan Fan
Jian Yang
ObjD
264
0
0
13 Nov 2025
On the Interplay between Positional Encodings, Morphological Complexity, and Word Order Flexibility
Kushal Tatariya
Wessel Poelman
Miryam de Lhoneux
137
0
0
11 Nov 2025
Zero-Shot Multi-Animal Tracking in the Wild
Jan Frederik Meier
Timo Lüddecke
VLM
158
0
0
04 Nov 2025
Contrast-Guided Cross-Modal Distillation for Thermal Object Detection
SiWoo Kim
JhongHyun An
206
0
0
03 Nov 2025
World Simulation with Video Foundation Models for Physical AI
Nvidia
A. M. Ali
Junjie Bai
Maciej Bala
Yogesh Balaji
...
Jing Zhang
Qinsheng Zhang
Kaiwen Zheng
Andrew Zhu
Yuke Zhu
VGen, PINN
633
53
0
28 Oct 2025
DQ3D: Depth-guided Query for Transformer-Based 3D Object Detection in Traffic Scenarios
Ziyu Wang
Wenhao Li
Ji Wu
147
0
0
27 Oct 2025
Monocular Visual 8D Pose Estimation for Articulated Bicycles and Cyclists
Eduardo R. Corral-Soto
Yang Liu
Y. Ren
Bai Dongfeng
Liu Bingbing
158
1
0
23 Oct 2025
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Andrea Agiollo
Andrea Omicini
LM&Ro, AI4CE
204
0
0
23 Oct 2025
ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection
Haowei Zhu
Tianxiang Pan
Rui Qin
Jun-Hai Yong
Bin Wang
DiffM
233
1
0
17 Oct 2025
Valeo Near-Field: a novel dataset for pedestrian intent detection
Antonyo Musabini
Rachid Benmokhtar
Jagdish Bhanushali
Victor Galizzi
Bertrand Luvison
Xavier Perrotton
150
0
0
17 Oct 2025
DMTrack: Deformable State-Space Modeling for UAV Multi-Object Tracking with Kalman Fusion and Uncertainty-Aware Association
Zenghuang Fu
Xiaofeng Han
Mingda Jia
Jin ming Yang
Qi Zeng
Muyang Zahng
Changwei Wang
Weiliang Meng
Xiaopeng Zhang
152
0
0
15 Oct 2025
An Analytical Framework to Enhance Autonomous Vehicle Perception for Smart Cities
Jalal Khan
Manzoor Khan
Sherzod Turaev
Sumbal Malik
Hesham El-Sayed
Farman Ullah
116
1
0
15 Oct 2025
DEF-YOLO: Leveraging YOLO for Concealed Weapon Detection in Thermal Imaging
Divya Bhardwaj
Arnav Ramamoorthy
Poonam Goyal
158
1
0
15 Oct 2025
SpikePool: Event-driven Spiking Transformer with Pooling Attention
Donghyun Lee
Alex Sima
Yuhang Li
Panos Stinis
Priyadarshini Panda
126
0
0
14 Oct 2025
MultiFoodhat: A potential new paradigm for intelligent food quality inspection
Yue Hu
Guohang Zhuang
170
0
0
14 Oct 2025
Adap-RPF: Adaptive Trajectory Sampling for Robot Person Following in Dynamic Crowded Environments
Weixi Situ
Hanjing Ye
Jianwei Peng
Yu Zhan
Hong Zhang
142
0
0
13 Oct 2025
Fast Self-Supervised depth and mask aware Association for Multi-Object Tracking
Milad Khanchi
Maria Amer
Charalambos Poullis
VOT
294
0
0
10 Oct 2025
PRNet: Original Information Is All You Have
PeiHuang Zheng
Yunlong Zhao
Zheng Cui
Yang Li
110
1
0
10 Oct 2025
SPICE: Simple and Practical Image Clarification and Enhancement
Alexander Belyaev
Pierre-Alain Fayolle
Michael Cohen
137
0
0
09 Oct 2025
Explaining raw data complexity to improve satellite onboard processing
Adrien Dorise
Marjorie Bellizzi
Adrien Girard
Benjamin Francesconi
Stéphane May
173
0
0
08 Oct 2025
StereoSync: Spatially-Aware Stereo Audio Generation from Video
Christian Marinoni
R. F. Gramaccioni
Kazuki Shimada
Takashi Shibuya
Yuki Mitsufuji
Danilo Comminiello
DiffM, VGen
135
2
0
07 Oct 2025
Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests
Aoduo Li
Peikai Lin
Jiancheng Li
Zhen Zhang
Shiting Wu
Zexiao Liang
Zhifa Jiang
186
0
0
01 Oct 2025
FracDetNet: Advanced Fracture Detection via Dual-Focus Attention and Multi-scale Calibration in Medical X-ray Imaging
Yuyang Sun
Cuiming Zou
OOD, MedIm
112
0
0
27 Sep 2025
Real-Time Object Detection Meets DINOv3
Shihua Huang
Yongjie Hou
Longfei Liu
Xuanlong Yu
Xi Shen
ObjD, 3DH, PINN, VLM
542
17
0
25 Sep 2025
CompressAI-Vision: Open-source software to evaluate compression methods for computer vision tasks
Hyomin Choi
Heeji Han
Chris Rosewarne
Fabien Racapé
305
2
0
25 Sep 2025
X-Streamer: Unified Human World Modeling with Audiovisual Interaction
You Xie
Tianpei Gu
Zenan Li
Chenxu Zhang
Guoxian Song
Xiaochen Zhao
C. Liang
Jianwen Jiang
Hongyi Xu
Linjie Luo
VGen
318
8
0
25 Sep 2025
Punching Above Precision: Small Quantized Model Distillation with Learnable Regularizer
Abdur Rehman
S. Sharif
Md Abdur Rahaman
M. J. Aashik Rasool
Seongwan Kim
J. Lee
MQ
165
2
0
25 Sep 2025
Visual Detector Compression via Location-Aware Discriminant Analysis
Qizhen Lan
Jung Im Choi
Qing Tian
138
2
0
22 Sep 2025
SFN-YOLO: Towards Free-Range Poultry Detection via Scale-aware Fusion Networks
Jie Chen
Yuhong Feng
Tao Dai
Mingzhe Liu
Hongtao Chen
Zhaoxi He
Jiancong Bai
81
0
0
21 Sep 2025
Task-Aware Image Signal Processor for Advanced Visual Perception
Kai Chen
Jin Xiao
Leheng Zhang
Kexuan Shi
Shuhang Gu
168
0
0
17 Sep 2025
VSE-MOT: Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Enhancement
Jun Du
Weiwei Xing
Ming Li
Fei Richard Yu
DiffM
196
0
0
17 Sep 2025
Multi-animal tracking in Transition: Comparative Insights into Established and Emerging Methods
Smart Agricultural Technology (SAT), 2025
Anne Marthe Sophie Ngo Bibinbe
Patrick Gagnon
Jamie Ahloy-Dallaire
Eric R. Paquet
VOT
262
0
0
15 Sep 2025
Motion Estimation for Multi-Object Tracking using KalmanNet with Semantic-Independent Encoding
Jian Song
Wei Mei
Yunfeng Xu
Qiang Fu
Renke Kou
Lina Bu
Yucheng Long
227
0
0
14 Sep 2025
An HMM-based framework for identity-aware long-term multi-object tracking from sparse and uncertain identification: use case on long-term tracking in livestock
Anne Marthe Sophie Ngo Bibinbe
Chiron Bang
Patrick Gagnon
Jamie Ahloy-Dallaire
Eric R. Paquet
VOT
269
0
0
12 Sep 2025
Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation
Yuiko Uchida
Ren Togo
Keisuke Maeda
Takahiro Ogawa
Miki Haseyama
253
0
0
11 Sep 2025
Model-Agnostic Open-Set Air-to-Air Visual Object Detection for Reliable UAV Perception
Spyridon Loukovitis
Anastasios Arsenos
Vasileios Karampinis
Athanasios Voulodimos
132
2
0
11 Sep 2025
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts
Jenna Kang
Maria Silva
Patsorn Sangkloy
Kenneth Chen
Niall Williams
Qi Sun
EGVM, VGen
195
1
0
10 Sep 2025
TinyDef-DETR: A DETR-based Framework for Defect Detection in Transmission Lines from UAV Imagery
Feng Shen
Jiaming Cui
Shuai Zhou
Wenqiang Li
Ruifeng Qin
295
0
0
07 Sep 2025