ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.06870
  4. Cited By
Mask R-CNN

Mask R-CNN

20 March 2017
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
    ObjD
ArXivPDFHTML

Papers citing "Mask R-CNN"

50 / 239 papers shown
Title
On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data
On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data
Aitor Martinez-Seras
Javier Del Ser
Alain Andres
Pablo García Bringas
Pablo Garcia-Bringas
OODD
63
0
0
07 Nov 2024
CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation
CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation
Jie Liu
Pan Zhou
Yingjun Du
Ah-Hwee Tan
Cees G. M. Snoek
Jan-Jakob Sonke
E. Gavves
LLMAG
55
2
0
07 Nov 2024
Spatial-Mamba: Effective Visual State Space Models via Structure-aware State Fusion
Spatial-Mamba: Effective Visual State Space Models via Structure-aware State Fusion
Chaodong Xiao
Minghan Li
Zhengqiang Zhang
Deyu Meng
Lei Zhang
Mamba
106
5
0
19 Oct 2024
Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model
Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model
Li Yuan
Yi Cai
Junsheng Huang
VLM
48
2
0
18 Oct 2024
Fractal Calibration for long-tailed object detection
Fractal Calibration for long-tailed object detection
Konstantinos Panagiotis Alexandridis
Ismail Elezi
Jiankang Deng
Anh H. Nguyen
Shan Luo
330
0
0
15 Oct 2024
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent
  Approach
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
Rory Young
Nicolas Pugeault
AAML
78
3
0
14 Oct 2024
CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection
CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection
Mingyi Guo
Yuyang Liu
Zongying Lin
Peixi Peng
Yonghong Tian
Yonghong Tian
VLM
51
0
0
08 Oct 2024
Designing Concise ConvNets with Columnar Stages
Designing Concise ConvNets with Columnar Stages
Ashish Kumar
Jaesik Park
MQ
71
0
0
05 Oct 2024
DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images
DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images
Chen Liu
Danqi Liao
Alejandro Parada-Mayorga
Alejandro Ribeiro
Marcello DiStasio
Smita Krishnaswamy
62
4
0
04 Oct 2024
OmniSR: Shadow Removal under Direct and Indirect Lighting
OmniSR: Shadow Removal under Direct and Indirect Lighting
Jiamin Xu
Zelong Li
Yuxin Zheng
Chenyu Huang
Renshu Gu
Weiwei Xu
Gang Xu
3DV
96
1
0
02 Oct 2024
Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model
Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model
Fulong Ma
Weiqing Qi
Guoyang Zhao
Ming Liu
Jun Ma
DiffM
50
0
0
30 Sep 2024
When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation
When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation
Yuli Zhou
Guolei Sun
Yawei Li
Guo-Sen Xie
Luca Benini
Ender Konukoglu
38
5
0
27 Sep 2024
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
Phillip Mueller
Sebastian Mueller
Lars Mikelsons
37
2
0
25 Sep 2024
MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule
MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule
Guohui Cai
Ying Cai
Zeyu Zhang
Daji Ergu
Yuanzhouhan Cao
...
Zhibin Liao
Binbin Hu
Zhinbin Liao
Yang Zhao
Ying Cai
51
9
0
21 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
329
0
0
19 Sep 2024
Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments
Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments
Gang Chen
Zhaoying Wang
Wei Dong
Javier Alonso-Mora
181
0
0
18 Sep 2024
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild
Rolandos Alexandros Potamias
Jinglei Zhang
Jiankang Deng
Stefanos Zafeiriou
3DH
57
12
0
18 Sep 2024
Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection
Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection
Xiang Fang
Arvind Easwaran
B. Genest
47
4
0
16 Sep 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
81
4
0
16 Sep 2024
Sparse R-CNN OBB: Ship Target Detection in SAR Images Based on Oriented Sparse Proposals
Sparse R-CNN OBB: Ship Target Detection in SAR Images Based on Oriented Sparse Proposals
Kamirul Kamirul
Odysseas A. Pappas
A. Achim
41
0
0
12 Sep 2024
ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics
ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics
Xiaomin Lin
Vivek Mange
Arjun Suresh
Bernhard Neuberger
Aadi Palnitkar
...
Alhim Vera
Markus Vincze
Ioannis Rekleitis
Herbert G. Tanner
Yiannis Aloimonos
60
2
0
11 Sep 2024
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
Amin Karimi Monsefi
Kishore Prakash Sailaja
Ali Alilooee
Ser-Nam Lim
R. Ramnath
VLM
46
6
0
10 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
63
1
0
04 Sep 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
77
2
0
27 Aug 2024
FORGE: Force-Guided Exploration for Robust Contact-Rich Manipulation under Uncertainty
FORGE: Force-Guided Exploration for Robust Contact-Rich Manipulation under Uncertainty
Michael Noseworthy
Bingjie Tang
Bowen Wen
Ankur Handa
Nicholas Roy
Nicholas Roy
Dieter Fox
Yashraj S. Narang
Iretiayo Akinola
Iretiayo Akinola
66
9
0
08 Aug 2024
Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation
Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation
Sai Prasanna
Daniel Honerkamp
Kshitij Sirohi
Tim Welschehold
Wolfram Burgard
Abhinav Valada
66
1
0
05 Aug 2024
Compositional Physical Reasoning of Objects and Events from Videos
Compositional Physical Reasoning of Objects and Events from Videos
Zhenfang Chen
Shilong Dong
Kexin Yi
Yunzhu Li
Mingyu Ding
Antonio Torralba
Joshua B. Tenenbaum
Chuang Gan
OCL
76
1
0
02 Aug 2024
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi
Hyeyoon Lee
Dain Kwon
Sunjong Park
Kyuyeun Kim
Noseong Park
Jinho Lee
Jinho Lee
MQ
71
1
0
29 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
112
1
0
23 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
136
1
0
17 Jul 2024
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh
Jan Kautz
Mamba
70
66
0
10 Jul 2024
CountGD: Multi-Modal Open-World Counting
CountGD: Multi-Modal Open-World Counting
Niki Amini-Naieni
Tengda Han
Andrew Zisserman
ObjD
100
11
0
05 Jul 2024
A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling
A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling
Minghao Zhou
Hong Wang
Yefeng Zheng
Deyu Meng
96
1
0
02 Jul 2024
Robot Instance Segmentation with Few Annotations for Grasping
Robot Instance Segmentation with Few Annotations for Grasping
Moshe Kimhi
David Vainshtein
Chaim Baskin
Dotan Di Castro
78
2
0
01 Jul 2024
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li
Cristina Mata
J. Park
Kumara Kahatapitiya
Yoo Sung Jang
...
Kanchana Ranasinghe
R. Burgert
Mu Cai
Yong Jae Lee
Michael S. Ryoo
LM&Ro
86
26
0
28 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
74
3
0
28 Jun 2024
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
Oluwatosin O. Alabi
K. Toe
Zijian Zhou
Charlie Budd
Nicholas Raison
Miaojing Shi
Tom Vercauteren
ISeg
76
1
0
23 Jun 2024
TraceNet: Segment one thing efficiently
TraceNet: Segment one thing efficiently
Mingyuan Wu
Zichuan Liu
Haozhen Zheng
Hongpeng Guo
Bo Chen
Xin Lu
Klara Nahrstedt
51
0
0
21 Jun 2024
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
Jordy Van Landeghem
Subhajit Maity
Ayan Banerjee
Matthew Blaschko
Marie-Francine Moens
Josep Lladós
Sanket Biswas
77
2
0
12 Jun 2024
A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Hou-I Liu
Yu-Wen Tseng
Kai-Cheng Chang
Pin-Jyun Wang
Hong-Han Shuai
Wen-Huang Cheng
ViT
ObjD
86
25
0
09 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
97
14
0
09 Jun 2024
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
Fang Chen
Gourav Datta
Mujahid Al Rafi
Hyeran Jeon
Meng Tang
112
1
0
06 Jun 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
76
4
0
28 May 2024
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation
Weize Li
Zhicheng Zhao
Haochen Bai
Fei Su
57
0
0
24 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
157
49
0
23 May 2024
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition
HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition
Kun Yuan
V. Srivastav
Nassir Navab
N. Padoy
63
12
0
16 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
91
0
0
15 May 2024
VIEW: Visual Imitation Learning with Waypoints
VIEW: Visual Imitation Learning with Waypoints
Ananth Jonnavittula
Sagar Parekh
Dylan P. Losey
SSL
98
10
0
27 Apr 2024
Closed Loop Interactive Embodied Reasoning for Robot Manipulation
Closed Loop Interactive Embodied Reasoning for Robot Manipulation
Michal Nazarczuk
Jan Kristof Behrens
Karla Stepanova
Matej Hoffmann
K. Mikolajczyk
LM&Ro
LRM
68
1
0
23 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
83
0
0
23 Apr 2024
Previous
12345
Next