ResearchTrend.AI
Segment Anything

5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
    MLLM
    VLM

Papers citing "Segment Anything"

50 / 4,189 papers shown
Title
SciGraphQA: A Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs
Sheng Li
Nima Tajbakhsh
MLLM
18
48
0
07 Aug 2023
Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing
Junyi Zeng
Chong Bao
Ruiguo Chen
Zilong Dong
Guofeng Zhang
Hujun Bao
Zhaopeng Cui
AI4CE
28
26
0
07 Aug 2023
Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities
Rohit Mohan
J. Arce
Sassan Mokhtar
Daniele Cattaneo
Abhinav Valada
34
1
0
06 Aug 2023
EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-world Understanding
Jiazhou Zhou
Xueye Zheng
Yuanhuiyi Lyu
Lin Wang
VLM
24
16
0
06 Aug 2023
UGainS: Uncertainty Guided Anomaly Instance Segmentation
Alexey Nekrasov
Alexander Hermans
L. Kuhnert
Bastian Leibe
32
11
0
03 Aug 2023
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
Weiyun Wang
Min Shi
Qingyun Li
Wen Wang
Zhenhang Huang
...
Zhiguo Cao
Yushi Chen
Tong Lu
Jifeng Dai
Yu Qiao
LRM
MLLM
48
84
0
03 Aug 2023
RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension
Qiang-feng Zhou
Chaohui Yu
Shaofeng Zhang
Sitong Wu
Zhibin Wang
Fan Wang
34
27
0
03 Aug 2023
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport
Wentong Li
Yu-Jie Yuan
Song Wang
Jianke Zhu
Jianshu Li
Jian Liu
Lei Zhang
3DPC
OT
32
19
0
03 Aug 2023
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts
Bang An
Sicheng Zhu
Michael-Andrei Panaitescu-Liess
Chaithanya Kumar Mummadi
Furong Huang
VLM
33
7
0
02 Aug 2023
DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation
Jingfan Chen
Yuxi Wang
Peng Wang
Xiao Chen
Zhaoxiang Zhang
Zhen Lei
Qing Li
DiffM
31
5
0
02 Aug 2023
LISA: Reasoning Segmentation via Large Language Model
Xin Lai
Zhuotao Tian
Yukang Chen
Yanwei Li
Yuhui Yuan
Shu Liu
Jiaya Jia
LM&Ro
VLM
MLLM
LRM
31
399
0
01 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
46
41
0
01 Aug 2023
DiVa-360: The Dynamic Visual Dataset for Immersive Neural Fields
Chengkun Lu
Peisen Zhou
Angela Xing
Chandradeep Pokhariya
Arnab Dey
...
Rugved Mavidipalli
Dylan Hu
Andrew I. Comport
Kefan Chen
Srinath Sridhar
3DH
VGen
29
5
0
31 Jul 2023
SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model
Shili Zhou
Ruian He
Weimin Tan
Bo Yan
VLM
27
12
0
31 Jul 2023
Transferable Attack for Semantic Segmentation
Mengqi He
Jing Zhang
Zhaoyuan Yang
Mingyi He
Nick Barnes
Yuchao Dai
38
2
0
31 Jul 2023
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks
Kousik Rajesh
Mrigank Raman
M. A. Karim
Pranit Chawla
VLM
25
2
0
31 Jul 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
LRM
30
1,110
0
28 Jul 2023
YOLOv8 for Defect Inspection of Hexagonal Directed Self-Assembly Patterns: A Data-Centric Approach
Enrique Dehaerne
Bappaditya Dey
Hossein Esfandiar
L. Verstraete
H. Suh
S. Halder
S. de Gendt
27
13
0
28 Jul 2023
Simplified Concrete Dropout -- Improving the Generation of Attribution Masks for Fine-grained Classification
D. Korsch
M. Shadaydeh
Joachim Denzler
19
1
0
27 Jul 2023
Unified Adversarial Patch for Visible-Infrared Cross-modal Attacks in the Physical World
Xingxing Wei
Yao Huang
Yitong Sun
Jie Yu
AAML
42
14
0
27 Jul 2023
Car-Studio: Learning Car Radiance Fields from Single-View and Endless In-the-wild Images
Tianyu Liu
Hao Zhao
Yang Yu
Guyue Zhou
Ming Liu
39
3
0
26 Jul 2023
Tracking Anything in High Quality
Jiawen Zhu
Zhe Chen
Zeqi Hao
Shijie Chang
Lu Zhang
...
Bin Luo
Ju He
Jinpeng Lan
Hanyuan Chen
Chenyang Li
VOS
21
7
0
26 Jul 2023
AViT: Adapting Vision Transformers for Small Skin Lesion Segmentation Datasets
Siyi Du
Nourhan Bayasi
Ghassan Hamarneh
Rafeef Garbi
ViT
39
3
0
26 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
Fahad Shahbaz Khan
VLM
38
118
0
25 Jul 2023
Towards Unifying Anatomy Segmentation: Automated Generation of a Full-body CT Dataset via Knowledge Aggregation and Anatomical Guidelines
A. Jaus
C. Seibold
Kelsey Hermann
Alexandra Walter
K. Giske
Johannes Haubold
Jens Kleesiek
Rainer Stiefelhagen
42
19
0
25 Jul 2023
GraspGPT: Leveraging Semantic Knowledge from a Large Language Model for Task-Oriented Grasping
Chao Tang
Dehao Huang
Wenqiang Ge
Weiyu Liu
Hong Zhang
24
68
0
25 Jul 2023
RoboChop: Autonomous Framework for Fruit and Vegetable Chopping Leveraging Foundational Models
Atharva Dikshit
Alison Bartsch
Abraham George
A. Farimani
30
10
0
24 Jul 2023
Industrial Segment Anything -- a Case Study in Aircraft Manufacturing, Intralogistics, Maintenance, Repair, and Overhaul
Keno Moenck
Arne Wendt
Philipp Prünte
Julian Koch
Arne Sahrhage
...
Falko Kähler
Dirk Holst
Martin Gomse
Thorsten Schuppstuhl
Daniel Schoepflin
VLM
36
6
0
24 Jul 2023
Challenges for Monocular 6D Object Pose Estimation in Robotics
S. Thalhammer
Dominik Bauer
Peter Honig
Jean-Baptiste Weibel
José García-Rodríguez
Markus Vincze
46
24
0
22 Jul 2023
GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation Models
Jing Hao
Xinyu Li
Liang Gao
Shumin Han
VLM
DiffM
25
2
0
22 Jul 2023
Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Jiancang Ma
Junhao Liang
Chen Chen
H. Lu
31
138
0
21 Jul 2023
"Tidy Up the Table": Grounding Common-sense Objective for Tabletop Object Rearrangement
Yiqing Xu
David Hsu
LM&Ro
LMTD
34
0
0
21 Jul 2023
OBJECT 3DIT: Language-guided 3D-aware Image Editing
Oscar Michel
Anand Bhattad
Eli VanderBilt
Ranjay Krishna
Aniruddha Kembhavi
Tanmay Gupta
DiffM
32
39
0
20 Jul 2023
CNOS: A Strong Baseline for CAD-based Novel Object Segmentation
Van Nguyen Nguyen
Thibault Groueix
Georgy Ponimatkin
Vincent Lepetit
Tomás Hodan
3DPC
19
46
0
20 Jul 2023
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition
Weidong Chen
Xiaofen Xing
Peihao Chen
Xiangmin Xu
VLM
33
35
0
20 Jul 2023
Pre-train, Adapt and Detect: Multi-Task Adapter Tuning for Camouflaged Object Detection
Yinghui Xing
Dexuan Kong
Shizhou Zhang
Geng Chen
Lingyan Ran
Peng Wang
Yanning Zhang
48
4
0
20 Jul 2023
Boosting Federated Learning Convergence with Prototype Regularization
Yu Qiao
Huy Q. Le
Choong Seon Hong
FedML
32
6
0
20 Jul 2023
Interactive Segmentation for Diverse Gesture Types Without Context
Josh Myers-Dean
Yifei Fan
Brian L. Price
Wilson Chan
Danna Gurari
26
2
0
20 Jul 2023
PharmacyGPT: The AI Pharmacist
Zheng Liu
Zihao Wu
Mengxuan Hu
Bokai Zhao
Lin Zhao
...
Ye Shen
Sheng Li
Brian Murray
Tianming Liu
Andrea Sikora
LM&MA
AI4MH
45
0
0
19 Jul 2023
Divert More Attention to Vision-Language Object Tracking
Mingzhe Guo
Zhipeng Zhang
Li Jing
Haibin Ling
Heng Fan
VLM
42
3
0
19 Jul 2023
DVPT: Dynamic Visual Prompt Tuning of Large Pre-trained Models for Medical Image Analysis
Along He
Kai Wang
Zhihong Wang
Tao Li
Huazhu Fu
MedIm
45
3
0
19 Jul 2023
AnyDoor: Zero-shot Object-level Image Customization
Xi Chen
Lianghua Huang
Yu Liu
Yujun Shen
Deli Zhao
Hengshuang Zhao
DiffM
46
257
0
18 Jul 2023
RepViT: Revisiting Mobile CNN From ViT Perspective
Ao Wang
Hui Chen
Zijia Lin
Hengjun Pu
Guiguang Ding
34
178
0
18 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
31
32
0
18 Jul 2023
NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF
S. Lionar
Xiangyu Xu
Min Lin
G. Lee
3DV
3DGS
33
7
0
18 Jul 2023
Learning to Count without Annotations
Lukas Knobel
Tengda Han
Yuki M. Asano
SSL
37
2
0
17 Jul 2023
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT
Liangyu Zha
Junlin Zhou
Liyao Li
Rui Wang
Qingyi Huang
...
Xing-yan Deng
J. Xu
Haobo Wang
Gang Chen
Jun Zhao
RALM
LMTD
32
42
0
17 Jul 2023
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
Yang Zhao
Zhijie Lin
Daquan Zhou
Zilong Huang
Jiashi Feng
Bingyi Kang
MLLM
44
107
0
17 Jul 2023
Dense Affinity Matching for Few-Shot Segmentation
Hao Chen
Yonghan Dong
Zhe-Ming Lu
YunLong Yu
Yingming Li
Jungong Han
Zhongfei Zhang
44
8
0
17 Jul 2023
On Point Affiliation in Feature Upsampling
Wenze Liu
Hao Lu
Yuliang Liu
Zhiguo Cao
3DPC
26
2
0
17 Jul 2023