ResearchTrend.AI
Inpaint Anything: Segment Anything Meets Image Inpainting


13 April 2023
Tao Yu
Runsen Feng
Ruoyu Feng
Jinming Liu
Xin Jin
Wenjun Zeng
Zhibo Chen
    DiffM

Papers citing "Inpaint Anything: Segment Anything Meets Image Inpainting"

50 / 160 papers shown
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Xi Yang
Songsong Duan
Nannan Wang
Xinbo Gao
WSOL
71
0
0
08 May 2025
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
61
0
0
08 May 2025
Corner Cases: How Size and Position of Objects Challenge ImageNet-Trained Models
Mishal Fatima
Steffen Jung
M. Keuper
31
0
0
06 May 2025
CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting
Huawei Sun
Bora Kunter Sahin
Georg Stettinger
Maximilian Bernhard
Matthias Schubert
Robert Wille
39
0
0
06 May 2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li
Xin Gu
Fan Chen
X. Xing
Longyin Wen
C. L. P. Chen
Sijie Zhu
DiffM
77
1
0
05 May 2025
Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models
Minh-Hao Van
Xintao Wu
VLM
81
0
0
30 Apr 2025
Generative Semantic Communications: Principles and Practices
Xiaojun Yuan
Haoming Ma
Yinuo Huang
Zhoufan Hua
Yong Zuo
Z. Ding
AI4CE
22
0
0
21 Apr 2025
ARAP-GS: Drag-driven As-Rigid-As-Possible 3D Gaussian Splatting Editing with Diffusion Prior
Xiao Han
RunZe Tian
Yifei Tong
Fenggen Yu
Dingyao Liu
Yan Zhang
3DGS
31
0
0
17 Apr 2025
Mask Image Watermarking
Runyi Hu
Jie Zhang
Shiqian Zhao
Nils Lukas
Jiwei Li
Qing-Wu Guo
Han Qiu
Tianwei Zhang
20
0
0
17 Apr 2025
Towards Explainable Partial-AIGC Image Quality Assessment
Jiaying Qian
Ziheng Jia
Zicheng Zhang
Zeyu Zhang
Guangtao Zhai
Xiongkuo Min
35
0
0
12 Apr 2025
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
Jiahuan Long
Zhengqin Xu
Tingsong Jiang
Wen Yao
Shuai Jia
Chao Ma
Xiaoqian Chen
AAML
VLM
29
1
0
11 Apr 2025
Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models
Jiahuan Long
Tingsong Jiang
Wen Yao
Yizhe Xiong
Zhengqin Xu
Shuai Jia
Chao Ma
19
0
0
11 Apr 2025
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment
Jiayang Sun
H. Wang
Jie Cao
Huaibo Huang
R. He
DiffM
68
0
0
10 Apr 2025
DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis
Yongjin Choi
Chanhun Park
Seung Jun Baek
DiffM
46
0
0
22 Mar 2025
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Yingmao Miao
Zhanpeng Huang
Rui Han
Zibin Wang
Chenhao Lin
Chao Shen
DiffM
44
0
0
20 Mar 2025
How to Train Your Dragon: Automatic Diffusion-Based Rigging for Characters with Diverse Topologies
Zeqi Gu
Difan Liu
Timothy Langlois
Matthew Fisher
Abe Davis
DiffM
3DH
60
0
0
19 Mar 2025
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Arsh Koneru
Yusuke Kato
Kazuki Kozuka
Aditya Grover
VLM
56
1
0
15 Mar 2025
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption
Joonsung Jeon
Woo Jae Kim
Suhyeon Ha
Sooel Son
Sung-eui Yoon
DiffM
AAML
54
0
0
13 Mar 2025
ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models
Ying Zhang
Maoliang Yin
Wenfu Bi
Haibao Yan
Shaohan Bian
Cui-Hua Zhang
C. Hua
73
2
0
05 Feb 2025
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas
Matthew Shreve
Xuelu Li
Prateek Singhal
Kaushik Roy
DiffM
36
1
0
20 Jan 2025
SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Ronghua Liang
DiffM
60
0
0
03 Jan 2025
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
S. Park
Subeen Lee
Hyun Seok Seong
Jaejoon Yoo
Jae-Pil Heo
32
1
0
03 Jan 2025
Texture- and Shape-based Adversarial Attacks for Vehicle Detection in Synthetic Overhead Imagery
Mikael Yeghiazaryan
Sai Abhishek Siddhartha Namburu
Emily Kim
Stanislav Panev
Celso de Melo
Brent Lance
Fernando De la Torre
Jessica K. Hodgins
AAML
70
0
0
20 Dec 2024
Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large Vision-Language Model via Causality Analysis
Po-Hsuan Huang
Jeng-Lin Li
Chin-Po Chen
Ming-Ching Chang
Wei-Chao Chen
LRM
72
1
0
04 Dec 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Steven G. McDonagh
Elliot J. Crowley
VLM
63
0
0
22 Nov 2024
ColorEdit: Training-free Image-Guided Color editing with diffusion model
Xingxi Yin
Zhi Li
Jingfeng Zhang
Chenglin Li
Yin Zhang
DiffM
47
0
0
15 Nov 2024
ZIM: Zero-Shot Image Matting for Anything
Beomyoung Kim
Chanyong Shin
Joonhyun Jeong
Hyungsik Jung
Se Yun Lee
Sewhan Chun
Dong-Hyun Hwang
Joonsang Yu
VLM
26
2
0
01 Nov 2024
BIFRÖST: 3D-Aware Image compositing with Language Instructions
Lingxiao Li
Kaixiong Gong
Weihong Li
Xili Dai
Tao Chen
Xiaojun Yuan
Xiangyu Yue
24
2
0
24 Oct 2024
RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models
Haoran Hao
Jiaming Han
Changsheng Li
Yu-Feng Li
Xiangyu Yue
RALM
46
1
0
17 Oct 2024
Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images
Virmarie Maquiling
Sean Anthony Byrne
D. Niehorster
Marco Carminati
Enkelejda Kasneci
VLM
45
0
0
11 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
Xiaorui Sun
J. Liu
H. Shen
Xiaofeng Zhu
Ping Hu
VLM
43
4
0
07 Oct 2024
Run-time Observation Interventions Make Vision-Language-Action Models More Visually Robust
Asher Hancock
Allen Z. Ren
Anirudha Majumdar
VLM
28
1
0
02 Oct 2024
A Survey on Diffusion Models for Inverse Problems
Giannis Daras
Hyungjin Chung
Chieh-Hsin Lai
Yuki Mitsufuji
Jong Chul Ye
P. Milanfar
Alexandros G. Dimakis
M. Delbracio
MedIm
39
31
0
30 Sep 2024
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation
Shaowei Liu
Zhongzheng Ren
Saurabh Gupta
Shenlong Wang
VGen
DiffM
PINN
39
33
0
27 Sep 2024
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
Jinghao Zhang
Wen Qian
Hao Luo
Fan Wang
Feng Zhao
DiffM
26
0
0
26 Sep 2024
SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image
Dimitrije Antić
Sai Kumar Dwivedi
Shashank Tripathi
Theo Gevers
Dimitrios Tzionas
49
2
0
24 Sep 2024
Towards Generalizable Scene Change Detection
Jaewoo Kim
Uehwan Kim
38
0
0
10 Sep 2024
Thinking Outside the BBox: Unconstrained Generative Object Compositing
Gemma Canet Tarrés
Zhe Lin
Zhifei Zhang
Jianming Zhang
Yizhi Song
Dan Ruta
Andrew Gilbert
John Collomosse
Soo Ye Kim
DiffM
33
9
0
06 Sep 2024
Unveiling Context-Related Anomalies: Knowledge Graph Empowered Decoupling of Scene and Action for Human-Related Video Anomaly Detection
Chenglizhao Chen
Xinyu Liu
Mengke Song
Luming Li
Xu Yu
Shanchen Pang
31
0
0
05 Sep 2024
Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment
Liyao Tang
Zhe Chen
Shanshan Zhao
Chaoyue Wang
Dacheng Tao
32
0
0
29 Aug 2024
Learning Instruction-Guided Manipulation Affordance via Large Models for Embodied Robotic Tasks
Dayou Li
Chenkun Zhao
Shuo Yang
Lin Ma
Yibin Li
Wei Zhang
LM&Ro
25
1
0
20 Aug 2024
GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation
Weiming Zhang
Yexin Liu
Xu Zheng
Lin Wang
VLM
42
5
0
17 Aug 2024
Comparative Analysis of Generative Models: Enhancing Image Synthesis with VAEs, GANs, and Stable Diffusion
Sanchayan Vivekananthan
DiffM
24
8
0
16 Aug 2024
EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
Ruei-Che Chang
Yuxuan Liu
Lotus Zhang
Anhong Guo
DiffM
21
2
0
13 Aug 2024
A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot
Haoxuan Ding
Qi Wang
Junyu Gao
Qiang Li
VLM
37
0
0
11 Aug 2024
MaskAnyone Toolkit: Offering Strategies for Minimizing Privacy Risks and Maximizing Utility in Audio-Visual Data Archiving
B. Owoyele
Martin Schilling
Rohan Sawahn
Niklas Kaemer
Pavel Zherebenkov
Bhuvanesh Verma
Wim Pouw
Gerard de Melo
20
0
0
06 Aug 2024
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
29
0
0
30 Jul 2024
DragText: Rethinking Text Embedding in Point-based Image Editing
Gayoon Choi
Taejin Jeong
Sujung Hong
Jaehoon Joo
Seong Jae Hwang
DiffM
41
0
0
25 Jul 2024
Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions
Kai-Chun Liu
Zhihang Fu
Chao Chen
Sheng Jin
Ze Chen
Mingyuan Tao
Rongxin Jiang
Jieping Ye
VLM
OODD
54
4
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
58
1
0
23 Jul 2024