Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.00647
Cited By
Visual Prompting via Image Inpainting
1 September 2022
Amir Bar
Yossi Gandelsman
Trevor Darrell
Amir Globerson
Alexei A. Efros
VLM
VPVLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Visual Prompting via Image Inpainting"
48 / 48 papers shown
Title
E-InMeMo: Enhanced Prompting for Visual In-Context Learning
Jiahao Zhang
Bowen Wang
Hong Liu
Liangzhi Li
Yuta Nakashima
Hajime Nagahara
VLM
104
0
0
25 Apr 2025
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
Mengshi Qi
Pengfei Zhu
Xianrui Li
Xiaoyang Bi
Lu Qi
Huadong Ma
Ming Yang
VOS
VLM
51
0
0
16 Apr 2025
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Yikun Ma
Yiqing Li
Jiawei Wu
Xing Luo
Zhi Jin
DiffM
VGen
65
0
0
22 Mar 2025
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration
Kang Liao
Zongsheng Yue
Zhouxia Wang
Chen Change Loy
95
3
0
20 Feb 2025
Differentiable Prompt Learning for Vision Language Models
Zhenhan Huang
Tejaswini Pedapati
Pin-Yu Chen
Jianxi Gao
VLM
28
0
0
03 Jan 2025
LaVin-DiT: Large Vision Diffusion Transformer
Zhaoqing Wang
Xiaobo Xia
Runnan Chen
Dongdong Yu
Changhu Wang
Mingming Gong
Tongliang Liu
92
6
0
18 Nov 2024
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration
Xu Zhang
Jiaqi Ma
Guoli Wang
Q. Zhang
Huan Zhang
Lefei Zhang
VLM
99
6
0
28 Aug 2024
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Seung Hyun Lee
Junjie Ke
Yinxiao Li
Junfeng He
Steven Hickson
...
Irfan Essa
Sangpil Kim
Ming-Hsuan Yang
Irfan Essa
Feng Yang
VLM
49
0
0
14 Aug 2024
GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM
Keshav Bimbraw
Ye Wang
Jing Liu
T. Koike-Akino
VLM
MedIm
LM&MA
40
1
0
15 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
47
3
0
10 Jul 2024
Unsupervised Meta-Learning via In-Context Learning
Anna Vettoruzzo
Lorenzo Braccaioli
Joaquin Vanschoren
M. Nowaczyk
SSL
64
0
0
25 May 2024
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Jiaxin Zhang
Dezhi Peng
Chongyu Liu
Peirong Zhang
Lianwen Jin
VLM
40
12
0
07 May 2024
DesignProbe: A Graphic Design Benchmark for Multimodal Large Language Models
Jieru Lin
Danqing Huang
Tiejun Zhao
Dechen Zhan
Chin-Yew Lin
VLM
MLLM
35
3
0
23 Apr 2024
Roadside Monocular 3D Detection via 2D Detection Prompting
Yechi Ma
Shuoquan Wei
Churun Zhang
Wei Hua
Yanan Li
Shu Kong
51
0
0
01 Apr 2024
In-Context Matting
He Guo
Zixuan Ye
Zhiguo Cao
Hao Lu
VOS
31
0
0
23 Mar 2024
OSTAF: A One-Shot Tuning Method for Improved Attribute-Focused T2I Personalization
Ye Wang
Zili Yi
Rui Ma
DiffM
36
0
0
17 Mar 2024
Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity
Zhuo Zhi
Ziquan Liu
M. Elbadawi
Adam Daneshmend
Mine Orlu
Abdul Basit
Andreas Demosthenous
Miguel R. D. Rodrigues
36
2
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
63
6
0
14 Mar 2024
VRP-SAM: SAM with Visual Reference Prompt
Yanpeng Sun
Jiahui Chen
Shan Zhang
Xinyu Zhang
Qiang Chen
Gang Zhang
Errui Ding
Jingdong Wang
Zechao Li
52
31
0
27 Feb 2024
Data-efficient Large Vision Models through Sequential Autoregression
Jianyuan Guo
Zhiwei Hao
Chengcheng Wang
Yehui Tang
Han Wu
Han Hu
Kai Han
Chang Xu
VLM
38
10
0
07 Feb 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
46
11
0
18 Jan 2024
Low-Resource Vision Challenges for Foundation Models
Yunhua Zhang
Hazel Doughty
Cees G. M. Snoek
VLM
30
5
0
09 Jan 2024
Adaptive Human Trajectory Prediction via Latent Corridors
Neerja Thakkar
K. Mangalam
Andrea V. Bajcsy
Jitendra Malik
22
4
0
11 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
50
63
0
11 Dec 2023
Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large Language Models
Bingshuai Liu
Chenyang Lyu
Zijun Min
Zhanyu Wang
Jinsong Su
Longyue Wang
LRM
31
7
0
04 Dec 2023
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation
Rongyao Fang
Shilin Yan
Zhaoyang Huang
Jingqiu Zhou
Hao Tian
Jifeng Dai
Hongsheng Li
MLLM
45
8
0
30 Nov 2023
Unifying Image Processing as Visual Prompting Question Answering
Yihao Liu
Xiangyu Chen
Xianzheng Ma
Xintao Wang
Jiantao Zhou
Yu Qiao
Chao Dong
MLLM
22
18
0
16 Oct 2023
SAIR: Learning Semantic-aware Implicit Representation
Canyu Zhang
Xiaoguang Li
Qing-Wu Guo
Song Wang
36
3
0
13 Oct 2023
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Henry Hengyuan Zhao
Pichao Wang
Yuyang Zhao
Hao Luo
F. Wang
Mike Zheng Shou
ViT
34
14
0
15 Sep 2023
Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models
Baoshuo Kan
Teng Wang
Wenpeng Lu
Xiantong Zhen
Weili Guan
Feng Zheng
VPVLM
VLM
28
25
0
22 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
37
33
0
02 Aug 2023
Explicit Visual Prompting for Universal Foreground Segmentations
Weihuang Liu
Xi Shen
Chi-Man Pun
Xiaodong Cun
VPVLM
VLM
38
14
0
29 May 2023
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas L. Griffiths
N. Jha
LRM
MLLM
32
1
0
26 May 2023
Segment Anything in Non-Euclidean Domains: Challenges and Opportunities
Yongcheng Jing
Xinchao Wang
Dacheng Tao
48
21
0
23 Apr 2023
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOS
MLLM
VLM
35
199
0
06 Apr 2023
Text-to-Image Diffusion Models are Zero-Shot Classifiers
Kevin Clark
P. Jaini
DiffM
VLM
32
107
0
27 Mar 2023
What Makes Good Examples for Visual In-Context Learning?
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
MLLM
VPVLM
VLM
LRM
24
107
0
31 Jan 2023
Understanding Zero-Shot Adversarial Robustness for Large-Scale Models
Chengzhi Mao
Scott Geng
Junfeng Yang
Xin Eric Wang
Carl Vondrick
VLM
44
59
0
14 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM
MLLM
66
244
0
05 Dec 2022
Understanding and Improving Visual Prompting: A Label-Mapping Perspective
Aochuan Chen
Yuguang Yao
Pin-Yu Chen
Yihua Zhang
Sijia Liu
VPVLM
VLM
41
75
0
21 Nov 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
Yao Lu
Max Bartolo
Alastair Moore
Sebastian Riedel
Pontus Stenetorp
AILaw
LRM
279
1,124
0
18 Apr 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
280
3,848
0
18 Apr 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,781
0
24 Feb 2021
Frustratingly Simple Few-Shot Object Detection
Xin Wang
Thomas E. Huang
Trevor Darrell
Joseph E. Gonzalez
F. I. F. Richard Yu
ObjD
95
544
0
16 Mar 2020
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
267
3,375
0
09 Mar 2020
Image Inpainting for Irregular Holes Using Partial Convolutions
Guilin Liu
F. Reda
Kevin J. Shih
Ting-Chun Wang
Andrew Tao
Bryan Catanzaro
142
1,913
0
20 Apr 2018
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
296
39,198
0
01 Sep 2014
1