Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.02499
Cited By
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
5 December 2022
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Images Speak in Images: A Generalist Painter for In-Context Visual Learning"
50 / 197 papers shown
Title
Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation
Gabriele Rosi
Fabio Cermelli
VLM
32
0
0
06 May 2025
Grounding Task Assistance with Multimodal Cues from a Single Demonstration
Gabriel Sarch
Balasaravanan Thoravi Kumaravel
Sahithya Ravi
Vibhav Vineet
A. D. Wilson
55
0
0
02 May 2025
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
J. Wang
Tianci Luo
Yaohua Zha
Yan Feng
Ruisheng Luo
B. Chen
Tao Dai
Long Chen
Yaowei Wang
Shu-Tao Xia
VLM
50
0
0
30 Apr 2025
Neural network task specialization via domain constraining
Roman Malashin
Daniil Ilyukhin
49
0
0
28 Apr 2025
E-InMeMo: Enhanced Prompting for Visual In-Context Learning
Jiahao Zhang
Bowen Wang
Hong Liu
Liangzhi Li
Yuta Nakashima
Hajime Nagahara
VLM
99
0
0
25 Apr 2025
DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining
Wei Zhuo
Zhiyue Tang
Wufeng Xue
Hao Ding
Linlin Shen
25
0
0
22 Apr 2025
RefComp: A Reference-guided Unified Framework for Unpaired Point Cloud Completion
Yixuan Yang
Jinyu Yang
Zixiang Zhao
Victor Sanchez
Feng Zheng
27
0
0
18 Apr 2025
AdaQual-Diff: Diffusion-Based Image Restoration via Adaptive Quality Prompting
Xin Su
Chen Wu
Yu Zhang
Chen Lyu
Zhuoran Zheng
34
0
0
17 Apr 2025
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
Mengshi Qi
Pengfei Zhu
X. Li
Xiaoyang Bi
Lu Qi
Huadong Ma
Ming Yang
VOS
VLM
42
0
0
16 Apr 2025
Beyond Degradation Conditions: All-in-One Image Restoration via HOG Transformers
Jiawei Wu
Zhifei Yang
Z. Wang
Zhi Jin
17
0
0
12 Apr 2025
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
Zhong-Yu Li
Ruoyi Du
Juncheng Yan
Le Zhuo
Zhen Li
Peng Gao
Zhanyu Ma
Ming-Ming Cheng
VLM
68
2
0
10 Apr 2025
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
DiffM
VOS
43
1
0
07 Apr 2025
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Yuandong Pu
Le Zhuo
Kaiwen Zhu
Liangbin Xie
Wenlong Zhang
Xiangyu Chen
Peng Gao
Yu Qiao
Chao Dong
Yihao Liu
MLLM
59
1
0
07 Apr 2025
Test-Time Visual In-Context Tuning
Jiahao Xie
A. Tonioni
N. Rauschmayr
F. Tombari
Bernt Schiele
OOD
VLM
52
0
0
27 Mar 2025
PAVE: Patching and Adapting Video Large Language Models
Zhuoming Liu
Yiquan Li
Khoi Duc Nguyen
Yiwu Zhong
Yin Li
KELM
LRM
79
0
0
25 Mar 2025
Show and Segment: Universal Medical Image Segmentation via In-Context Learning
Yunhe Gao
Di Liu
Zhuowei Li
Y. Li
Dongdong Chen
Mu Zhou
Dimitris N. Metaxas
VLM
43
0
0
25 Mar 2025
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Lan Chen
Qi Mao
Yuchao Gu
Mike Zheng Shou
45
1
0
17 Mar 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
Yijing Lin
Mengqi Huang
Shuhan Zhuang
Zhendong Mao
VGen
41
0
0
13 Mar 2025
Underlying Semantic Diffusion for Effective and Efficient In-Context Learning
Zhong Ji
Weilong Cao
Yan Zhang
Yanwei Pang
Jungong Han
X. Li
DiffM
VLM
37
0
0
06 Mar 2025
Building 3D In-Context Learning Universal Model in Neuroimaging
Jiesi Hu
Hanyang Peng
Yanwu Yang
Xutao Guo
Yang Shang
P. Shi
Chenfei Ye
Ting Ma
62
0
0
04 Mar 2025
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang
Chenwei Xie
Haiyang Wang
Xiaoyi Bao
Tingyu Weng
Pandeng Li
Yun Zheng
Liwei Wang
ObjD
VLM
54
0
0
03 Mar 2025
Synthetic data enables context-aware bioacoustic sound event detection
Benjamin Hoffman
David Robinson
Marius Miron
V. Baglione
D. Canestrari
Damian Elias
Eva Trapote
Olivier Pietquin
32
0
0
01 Mar 2025
Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks
Alessio Quercia
Erenus Yildiz
Zhuo Cao
Kai Krajsek
Abigail Morrison
Ira Assent
Hanno Scharr
45
0
0
22 Jan 2025
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks
Michael Schwingshackl
Fabio Francisco Oberweger
Markus Murschitz
44
1
0
20 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLM
VLM
LRM
91
45
0
03 Jan 2025
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
46
1
0
29 Dec 2024
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
69
2
0
16 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
65
0
0
14 Dec 2024
LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents
Bingchen Li
Xin Li
Yiting Lu
Zhibo Chen
78
1
0
05 Dec 2024
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Bolin Lai
F. Xu
Miao Liu
Xiaoliang Dai
Nikhil Mehta
...
Zeyi Huang
James M. Rehg
Sangmin Lee
Ning Zhang
Tong Xiao
71
2
0
02 Dec 2024
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLM
VLM
3DV
80
1
0
29 Nov 2024
LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair
Xue Song
Jiequan Cui
H. Zhang
Jiaxin Shi
Jingjing Chen
Chi Zhang
Yu-Gang Jiang
83
0
0
28 Nov 2024
Adaptive Blind All-in-One Image Restoration
David Serrano-Lozano
Luis Herranz
Shaolin Su
Javier Vázquez-Corral
VLM
92
0
0
27 Nov 2024
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
Feifei Shao
Ping Liu
Zhao Wang
Yawei Luo
Hongwei Wang
Jun Xiao
3DPC
64
0
0
25 Nov 2024
Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain
Hangyul Yoon
Doohyuk Jang
JungEun Kim
Eunho Yang
VLM
MedIm
65
0
0
25 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Steven G. McDonagh
Elliot J. Crowley
VLM
63
0
0
22 Nov 2024
LaVin-DiT: Large Vision Diffusion Transformer
Zhaoqing Wang
Xiaobo Xia
Runnan Chen
Dongdong Yu
Changhu Wang
M. Gong
Tongliang Liu
92
6
0
18 Nov 2024
AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations
J. Mao
Y. Yang
Xuesong Yin
Ling Shao
Hao Tang
31
0
0
16 Nov 2024
All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model
Yuanbo Wen
Tao Gao
Ziqi Li
Jing Zhang
Kaihao Zhang
Ting Chen
VLM
DiffM
29
0
0
12 Nov 2024
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps
Xue Xia
Daiwei Zhang
Wenxuan Song
Wei Huang
L. Hurni
AI4TS
VLM
16
0
0
11 Nov 2024
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
Ashutosh Srivastava
Tarun Ram Menta
Abhinav Java
Avadhoot Jadhav
Silky Singh
Surgan Jandial
Balaji Krishnamurthy
DiffM
30
1
0
06 Nov 2024
Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective
Shenghao Xie
Wenqiang Zu
Mingyang Zhao
Duo Su
Shilong Liu
Ruohua Shi
Guoqi Li
Shanghang Zhang
Lei Ma
LRM
40
3
0
29 Oct 2024
LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration
Yuang Ai
Huaibo Huang
Ran He
28
2
0
20 Oct 2024
A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
Junjun Jiang
Zengyuan Zuo
Gang Wu
Kui Jiang
Xianming Liu
43
10
0
19 Oct 2024
EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training
Tongtian Yue
Shuning Xue
Xuange Gao
Yepeng Tang
Longteng Guo
Jie Jiang
J. Liu
21
3
0
14 Oct 2024
Bridge the Points: Graph-based Few-shot Segment Anything Semantically
Anqi Zhang
Guangyu Gao
Jianbo Jiao
C. Liu
Yunchao Wei
VLM
29
3
0
09 Oct 2024
A Simple Image Segmentation Framework via In-Context Examples
Yang Liu
Chenchen Jing
Hengtao Li
Muzhi Zhu
Hao Chen
Xinlong Wang
Chunhua Shen
25
6
0
07 Oct 2024
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
Muzhi Zhu
Yang Liu
Zekai Luo
Chenchen Jing
Hao Chen
Guangkai Xu
Xinlong Wang
Chunhua Shen
DiffM
VLM
29
3
0
03 Oct 2024
Uni
2
^2
2
Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection
Yubin Wang
Zhikang Zou
Xiaoqing Ye
Xiao Tan
Errui Ding
Cairong Zhao
23
0
0
30 Sep 2024
Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration
Chu-Jie Qin
Rui-Qi Wu
Zikun Liu
Xin Lin
Chun-Le Guo
Hyun Hee Park
Chongyi Li
20
6
0
28 Sep 2024
1
2
3
4
Next