ResearchTrend.AI

Images Speak in Images: A Generalist Painter for In-Context Visual Learning

5 December 2022
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
    VLM
    MLLM

Papers citing "Images Speak in Images: A Generalist Painter for In-Context Visual Learning"

47 / 197 papers shown
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Jianwei Yang
Hao Zhang
Feng Li
Xueyan Zou
Chunyuan Li
Jianfeng Gao
MLLM
VLM
16
152
0
17 Oct 2023
Context-Aware Meta-Learning
Christopher Fifty
Dennis Duan
Ronald G. Junkins
Ehsan Amid
Jurij Leskovec
Christopher Ré
Sebastian Thrun
LRM
VLM
MLLM
25
9
0
17 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
Lv Tang
Peng-Tao Jiang
Haoke Xiao
Bo Li
VLM
6
7
0
17 Oct 2023
Unifying Image Processing as Visual Prompting Question Answering
Yihao Liu
Xiangyu Chen
Xianzheng Ma
Xintao Wang
Jiantao Zhou
Yu Qiao
Chao Dong
MLLM
22
18
0
16 Oct 2023
Lightweight In-Context Tuning for Multimodal Unified Models
Yixin Chen
Shuai Zhang
Boran Han
Jiaya Jia
11
2
0
08 Oct 2023
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
Yulu Gan
Sungwoo Park
Alexander Schubert
Anthony Philippakis
Ahmed Alaa
VLM
17
21
0
30 Sep 2023
Visual In-Context Learning for Few-Shot Eczema Segmentation
Monitirtha Dey
S. K. Bhandari
Venugopal Vasudevan
12
1
0
28 Sep 2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng
Binxin Yang
Tiankai Hang
Chen Li
Shuyang Gu
...
Jianmin Bao
Zheng-Wei Zhang
Han Hu
Dongdong Chen
Baining Guo
DiffM
VLM
38
92
0
07 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image Modeling
Qi Han
Yuxuan Cai
Xiangyu Zhang
25
7
0
02 Sep 2023
Mobile Foundation Model as Firmware
Jinliang Yuan
Chenchen Yang
Dongqi Cai
Shihe Wang
Xin Yuan
...
Di Zhang
Hanzi Mei
Xianqing Jia
Shangguang Wang
Mengwei Xu
30
19
0
28 Aug 2023
UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
Meiqi Sun
Zhonghan Zhao
Wenhao Chai
Hanjun Luo
Shidong Cao
Yanting Zhang
Jenq-Neng Hwang
Gaoang Wang
17
7
0
19 Aug 2023
Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training
Xiaoyang Wu
Zhuotao Tian
Xin Wen
Bohao Peng
Xihui Liu
Kaicheng Yu
Hengshuang Zhao
19
45
0
18 Aug 2023
Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey
Xin Li
Yulin Ren
Xin Jin
Cuiling Lan
X. Wang
Wenjun Zeng
Xinchao Wang
Zhibo Chen
39
83
0
18 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
27
33
0
02 Aug 2023
Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen
Yuheng Li
Utkarsh Ojha
Yong Jae Lee
DiffM
19
22
0
26 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
F. Khan
VLM
18
116
0
25 Jul 2023
Segment Anything Meets Point Tracking
Frano Rajič
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
Martin Danelljan
Fisher Yu
VLM
VOS
33
74
0
03 Jul 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Bernard Ghanem
Dacheng Tao
ObjD
VLM
27
134
0
28 Jun 2023
ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
Jiaqi Ma
Tianheng Cheng
Guoli Wang
Qian Zhang
Xinggang Wang
L. Zhang
DiffM
VLM
6
43
0
23 Jun 2023
Robustness Analysis on Foundational Segmentation Models
Madeline Chantry Schiappa
Shehreen Azad
V. Sachidanand
Yunhao Ge
O. Mikšík
Y. S. Rawat
Vibhav Vineet
OOD
VLM
AAML
9
5
0
15 Jun 2023
Explore In-Context Learning for 3D Point Cloud Understanding
Zhongbin Fang
Xiangtai Li
Xia Li
J. M. Buhmann
Chen Change Loy
Mengyuan Liu
3DPC
11
24
0
14 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
21
6
0
14 Jun 2023
Unifying (Machine) Vision via Counterfactual World Modeling
Daniel M. Bear
Kevin T. Feigelis
Honglin Chen
Wanhee Lee
R. Venkatesh
Klemen Kotar
Alex Durango
Daniel L. K. Yamins
VGen
18
12
0
02 Jun 2023
Towards In-context Scene Understanding
Ivana Balazevic
David Steiner
Nikhil Parthasarathy
Relja Arandjelović
Olivier J. Hénaff
15
28
0
02 Jun 2023
Explicit Visual Prompting for Universal Foreground Segmentations
Weihuang Liu
Xi Shen
Chi-Man Pun
Xiaodong Cun
VPVLM
VLM
22
14
0
29 May 2023
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Yang Liu
Muzhi Zhu
Hengtao Li
Hao Chen
Xinlong Wang
Chunhua Shen
VLM
MLLM
86
82
0
22 May 2023
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Wen Wang
Zhe Chen
Xiaokang Chen
Jiannan Wu
Xizhou Zhu
...
Ping Luo
Tong Lu
Jie Zhou
Yu Qiao
Jifeng Dai
MLLM
VLM
22
449
0
18 May 2023
One-Prompt to Segment All Medical Images
Junde Wu
Jiayuan Zhu
Yueming Jin
Min Xu
VLM
MedIm
12
28
0
17 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
32
89
0
14 May 2023
Visual Tuning
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
39
37
0
10 May 2023
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review
Guangliang Cheng
Yun-Min Huang
Xiangtai Li
Shuchang Lyu
Zhaoyang Xu
Qi Zhao
Shiming Xiang
20
68
0
09 May 2023
Personalize Segment Anything Model with One Shot
Renrui Zhang
Zhengkai Jiang
Ziyu Guo
Shilin Yan
Junting Pan
Xianzheng Ma
Hao Dong
Peng Gao
Hongsheng Li
MLLM
VLM
23
206
0
04 May 2023
In-Context Learning Unlocked for Diffusion Models
Zhendong Wang
Yifan Jiang
Yadong Lu
Yelong Shen
Pengcheng He
Weizhu Chen
Zhangyang Wang
Mingyuan Zhou
VLM
DiffM
86
68
0
01 May 2023
Segment Everything Everywhere All at Once
Xueyan Zou
Jianwei Yang
Hao Zhang
Feng Li
Linjie Li
Jianfeng Wang
Lijuan Wang
Jianfeng Gao
Yong Jae Lee
MLLM
VLM
9
453
0
13 Apr 2023
UniverSeg: Universal Medical Image Segmentation
V. Butoi
Jose Javier Gonzalez Ortiz
Tianyu Ma
M. Sabuncu
John Guttag
Adrian V. Dalca
17
68
0
12 Apr 2023
Few Shot Semantic Segmentation: a review of methodologies, benchmarks, and open challenges
Nicolás Catalano
Matteo Matteucci
VLM
19
3
0
12 Apr 2023
Exploring Effective Factors for Improving Visual In-Context Learning
Yanpeng Sun
Qiang Chen
Jian Wang
Jingdong Wang
Zechao Li
LRM
VLM
41
24
0
10 Apr 2023
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOS
MLLM
VLM
11
198
0
06 Apr 2023
Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao
Ji Lin
Song Han
27
66
0
09 Feb 2023
What Makes Good Examples for Visual In-Context Learning?
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
MLLM
VPVLM
VLM
LRM
8
107
0
31 Jan 2023
A Survey on In-context Learning
Qingxiu Dong
Lei Li
Damai Dai
Ce Zheng
Jingyuan Ma
...
Zhiyong Wu
Baobao Chang
Xu Sun
Lei Li
Zhifang Sui
ReLM
AIMat
20
443
0
31 Dec 2022
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
Alexander Kolesnikov
André Susano Pinto
Lucas Beyer
Xiaohua Zhai
Jeremiah Harmsen
N. Houlsby
103
67
0
20 May 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Pix2seq: A Language Modeling Framework for Object Detection
Ting Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
233
341
0
22 Sep 2021
Multi-Stage Progressive Image Restoration
Syed Waqas Zamir
Aditya Arora
Salman Khan
Munawar Hayat
F. Khan
Ming-Hsuan Yang
Ling Shao
119
1,420
0
04 Feb 2021
Deep Joint Rain Detection and Removal from a Single Image
Wenhan Yang
R. Tan
Jiashi Feng
Jiaying Liu
Zongming Guo
Shuicheng Yan
128
987
0
25 Sep 2016
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
249
1,817
0
18 Aug 2016