Images Speak in Images: A Generalist Painter for In-Context Visual Learning

5 December 2022
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
    VLM
    MLLM
arXiv: 2212.02499

Papers citing "Images Speak in Images: A Generalist Painter for In-Context Visual Learning"

50 / 197 papers shown
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
50
10
0
23 Sep 2024
OmniGen: Unified Image Generation
Shitao Xiao
Yueze Wang
Junjie Zhou
Huaying Yuan
Xingrun Xing
Ruiran Yan
Shuting Wang
Tiejun Huang
Zheng Liu
DiffM
VLM
SyDa
50
61
0
17 Sep 2024
Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution
Marga Don
Stijn Pinson
Blanca Guillen Cebrian
Yuki M. Asano
16
0
0
05 Sep 2024
AWRaCLe: All-Weather Image Restoration using Visual In-Context Learning
Sudarshan Rajagopalan
Vishal M. Patel
24
3
0
30 Aug 2024
A Simple and Generalist Approach for Panoptic Segmentation
Nedyalko Prisadnikov
Wouter Van Gansbeke
Danda Pani Paudel
Luc Van Gool
VLM
35
0
0
29 Aug 2024
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
40
4
0
23 Aug 2024
Learning A Low-Level Vision Generalist via Visual Task Prompt
Xiangyu Chen
Yihao Liu
Yuandong Pu
Wenlong Zhang
Jiantao Zhou
Yu Qiao
Chao Dong
VLM
23
4
0
16 Aug 2024
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Seung Hyun Lee
Junjie Ke
Yinxiao Li
Junfeng He
Steven Hickson
...
Sangpil Kim
Ming-Hsuan Yang
Irfan Essa
Feng Yang
VLM
39
0
0
14 Aug 2024
One Shot is Enough for Sequential Infrared Small Target Segmentation
Bingbing Dan
Meihui Li
Tao Tang
Jing Zhang
18
0
0
09 Aug 2024
Path-SAM2: Transfer SAM2 for digital pathology semantic segmentation
Mingya Zhang
Liang Wang
Zhihao Chen
Yiyuan Ge
Xianping Tao
VLM
MedIm
24
2
0
07 Aug 2024
Medical SAM 2: Segment medical images as video via Segment Anything Model 2
Jiayuan Zhu
Yunli Qi
A. El Abbadi
VLM
MedIm
29
64
0
01 Aug 2024
Unified-EGformer: Exposure Guided Lightweight Transformer for Mixed-Exposure Image Enhancement
Eashan Adhikarla
Kai Zhang
Rosaura G. VidalMata
Manjushree B. Aithal
Nikhil Ambha Madhusudhana
John Nicholson
Lichao Sun
Brian D. Davison
30
2
0
18 Jul 2024
Efficient In-Context Medical Segmentation with Meta-driven Visual Prompt Selection
Chenwei Wu
David Restrepo
Zitao Shuai
Zhongming Liu
Liyue Shen
VLM
33
1
0
15 Jul 2024
GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM
Keshav Bimbraw
Ye Wang
Jing Liu
T. Koike-Akino
VLM
MedIm
LM&MA
27
1
0
15 Jul 2024
Visual Prompt Selection for In-Context Learning Segmentation
Wei Suo
Lanqing Lai
Mengyang Sun
Hanwang Zhang
Peng Wang
Yanning Zhang
VLM
27
3
0
14 Jul 2024
DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation
Xinyu Liu
Wuyang Li
Yixuan Yuan
MedIm
16
7
0
13 Jul 2024
DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding
Jincen Jiang
Qianyu Zhou
Yuhang Li
Xuequan Lu
Meili Wang
Lizhuang Ma
Jian Chang
Jian Jun Zhang
OOD
44
12
0
11 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
32
3
0
10 Jul 2024
Toward a Diffusion-Based Generalist for Dense Vision Tasks
Yue Fan
Yongqin Xian
Xiaohua Zhai
Alexander Kolesnikov
Muhammad Ferjad Naeem
Bernt Schiele
Federico Tombari
VLM
MDE
DiffM
29
1
0
29 Jun 2024
Wavelets Are All You Need for Autoregressive Image Generation
Wael Mattar
Idan Levy
Nir Sharon
S. Dekel
30
3
0
28 Jun 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang
Xiangtai Li
Hao Fei
Haobo Yuan
Shengqiong Wu
Shunping Ji
Chen Change Loy
Shuicheng Yan
LRM
MLLM
VLM
47
44
0
27 Jun 2024
ConStyle v2: A Strong Prompter for All-in-One Image Restoration
Dongqi Fan
Junhao Zhang
Liang Chang
VLM
29
2
0
26 Jun 2024
PIG: Prompt Images Guidance for Night-Time Scene Parsing
Zhifeng Xie
Rui Qiu
Sen Wang
Xin Tan
Yuan Xie
Lizhuang Ma
30
2
0
15 Jun 2024
Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model
Kyeongjin Ahn
Sungwon Han
Sungwon Park
Jihee Kim
Sangyoon Park
Meeyoung Cha
18
2
0
12 Jun 2024
Medical Vision Generalist: Unifying Medical Imaging Tasks in Context
Sucheng Ren
Xiaoke Huang
Xianhang Li
Junfei Xiao
Jieru Mei
Zeyu Wang
Alan Yuille
Yuyin Zhou
MedIm
34
7
0
08 Jun 2024
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
Duojun Huang
Xinyu Xiong
Jie Ma
Jichang Li
Zequn Jie
Lin Ma
Guanbin Li
VLM
44
11
0
01 Jun 2024
X-VILA: Cross-Modality Alignment for Large Language Model
Hanrong Ye
De-An Huang
Yao Lu
Zhiding Yu
Wei Ping
...
Jan Kautz
Song Han
Dan Xu
Pavlo Molchanov
Hongxu Yin
MLLM
VLM
40
29
0
29 May 2024
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation
Niclas Vodisch
Kürsat Petek
Markus Kappeler
Abhinav Valada
Wolfram Burgard
VLM
32
4
0
29 May 2024
In-Context Symmetries: Self-Supervised Learning through Contextual World Models
Sharut Gupta
Chenyu Wang
Yifei Wang
Tommi Jaakkola
Stefanie Jegelka
19
1
0
28 May 2024
Towards Global Optimal Visual In-Context Learning Prompt Selection
Chengming Xu
Chen Liu
Yikai Wang
Yanwei Fu
19
5
0
24 May 2024
PerSense: Personalized Instance Segmentation in Dense Images
Muhammad Ibraheem Siddiqui
Muhammad Umer Sheikh
Hassan Abid
Muhammad Haris Khan
VLM
47
0
0
22 May 2024
Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning
Guanglin Zhou
Zhongyi Han
Shiming Chen
Biwei Huang
Liming Zhu
Salman Khan
Xin Gao
Lina Yao
VLM
36
2
0
20 May 2024
NubbleDrop: A Simple Way to Improve Matching Strategy for Prompted One-Shot Segmentation
Zhiyu Xu
Qingliang Chen
17
0
0
19 May 2024
Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model
Zheng Gu
Shiyuan Yang
Jing Liao
Jing Huo
Yang Gao
VLM
DiffM
33
5
0
16 May 2024
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Jiaxin Zhang
Dezhi Peng
Chongyu Liu
Peirong Zhang
Lianwen Jin
VLM
30
12
0
07 May 2024
Customizing Text-to-Image Models with a Single Image Pair
Maxwell Jones
Sheng-Yu Wang
Nupur Kumari
David Bau
Jun-Yan Zhu
DiffM
25
18
0
02 May 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
26
6
0
02 May 2024
UniFS: Universal Few-shot Instance Perception with Point Representations
Sheng Jin
Ruijie Yao
Lumin Xu
Wentao Liu
Chao Qian
Ji Wu
Ping Luo
35
2
0
30 Apr 2024
Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
Donggyun Kim
Seongwoong Cho
Semin Kim
Chong Luo
Seunghoon Hong
VLM
31
2
0
29 Apr 2024
Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain
Steve Andreas Immanuel
H. R. Sinulingga
VLM
27
3
0
16 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li-Na Song
Wenjun Zhang
Zhiwu Huang
MLLM
36
0
0
15 Apr 2024
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
Jihao Liu
Jinliang Zheng
Yu Liu
Hongsheng Li
VLM
19
3
0
11 Apr 2024
Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks
Fulong Ma
Weiqing Qi
Guoyang Zhao
Linwei Zheng
Sheng Wang
Yuxuan Liu
Ming-Yu Liu
68
8
0
10 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi-Xin Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
25
248
0
03 Apr 2024
Roadside Monocular 3D Detection via 2D Detection Prompting
Yechi Ma
Shuoquan Wei
Churun Zhang
Wei Hua
Yanan Li
Shu Kong
31
0
0
01 Apr 2024
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
25
4
0
27 Mar 2024
SegICL: A Multimodal In-context Learning Framework for Enhanced Segmentation in Medical Imaging
Lingdong Shen
Fangxin Shang
Xiaoshuang Huang
Yehui Yang
Haifeng Huang
Shiming Xiang
VLM
11
3
0
25 Mar 2024
In-Context Matting
He Guo
Zixuan Ye
Zhiguo Cao
Hao Lu
VOS
18
0
0
23 Mar 2024
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model
Zheng-Wei Zhang
Yeyao Ma
Enming Zhang
Xiang Bai
VLM
MLLM
32
29
0
21 Mar 2024
VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning
Yongshuo Zong
Ondrej Bohdal
Timothy M. Hospedales
28
5
0
19 Mar 2024