ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.07539
  4. Cited By
Cross-aware Early Fusion with Stage-divided Vision and Language
  Transformer Encoders for Referring Image Segmentation

Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation

IEEE transactions on multimedia (IEEE TMM), 2024
14 August 2024
Yubin Cho
Hyunwoo Yu
Suk-Ju Kang
ArXiv (abs)PDFHTMLGithub

Papers citing "Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation"

16 / 16 papers shown
Integrating Visual and X-Ray Machine Learning Features in the Study of Paintings by Goya
Integrating Visual and X-Ray Machine Learning Features in the Study of Paintings by Goya
Hassan Ugail
Ismail Lujain Jaleel
110
0
0
02 Nov 2025
Latent Expression Generation for Referring Image Segmentation and Grounding
Latent Expression Generation for Referring Image Segmentation and Grounding
S. Yu
Joonbeom Hong
Joonseok Lee
Jeany Son
ObjD
306
2
0
07 Aug 2025
Multimodal Referring Segmentation: A Survey
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
521
19
0
01 Aug 2025
RemoteSAM: Towards Segment Anything for Earth Observation
RemoteSAM: Towards Segment Anything for Earth Observation
Liang Yao
Fan Liu
Delong Chen
Chuanyi Zhang
Yijun Wang
Ziyun Chen
Wei Xu
Shimin Di
Yuhui Zheng
871
29
0
23 May 2025
BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation
BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation
Rafi Ibn Sultan
Hui Zhu
Chengyin Li
Dongxiao Zhu
292
1
0
30 Mar 2025
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models
Keyan Chen
Jiafan Zhang
Chenyang Liu
Zhengxia Zou
Zhenwei Shi
VLM
340
27
0
12 Jan 2025
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation
Zhe Dong
Yuzhe Sun
Tianzhu Liu
Wangmeng Zuo
Yanfeng Gu
530
23
0
11 Oct 2024
Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing
  Image Segmentation
Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image SegmentationIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Sen Lei
Xinyu Xiao
Heng-Chao Li
Z. Shi
Qing Zhu
350
32
0
20 Sep 2024
Depth-Weighted Detection of Behaviours of Risk in People with Dementia using Cameras
Depth-Weighted Detection of Behaviours of Risk in People with Dementia using Cameras
Pratik K. Mishra
Irene Ballester
Andrea Iaboni
Bing Ye
Kristine Newman
Alex Mihailidis
Shehroz S. Khan
318
2
0
28 Aug 2024
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient
  Semantic Segmentation
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Beoungwoo Kang
Seunghun Moon
Yubin Cho
Hyunwoo Yu
Suk-Ju Kang
ViTMedIm
322
30
0
14 Aug 2024
Embedding-Free Transformer with Inference Spatial Reduction for
  Efficient Semantic Segmentation
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
299
16
0
24 Jul 2024
Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring
  Image Segmentation
Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation
Seonghoon Yu
Paul Hongsuck Seo
Jeany Son
DiffM
479
12
0
10 Jul 2024
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation
Weize Li
Zhicheng Zhao
Haochen Bai
Fei Su
602
9
0
24 May 2024
Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for
  Referring Image Segmentation
Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for Referring Image Segmentation
Yichen Yan
Xingjian He
Sihan Chen
Shichen Lu
Jing Liu
309
1
0
18 May 2024
TT-BLIP: Enhancing Fake News Detection Using BLIP and Tri-Transformer
TT-BLIP: Enhancing Fake News Detection Using BLIP and Tri-Transformer
Eunjee Choi
Jong-Kook Kim
333
12
0
19 Mar 2024
EAVL: Explicitly Align Vision and Language for Referring Image
  Segmentation
EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Yimin Yan
Xingjian He
Wenxuan Wang
Sihan Chen
Qingbin Liu
ObjDVLM
379
2
0
18 Aug 2023
1
Page 1 of 1