Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.07539
Cited By
Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation
14 August 2024
Yubin Cho
Hyunwoo Yu
Suk-Ju Kang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation"
5 / 5 papers shown
Title
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation
Beoungwoo Kang
Seunghun Moon
Yubin Cho
Hyunwoo Yu
Suk-Ju Kang
ViT
MedIm
24
8
0
14 Aug 2024
TT-BLIP: Enhancing Fake News Detection Using BLIP and Tri-Transformer
Eunjee Choi
Jong-Kook Kim
32
1
0
19 Mar 2024
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip H. S. Torr
133
308
0
04 Dec 2021
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
ObjD
159
282
0
19 Mar 2020
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
249
1,817
0
18 Aug 2016
1