2007.01951
Cited By
v1
v2 (latest)
Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation
3 July 2020
Liwei Wang
Jing-ling Huang
Yin Li
Kun Xu
Zhengyuan Yang
Dong Yu
ObjD
Papers citing "Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation"
43 / 43 papers shown
LIHE: Linguistic Instance-Split Hyperbolic-Euclidean Framework for Generalized Weakly-Supervised Referring Expression Comprehension
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
X. Shi
Silin Cheng
Sirui Zhao
Yunhan Jiang
Enhong Chen
Yang Liu
Sebastien Ourselin
191
1
0
15 Nov 2025
Learning Egocentric In-Hand Object Segmentation through Weak Supervision from Human Narrations
Nicola Messina
Rosario Leonardi
Luca Ciampi
F. Carrara
G. Farinella
Fabrizio Falchi
Antonino Furnari
EgoV
417
0
0
30 Sep 2025
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
Jiangnan Xie
Xiaolong Zheng
Liang Zheng
ObjD
208
0
0
08 Sep 2025
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
Ta Duc Huy
Duy Anh Huynh
Yutong Xie
Yuankai Qi
Qi Chen
...
Anton van den Hengel
Zhibin Liao
Minh-Son To
Johan Verjans
Vu Minh Hieu Phan
481
4
0
21 May 2025
3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment
IEEE International Conference on Robotics and Automation (ICRA), 2025
Xianrui Li
Jing Liu
Nuowei Han
Liang Heng
Yike Guo
Hao Dong
Yang Liu
280
2
0
03 May 2025
Towards Visual Grounding: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
1.1K
43
0
28 Dec 2024
Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAM
Navid Rajabi
Jana Kosecka
236
3
0
29 Apr 2024
How to Understand "Support"? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding
Jiamin Luo
Jianing Zhao
Jingjing Wang
Guodong Zhou
257
0
0
29 Feb 2024
Cycle-Consistency Learning for Captioning and Grounding
Ning Wang
Jiajun Deng
Mingbo Jia
ObjD
321
15
0
23 Dec 2023
SEER-ZSL: Semantic Encoder-Enhanced Representations for Generalized Zero-Shot Learning
William Heyden
Habib Ullah
M. Salman Siddiqui
Fadi Al Machot
VLM
300
2
0
20 Dec 2023
Weakly-Supervised 3D Visual Grounding based on Visual Language Alignment
IEEE Transactions on Multimedia (IEEE TMM), 2023
Xiaoxu Xu
Yitian Yuan
Qiudan Zhang
Wen-Bin Wu
Zequn Jie
Lin Ma
Xu Wang
634
5
0
15 Dec 2023
EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model
IEEE Transactions on Multimedia (IEEE TMM), 2023
Guozhang Li
Xinpeng Ding
De Cheng
Jie Li
Nannan Wang
Xinbo Gao
507
5
0
05 Dec 2023
Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Chancharik Mitra
Abrar Anwar
Rodolfo Corona
Dan Klein
Trevor Darrell
Jesse Thomason
325
3
0
12 Nov 2023
Shatter and Gather: Learning Referring Image Segmentation with Text Supervision
IEEE International Conference on Computer Vision (ICCV), 2023
Dongwon Kim
Nam-Won Kim
Cuiling Lan
Suha Kwak
VLM
339
29
0
29 Aug 2023
Referring Image Segmentation Using Text Supervision
IEEE International Conference on Computer Vision (ICCV), 2023
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
Lulu Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
344
49
0
28 Aug 2023
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
IEEE International Conference on Computer Vision (ICCV), 2023
Zehan Wang
Haifeng Huang
Yang Zhao
Lin Li
Xize Cheng
Yichen Zhu
Aoxiong Yin
Zhou Zhao
222
30
0
18 Jul 2023
Top-Down Framework for Weakly-supervised Grounded Image Captioning
Chen Cai
Suchen Wang
Kim-Hui Yap
Yi Wang
ObjD
269
4
0
13 Jun 2023
Weakly-Supervised Visual-Textual Grounding with Semantic Prior Refinement
British Machine Vision Conference (BMVC), 2023
Davide Rigoni
Luca Parolari
Luciano Serafini
A. Sperduti
Lamberto Ballan
238
1
0
18 May 2023
CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual Grounding
IEEE Transactions on Multimedia (IEEE TMM), 2023
Linhui Xiao
Xiaoshan Yang
Fang Peng
Ming Yan
Yaowei Wang
Changsheng Xu
ObjD
VLM
563
67
0
15 May 2023
Focusing On Targets For Improving Weakly Supervised Visual Grounding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
V. Pham
Nao Mishima
ObjD
238
1
0
22 Feb 2023
Who are you referring to? Coreference resolution in image narrations
IEEE International Conference on Computer Vision (ICCV), 2022
A. Goel
Basura Fernando
Frank Keller
Hakan Bilen
359
5
0
26 Nov 2022
A Unified Mutual Supervision Framework for Referring Expression Segmentation and Generation
Shijia Huang
Feng Li
Hao Zhang
Siyi Liu
Lei Zhang
Liwei Wang
207
5
0
15 Nov 2022
Exploring Generalizable Distillation for Efficient Medical Image Segmentation
IEEE Journal of Biomedical and Health Informatics (IEEE JBHI), 2022
Xingqun Qi
Zhuo Wu
Min Ren
Muyi Sun
Caifeng Shan
Zhe Sun
261
9
0
26 Jul 2022
Contrastive Deep Supervision
European Conference on Computer Vision (ECCV), 2022
Linfeng Zhang
Xin Chen
Junbo Zhang
Runpei Dong
Kaisheng Ma
336
47
0
12 Jul 2022
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Zhuo Chen
Yufen Huang
Jiaoyan Chen
Yuxia Geng
Wen Zhang
Yin Fang
Jeff Z. Pan
Huajun Chen
VLM
493
94
0
04 Jul 2022
Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations
Computer Vision and Pattern Recognition (CVPR), 2022
Ziyan Yang
Kushal Kafle
Franck Dernoncourt
Vicente Ordónez Román
VLM
454
32
0
30 Jun 2022
A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training
Zhihao Fan
Zhongyu Wei
Jingjing Chen
Siyuan Wang
Zejun Li
Jiarong Xu
Xuanjing Huang
CLL
172
6
0
11 Jun 2022
Guiding Visual Question Answering with Attention Priors
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
T. Le
Vuong Le
Sunil R. Gupta
Svetha Venkatesh
T. Tran
286
9
0
25 May 2022
Region-aware Knowledge Distillation for Efficient Image-to-Image Translation
British Machine Vision Conference (BMVC), 2022
Linfeng Zhang
Xin Chen
Runpei Dong
Kaisheng Ma
VLM
315
13
0
25 May 2022
Beyond Bounding Box: Multimodal Knowledge Learning for Object Detection
Wei Feng
Xingyuan Bu
Chenchen Zhang
Xubin Li
VLM
198
6
0
09 May 2022
Adapting CLIP For Phrase Localization Without Further Training
Jiahao Li
G. Shakhnarovich
Raymond A. Yeh
VLM
CLIP
315
27
0
07 Apr 2022
Multi-View Transformer for 3D Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2022
Shijia Huang
Yilun Chen
Jiaya Jia
Liwei Wang
466
191
0
05 Apr 2022
TubeDETR: Spatio-Temporal Video Grounding with Transformers
Computer Vision and Pattern Recognition (CVPR), 2022
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
ViT
390
127
0
30 Mar 2022
Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships
Computer Vision and Pattern Recognition (CVPR), 2022
Chao Lou
Wenjuan Han
Yuh-Chen Lin
Zilong Zheng
CoGe
289
11
0
27 Mar 2022
Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2022
Haojun Jiang
Yuanze Lin
Dongchen Han
Shiji Song
Gao Huang
ObjD
430
67
0
16 Mar 2022
GroupViT: Semantic Segmentation Emerges from Text Supervision
Computer Vision and Pattern Recognition (CVPR), 2022
Jiarui Xu
Shalini De Mello
Sifei Liu
Wonmin Byeon
Thomas Breuel
Jan Kautz
Xinyu Wang
ViT
VLM
937
685
0
22 Feb 2022
Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching
Neurocomputing, 2022
Hengcan Shi
Munawar Hayat
Jianfei Cai
ObjD
275
12
0
18 Jan 2022
Injecting Semantic Concepts into End-to-End Image Captioning
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lin Liang
Zhe Gan
Lijuan Wang
Yezhou Yang
Zicheng Liu
ViT
VLM
289
124
0
09 Dec 2021
Making a Bird AI Expert Work for You and Me
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Dongliang Chang
Kaiyue Pang
Ruoyi Du
Zhanyu Ma
Yi-Zhe Song
Jun Guo
341
22
0
06 Dec 2021
Weakly-Supervised Video Object Grounding via Causal Intervention
Wei Wang
Junyu Gao
Changsheng Xu
CML
358
32
0
01 Dec 2021
A Survey on Temporal Sentence Grounding in Videos
Xiaohan Lan
Yitian Yuan
Xin Eric Wang
Zhi Wang
Wenwu Zhu
406
59
0
16 Sep 2021
Distributed Attention for Grounded Image Captioning
Nenglun Chen
Xingjia Pan
Runnan Chen
Lei Yang
Zhiwen Lin
Yuqiang Ren
Haolei Yuan
Xiaowei Guo
Feiyue Huang
Wenping Wang
470
23
0
02 Aug 2021
Relation-aware Instance Refinement for Weakly Supervised Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2021
Yongfei Liu
Bo Wan
Lin Ma
Xuming He
ObjD
298
65
0
24 Mar 2021