ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.00839
  4. Cited By
Improving Referring Expression Grounding with Cross-modal
  Attention-guided Erasing

Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing

3 March 2019
Xihui Liu
Zihao W. Wang
Jing Shao
Xiaogang Wang
Hongsheng Li
    ObjD
ArXivPDFHTML

Papers citing "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"

41 / 91 papers shown
Title
YouRefIt: Embodied Reference Understanding with Language and Gesture
YouRefIt: Embodied Reference Understanding with Language and Gesture
Yixin Chen
Qing Li
Deqian Kong
Yik Lun Kei
Song-Chun Zhu
Tao Gao
Yixin Zhu
Siyuan Huang
LM&Ro
35
41
0
08 Sep 2021
Auto-Parsing Network for Image Captioning and Visual Question Answering
Auto-Parsing Network for Image Captioning and Visual Question Answering
Xu Yang
Chongyang Gao
Hanwang Zhang
Jianfei Cai
9
35
0
24 Aug 2021
A Better Loss for Visual-Textual Grounding
A Better Loss for Visual-Textual Grounding
Davide Rigoni
Luciano Serafini
A. Sperduti
ObjD
17
3
0
11 Aug 2021
Word2Pix: Word to Pixel Cross Attention Transformer in Visual Grounding
Word2Pix: Word to Pixel Cross Attention Transformer in Visual Grounding
Heng Zhao
Joey Tianyi Zhou
Yew-Soon Ong
ObjD
17
23
0
31 Jul 2021
Cross-Modal Discrete Representation Learning
Cross-Modal Discrete Representation Learning
Alexander H. Liu
SouYoung Jin
Cheng-I Jeff Lai
Andrew Rouditchenko
A. Oliva
James R. Glass
SSL
22
40
0
10 Jun 2021
Discriminative Triad Matching and Reconstruction for Weakly Referring
  Expression Grounding
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding
Mingjie Sun
Jimin Xiao
Eng Gee Lim
Si Liu
John Y. Goulermas
ObjD
11
160
0
08 Jun 2021
Referring Transformer: A One-step Approach to Multi-task Visual
  Grounding
Referring Transformer: A One-step Approach to Multi-task Visual Grounding
Muchen Li
Leonid Sigal
ObjD
10
187
0
06 Jun 2021
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language
  Matching
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching
Chenchi Zhang
Wenbo Ma
Jun Xiao
Hanwang Zhang
Jian Shao
Yueting Zhuang
Long Chen
15
4
0
12 May 2021
Understanding Synonymous Referring Expressions via Contrastive Features
Understanding Synonymous Referring Expressions via Contrastive Features
Yi-Wen Chen
Yi-Hsuan Tsai
Ming-Hsuan Yang
ObjD
11
4
0
20 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
21
329
0
17 Apr 2021
Disentangled Motif-aware Graph Learning for Phrase Grounding
Disentangled Motif-aware Graph Learning for Phrase Grounding
Zongshen Mu
Siliang Tang
Jie Tan
Qiang Yu
Yueting Zhuang
GNN
31
35
0
13 Apr 2021
Look Before You Leap: Learning Landmark Features for One-Stage Visual
  Grounding
Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding
Binbin Huang
Dongze Lian
Weixin Luo
Shenghua Gao
ObjD
8
92
0
09 Apr 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
21
175
0
31 Mar 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
22
52
0
24 Mar 2021
Co-matching: Combating Noisy Labels by Augmentation Anchoring
Co-matching: Combating Noisy Labels by Augmentation Anchoring
Yangdi Lu
Yang Bo
Wenbo He
NoLa
19
7
0
23 Mar 2021
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD
  Images
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
Haolin Liu
Anran Lin
Xiaoguang Han
Lei Yang
Yizhou Yu
Shuguang Cui
17
39
0
14 Mar 2021
Iterative Shrinking for Referring Expression Grounding Using Deep
  Reinforcement Learning
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning
Mingjie Sun
Jimin Xiao
Eng Gee Lim
ObjD
14
33
0
09 Mar 2021
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding
  on Point Clouds through Instance Multi-level Contextual Referring
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Zhihao Yuan
Xu Yan
Yinghong Liao
Ruimao Zhang
Sheng Wang
Zhen Li
Shuguang Cui
61
128
0
01 Mar 2021
PPGN: Phrase-Guided Proposal Generation Network For Referring Expression
  Comprehension
PPGN: Phrase-Guided Proposal Generation Network For Referring Expression Comprehension
Chao Yang
Guoqing Wang
Dongsheng Li
Huawei Shen
Su Feng
Bin Jiang
7
3
0
20 Dec 2020
Utilizing Every Image Object for Semi-supervised Phrase Grounding
Utilizing Every Image Object for Semi-supervised Phrase Grounding
Haidong Zhu
Arka Sadhu
Zhao-Heng Zheng
Ram Nevatia
ObjD
12
7
0
05 Nov 2020
Actor and Action Modular Network for Text-based Video Segmentation
Actor and Action Modular Network for Text-based Video Segmentation
Jianhua Yang
Yan Huang
K. Niu
Linjiang Huang
Zhanyu Ma
Liang Wang
11
9
0
02 Nov 2020
A Benchmark and Baseline for Language-Driven Image Editing
A Benchmark and Baseline for Language-Driven Image Editing
Jing Shi
Ning Xu
Trung Bui
Franck Dernoncourt
Zheng Wen
Chenliang Xu
DiffM
122
30
0
05 Oct 2020
AttnGrounder: Talking to Cars with Attention
AttnGrounder: Talking to Cars with Attention
Vivek Mittal
ViT
14
11
0
11 Sep 2020
Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression
  Grounding
Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding
Long Chen
Wenbo Ma
Jun Xiao
Hanwang Zhang
Shih-Fu Chang
ObjD
4
89
0
03 Sep 2020
Richly Activated Graph Convolutional Network for Robust Skeleton-based
  Action Recognition
Richly Activated Graph Convolutional Network for Robust Skeleton-based Action Recognition
Yisheng Song
Zhang Zhang
Caifeng Shan
Liang Wang
24
164
0
09 Aug 2020
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary
  Instructions
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
Xihui Liu
Zhe-nan Lin
Jianming Zhang
Handong Zhao
Quan Hung Tran
Xiaogang Wang
Hongsheng Li
DiffM
20
34
0
04 Aug 2020
PhraseCut: Language-based Image Segmentation in the Wild
PhraseCut: Language-based Image Segmentation in the Wild
Chenyun Wu
Zhe-nan Lin
Scott D. Cohen
Trung Bui
Subhransu Maji
VLM
13
111
0
03 Aug 2020
Describing Textures using Natural Language
Describing Textures using Natural Language
Chenyun Wu
Mikayla Timm
Subhransu Maji
3DV
20
10
0
03 Aug 2020
Referring Expression Comprehension: A Survey of Methods and Datasets
Referring Expression Comprehension: A Survey of Methods and Datasets
Yanyuan Qiao
Chaorui Deng
Qi Wu
ObjD
42
93
0
19 Jul 2020
MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language
  Queries at Phrase Level
MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level
Amar Shrestha
Krittaphat Pugdeethosapol
Haowen Fang
Qinru Qiu
ObjD
14
2
0
06 Jun 2020
Giving Commands to a Self-driving Car: A Multimodal Reasoner for Visual
  Grounding
Giving Commands to a Self-driving Car: A Multimodal Reasoner for Visual Grounding
Thierry Deruyttere
Guillem Collell
Marie-Francine Moens
LRM
6
8
0
19 Mar 2020
Cops-Ref: A new Dataset and Task on Compositional Referring Expression
  Comprehension
Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension
Zhenfang Chen
Peng Wang
Lin Ma
Kwan-Yee Kenneth Wong
Qi Wu
ObjD
18
67
0
01 Mar 2020
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
3DPC
22
341
0
18 Dec 2019
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Zihao W. Wang
Xihui Liu
Hongsheng Li
Lu Sheng
Junjie Yan
Xiaogang Wang
Jing Shao
VLM
23
299
0
12 Sep 2019
A Fast and Accurate One-Stage Approach to Visual Grounding
A Fast and Accurate One-Stage Approach to Visual Grounding
Zhengyuan Yang
Boqing Gong
Liwei Wang
Wenbing Huang
Dong Yu
Jiebo Luo
ObjD
12
360
0
18 Aug 2019
Exploiting Temporal Relationships in Video Moment Localization with
  Natural Language
Exploiting Temporal Relationships in Video Moment Localization with Natural Language
Songyang Zhang
Jinsong Su
Jiebo Luo
6
74
0
11 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
17
77
0
10 Aug 2019
Deep Self-Learning From Noisy Labels
Deep Self-Learning From Noisy Labels
Jiangfan Han
Ping Luo
Xiaogang Wang
NoLa
11
276
0
06 Aug 2019
Joint Visual Grounding with Language Scene Graphs
Joint Visual Grounding with Language Scene Graphs
Daqing Liu
Hanwang Zhang
Zhengjun Zha
Meng Wang
Qianru Sun
25
6
0
09 Jun 2019
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor
  Environments
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
Yuankai Qi
Qi Wu
Peter Anderson
X. Wang
W. Wang
Chunhua Shen
A. Hengel
LM&Ro
12
316
0
23 Apr 2019
Revisiting Visual Grounding
Revisiting Visual Grounding
E. Conser
Kennedy Hahn
Chandler M. Watson
Melanie Mitchell
9
5
0
03 Apr 2019
Previous
12