Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding

9 May 2018
Zhou Yu
Jun-chen Yu
Chenchao Xiang
Zhou Zhao
Q. Tian
Dacheng Tao
    ObjD

Papers citing "Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding"

21 / 71 papers shown
PhraseCut: Language-based Image Segmentation in the Wild
Chenyun Wu
Zhe Lin
Scott D. Cohen
Trung Bui
Subhransu Maji
VLM
241
136
0
03 Aug 2020
Referring Expression Comprehension: A Survey of Methods and Datasets
IEEE Transactions on Multimedia (TMM), 2020
Yanyuan Qiao
Chaorui Deng
Qi Wu
ObjD
341
118
0
19 Jul 2020
Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering
C. Sur
90
9
0
25 Jun 2020
Deep Multimodal Neural Architecture Search
ACM Multimedia (ACM MM), 2020
Zhou Yu
Yuhao Cui
Jun-chen Yu
Meng Wang
Dacheng Tao
Qi Tian
165
108
0
25 Apr 2020
Image Co-skeletonization via Co-segmentation
IEEE Transactions on Image Processing (TIP), 2020
Koteswar Rao Jerripothula
Jianfei Cai
Jiangbo Lu
Junsong Yuan
92
9
0
12 Apr 2020
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
Computer Vision and Pattern Recognition (CVPR), 2020
Gen Luo
Weihao Ye
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
ObjD
469
349
0
19 Mar 2020
MUTATT: Visual-Textual Mutual Guidance for Referring Expression Comprehension
IEEE International Conference on Multimedia and Expo (ICME), 2020
Shuai Wang
Fan Lyu
Wei Feng
Song Wang
ObjD
156
5
0
18 Mar 2020
A Real-time Global Inference Network for One-stage Referring Expression Comprehension
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Weihao Ye
Rongrong Ji
Gen Luo
Xiaoshuai Sun
Jinsong Su
Xinghao Ding
Chia-Wen Lin
Q. Tian
ObjD
192
77
0
07 Dec 2019
Learning Cross-modal Context Graph for Visual Grounding
AAAI Conference on Artificial Intelligence (AAAI), 2019
Yongfei Liu
Bo Wan
Xiao-Dan Zhu
Xuming He
269
98
0
20 Nov 2019
Phrase Grounding by Soft-Label Chain Conditional Random Field
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Hamish Ivison
Anjali Narayan-Chen
111
10
0
01 Sep 2019
Zero-Shot Grounding of Objects from Natural Language Queries
IEEE International Conference on Computer Vision (ICCV), 2019
Arka Sadhu
Kan Chen
Ram Nevatia
ObjD
250
172
0
20 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
252
44
0
12 Aug 2019
Bilinear Graph Networks for Visual Question Answering
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Dalu Guo
Chang Xu
Dacheng Tao
GNN
199
68
0
23 Jul 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2019
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
323
929
0
25 Jun 2019
Joint Visual Grounding with Language Scene Graphs
Daqing Liu
Hanwang Zhang
Zhengjun Zha
Meng Wang
Qianru Sun
191
6
0
09 Jun 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2019
Zhou Yu
D. Xu
Jun-chen Yu
Ting Yu
Zhou Zhao
Yueting Zhuang
Dacheng Tao
307
612
0
06 Jun 2019
Learning to Compose and Reason with Language Tree Structures for Visual Grounding
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Richang Hong
Daqing Liu
Xiaoyu Mo
Xiangnan He
Hanwang Zhang
ReLM
LRM
235
197
0
05 Jun 2019
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
193
426
0
20 May 2019
Image-Question-Answer Synergistic Network for Visual Dialog
Computer Vision and Pattern Recognition (CVPR), 2019
Dalu Guo
Chang Xu
Dacheng Tao
168
77
0
26 Feb 2019
AU R-CNN: Encoding Expert Prior Knowledge into R-CNN for Action Unit Detection
Chen Ma
Li Chen
Jun-hai Yong
129
91
0
14 Dec 2018
Learning to Assemble Neural Module Tree Networks for Visual Grounding
Daqing Liu
Hanwang Zhang
Feng Wu
Zhengjun Zha
377
306
0
08 Dec 2018