ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.06558
  4. Cited By
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and
  Unpaired Text-based Image Captioning

MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning

13 December 2021
Wenqiao Zhang
Haochen Shi
Jiannan Guo
Shengyu Zhang
Qingpeng Cai
Juncheng Li
Sihui Luo
Yueting Zhuang
    DiffM
ArXivPDFHTML

Papers citing "MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning"

21 / 21 papers shown
Title
Adaptation Method for Misinformation Identification
Adaptation Method for Misinformation Identification
Yangping Chen
Weijie Shi
Mengze Li
Yue Cui
H. Chen
Jia Zhu
Jiajie Xu
30
0
0
19 Apr 2025
IPO: Interpretable Prompt Optimization for Vision-Language Models
IPO: Interpretable Prompt Optimization for Vision-Language Models
Yingjun Du
Wenfang Sun
Cees G. M. Snoek
VLM
25
2
0
20 Oct 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
27
0
0
09 Aug 2024
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal
  Large Language Models
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Wenqiao Zhang
Tianwei Lin
Jiang Liu
Fangxun Shu
Haoyuan Li
...
Zheqi Lv
Hao Jiang
Juncheng Li
Siliang Tang
Yueting Zhuang
VLM
MLLM
30
4
0
20 Mar 2024
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction
  Data
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data
Qifan Yu
Juncheng Li
Longhui Wei
Liang Pang
Wentao Ye
Bosheng Qin
Siliang Tang
Qi Tian
Yueting Zhuang
MLLM
VLM
25
67
0
22 Nov 2023
Improving Image Captioning via Predicting Structured Concepts
Improving Image Captioning via Predicting Structured Concepts
Ting Wang
Weidong Chen
Yuanhe Tian
Yan Song
Zhendong Mao
21
8
0
14 Nov 2023
ImageBind-LLM: Multi-modality Instruction Tuning
ImageBind-LLM: Multi-modality Instruction Tuning
Jiaming Han
Renrui Zhang
Wenqi Shao
Peng Gao
Peng-Tao Xu
...
Yafei Wen
Xiaoxin Chen
Xiangyu Yue
Hongsheng Li
Yu Qiao
MLLM
35
116
0
07 Sep 2023
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative
  Instructions
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
Juncheng Li
Kaihang Pan
Zhiqi Ge
Minghe Gao
Wei Ji
Wenqiao Zhang
Tat-Seng Chua
Siliang Tang
Hanwang Zhang
Yueting Zhuang
MLLM
27
68
0
08 Aug 2023
Learning in Imperfect Environment: Multi-Label Classification with
  Long-Tailed Distribution and Partial Labels
Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels
Wenqiao Zhang
Changshuo Liu
Lingze Zeng
Beng Chin Ooi
Siliang Tang
Yueting Zhuang
24
13
0
20 Apr 2023
CAusal and collaborative proxy-tasKs lEarning for Semi-Supervised Domain
  Adaptation
CAusal and collaborative proxy-tasKs lEarning for Semi-Supervised Domain Adaptation
Wenqiao Zhang
Changshuo Liu
Can Cui
Beng Chin Ooi
CML
27
0
0
30 Mar 2023
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization
  for Few-shot Generalization
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization
Kaihang Pan
Juncheng Billy Li
Hongye Song
Jun Lin
Xiaozhong Liu
Siliang Tang
OffRL
25
10
0
22 Mar 2023
Gradient-Regulated Meta-Prompt Learning for Generalizable
  Vision-Language Models
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
Juncheng Li
Minghe Gao
Longhui Wei
Siliang Tang
Wenqiao Zhang
Meng Li
Wei Ji
Qi Tian
Tat-Seng Chua
Yueting Zhuang
VLM
VPVLM
27
18
0
12 Mar 2023
Improving Scene Text Image Super-resolution via Dual Prior Modulation
  Network
Improving Scene Text Image Super-resolution via Dual Prior Modulation Network
Shipeng Zhu
Zuoyan Zhao
Pengfei Fang
H. Xue
SupR
DiffM
42
24
0
21 Feb 2023
DEVICE: Depth and Visual Concepts Aware Transformer for OCR-based Image Captioning
DEVICE: Depth and Visual Concepts Aware Transformer for OCR-based Image Captioning
Dongsheng Xu
Qingbao Huang
Shuang Feng
Yiru Cai
Feng Shuang
Yi Cai
ViT
VLM
20
1
0
03 Feb 2023
Controllable Image Captioning via Prompting
Controllable Image Captioning via Prompting
Ning Wang
Jiahao Xie
Jihao Wu
Mingbo Jia
Linlin Li
14
23
0
04 Dec 2022
DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation
  Framework for Efficient Device Model Generalization
DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device Model Generalization
Zheqi Lv
Wenqiao Zhang
Shengyu Zhang
Kun Kuang
Feng Wang
...
Zhengyu Chen
T. Shen
Hongxia Yang
Bengchin Ooi
Fei Wu
39
52
0
12 Sep 2022
Dilated Context Integrated Network with Cross-Modal Consensus for
  Temporal Emotion Localization in Videos
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos
Juncheng Billy Li
Junlin Xie
Linchao Zhu
Long Qian
Siliang Tang
...
Haochen Shi
Shengyu Zhang
Longhui Wei
Qi Tian
Yueting Zhuang
30
12
0
03 Aug 2022
Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of
  Semi-Supervised Learning and Active Learning
Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning
Jiannan Guo
Yangyang Kang
Yu Duan
Xiaozhong Liu
Siliang Tang
Wenqiao Zhang
Kun Kuang
Changlong Sun
Fei Wu
27
4
0
07 Jun 2022
Compositional Temporal Grounding with Structured Variational Cross-Graph
  Correspondence Learning
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning
Juncheng Li
Junlin Xie
Long Qian
Linchao Zhu
Siliang Tang
Fei Wu
Yi Yang
Yueting Zhuang
X. Wang
26
73
0
24 Mar 2022
End-to-End Modeling via Information Tree for One-Shot Natural Language
  Spatial Video Grounding
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding
Meng Li
Tianbao Wang
Haoyu Zhang
Shengyu Zhang
Zhou Zhao
...
Wenming Tan
Jin Wang
Peng Wang
Shi Pu
Fei Wu
19
45
0
15 Mar 2022
BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive
  Pseudo Labeling and Informative Active Annotation
BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation
Wenqiao Zhang
Lei Zhu
James Hallinan
A. Makmur
Shengyu Zhang
Qingpeng Cai
Beng Chin Ooi
30
79
0
04 Mar 2022
1