Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training

11 January 2022
Yehao Li
Jiahao Fan
Yingwei Pan
Ting Yao
Weiyao Lin
Tao Mei
MLLM, ObjD

Papers citing "Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training"

6 / 6 papers shown
Malicious Path Manipulations via Exploitation of Representation Vulnerabilities of Vision-Language Navigation Systems
Chashi Mahiul Islam
Shaeke Salman
M. Shams
Xiuwen Liu
Piyush Kumar
AAML
224
14
0
10 Jul 2024
SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks
Xingning Dong
Qingpei Guo
Tian Gan
Qing Wang
Yue Yu
Xiangyuan Ren
Yuan Cheng
Wei Chu
258
6
0
31 Jan 2024
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Tiannan Wang
Wangchunshu Zhou
Yan Zeng
Xinsong Zhang
VLM
248
70
0
14 Oct 2022
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
International Conference on Learning Representations (ICLR), 2022
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLM, AI4CE
394
19
0
15 Jun 2022
BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset
International Conference on Language Resources and Evaluation (LREC), 2022
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
287
6
0
28 May 2022
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts
International Conference on Machine Learning (ICML), 2021
Yan Zeng
Xinsong Zhang
Hang Li
VLM, CLIP
429
370
0
16 Nov 2021