ResearchTrend.AI

OmniBind: Teach to Build Unequal-Scale Modality Interaction for Omni-Bind of All

25 May 2024
Yuanhuiyi Lyu
Xueye Zheng
Dahun Kim
Lin Wang

Papers citing "OmniBind: Teach to Build Unequal-Scale Modality Interaction for Omni-Bind of All"

18 / 18 papers shown
Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization
Xu Zheng
Yuanhuiyi Lyu
Lutao Jiang
Danda Pani Paudel
Luc Van Gool
Xuming Hu
12
0
0
10 May 2025
Segment Any RGB-Thermal Model with Language-aided Distillation
Dong Xing
Xianxun Zhu
Wei Zhou
Qika Lin
Hang Yang
Yuqing Wang
VLM
47
0
0
04 May 2025
4D Multimodal Co-attention Fusion Network with Latent Contrastive Alignment for Alzheimer's Diagnosis
Yuxiang Wei
Y. Zhang
Xi Xiao
Tianyang Wang
X. Wang
Vince D. Calhoun
25
0
0
23 Apr 2025
AnyTouch: Learning Unified Static-Dynamic Representation across Multiple Visuo-tactile Sensors
Ruoxuan Feng
Jiangyu Hu
Wenke Xia
Tianci Gao
Ao Shen
Yuhao Sun
Bin Fang
Di Hu
37
2
0
15 Feb 2025
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection
Xu Zheng
Yuanhuiyi Lyu
Lutao Jiang
Jiazhou Zhou
Lin Wang
Xuming Hu
67
4
0
22 Dec 2024
Gramian Multimodal Representation Learning and Alignment
Giordano Cicchetti
Eleonora Grassucci
Luigi Sigillo
Danilo Comminiello
72
0
0
16 Dec 2024
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities
Xueye Zheng
Yuanhuiyi Lyu
Lin Wang
VLM
29
10
0
16 Jul 2024
Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation
Xueye Zheng
Yuanhuiyi Lyu
Jiazhou Zhou
Lin Wang
17
7
0
16 Jul 2024
EIT-1M: One Million EEG-Image-Text Pairs for Human Visual-textual Recognition and More
Xu Zheng
Ling Wang
Kanghao Chen
Yuanhuiyi Lyu
Jiazhou Zhou
Lin Wang
16
1
0
02 Jul 2024
UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All
Yuanhuiyi Lyu
Xueye Zheng
Jiazhou Zhou
Lin Wang
16
14
0
19 Mar 2024
Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation
Ruiping Liu
Jiaming Zhang
Kunyu Peng
Yufan Chen
Ke Cao
Junwei Zheng
M. Sarfraz
Kailun Yang
Rainer Stiefelhagen
VLM
19
7
0
30 Jan 2024
ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data
Hao-Xin Zhao
Junsong Chen
Lijun Wang
Huchuan Lu
28
8
0
24 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
Self-Supervised Visuo-Tactile Pretraining to Locate and Follow Garment Features
J. Kerr
Huang Huang
Albert Wilcox
Ryan Hoque
Jeffrey Ichnowski
Roberto Calandra
Ken Goldberg
51
27
0
26 Sep 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
380
4,010
0
28 Jan 2022
PointCLIP: Point Cloud Understanding by CLIP
Renrui Zhang
Ziyu Guo
Wei Zhang
Kunchang Li
Xupeng Miao
Bin Cui
Yu Qiao
Peng Gao
Hongsheng Li
VLM
3DPC
158
428
0
04 Dec 2021
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIP
VLM
298
771
0
18 Apr 2021
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
922
0
24 Sep 2019