ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02547
  4. Cited By
Modeling Motion with Multi-Modal Features for Text-Based Video
  Segmentation

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation

6 April 2022
Wangbo Zhao
Kai Wang
Xiangxiang Chu
Fuzhao Xue
Xinchao Wang
Yang You
ArXivPDFHTML

Papers citing "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"

23 / 23 papers shown
Title
Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence Matching
Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence Matching
Heng Liu
Guanghui Li
Mingqi Gao
Xiantong Zhen
Feng Zheng
Y. Wang
VOS
40
0
0
18 Apr 2025
LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation
LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation
Lingyi Hong
Zhongying Liu
Wenchao Chen
Chenzhi Tan
Yuang Feng
...
Jinglun Li
Zhaoyu Chen
Shuyong Gao
Wei Zhang
Wenqiang Zhang
VLM
VOS
34
12
0
30 Apr 2024
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi-Xin Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
27
17
0
25 Dec 2023
Temporal Collection and Distribution for Referring Video Object
  Segmentation
Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
26
14
0
07 Sep 2023
Learning Cross-Modal Affinity for Referring Video Object Segmentation
  Targeting Limited Samples
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples
Guanghui Li
Mingqi Gao
Heng Liu
Xiantong Zhen
Feng Zheng
VOS
21
3
0
05 Sep 2023
Interpretation on Multi-modal Visual Fusion
Interpretation on Multi-modal Visual Fusion
Hao Chen
Hao Zhou
Yongjian Deng
26
0
0
19 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion
  Expressions
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
33
101
0
16 Aug 2023
Learning Referring Video Object Segmentation from Weak Annotation
Learning Referring Video Object Segmentation from Weak Annotation
Wangbo Zhao
Ke Nan
Songyang Zhang
Kai-xiang Chen
Dahua Lin
Yang You
VOS
19
2
0
04 Aug 2023
Spectrum-guided Multi-granularity Referring Video Object Segmentation
Spectrum-guided Multi-granularity Referring Video Object Segmentation
Bo Miao
Bennamoun
Yongsheng Gao
Ajmal Saeed Mian
VOS
29
34
0
25 Jul 2023
OnlineRefer: A Simple Online Baseline for Referring Video Object
  Segmentation
OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Dongming Wu
Tiancai Wang
Yuang Zhang
Xiangyu Zhang
Jianbing Shen
VOS
27
33
0
18 Jul 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring
  Video Object Segmentation
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
19
17
0
03 Jul 2023
LoSh: Long-Short Text Joint Prediction Network for Referring Video
  Object Segmentation
LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
Linfeng Yuan
Miaojing Shi
Zijie Yue
Qijun Chen
VOS
18
8
0
14 Jun 2023
PaintSeg: Training-free Segmentation via Painting
PaintSeg: Training-free Segmentation via Painting
Xiang Li
Chung-Ching Lin
Yinpeng Chen
Zicheng Liu
Jinglu Wang
Bhiksha Raj
27
5
0
30 May 2023
MotionBEV: Attention-Aware Online LiDAR Moving Object Segmentation with
  Bird's Eye View based Appearance and Motion Features
MotionBEV: Attention-Aware Online LiDAR Moving Object Segmentation with Bird's Eye View based Appearance and Motion Features
Bo Zhou
Jiapeng Xie
Yan Pan
Jiaji Wu
Chuanzhao Lu
3DPC
42
17
0
12 May 2023
Survey: Transformer based Video-Language Pre-training
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLM
ViT
61
44
0
21 Sep 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep
  Learning Workloads in GPU Clusters
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
48
47
0
08 Aug 2021
Full-Duplex Strategy for Video Object Segmentation
Full-Duplex Strategy for Video Object Segmentation
Ge-Peng Ji
Deng-Ping Fan
Keren Fu
Zhe Wu
Jianbing Shen
Ling Shao
VOS
78
129
0
06 Aug 2021
Visual Saliency Transformer
Visual Saliency Transformer
Nian Liu
Ni Zhang
Kaiyuan Wan
Ling Shao
Junwei Han
ViT
253
346
0
25 Apr 2021
Natural Language Video Localization: A Revisit in Span-based Question
  Answering Framework
Natural Language Video Localization: A Revisit in Span-based Question Answering Framework
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Joey Tianyi Zhou
Rick Siow Mong Goh
111
84
0
26 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,978
0
09 Feb 2021
Distilling Knowledge from Graph Convolutional Networks
Distilling Knowledge from Graph Convolutional Networks
Yiding Yang
Jiayan Qiu
Mingli Song
Dacheng Tao
Xinchao Wang
141
226
0
23 Mar 2020
Multi-task Collaborative Network for Joint Referring Expression
  Comprehension and Segmentation
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
ObjD
159
286
0
19 Mar 2020
Motion-Attentive Transition for Zero-Shot Video Object Segmentation
Motion-Attentive Transition for Zero-Shot Video Object Segmentation
Tianfei Zhou
Shunzhou Wang
Yi Zhou
Yazhou Yao
Jianwu Li
Ling Shao
VOS
122
189
0
09 Mar 2020
1