ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.11605
  4. Cited By
VG4D: Vision-Language Model Goes 4D Video Recognition

VG4D: Vision-Language Model Goes 4D Video Recognition

17 April 2024
Zhichao Deng
Xiangtai Li
Xia Li
Yunhai Tong
Shen Zhao
Mengyuan Liu
    3DPC
ArXivPDFHTML

Papers citing "VG4D: Vision-Language Model Goes 4D Video Recognition"

11 / 11 papers shown
Title
Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks
Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks
Jiayi He
Hehai Lin
Q. Wang
Yi Ren Fung
Heng Ji
ReLM
LRM
86
3
0
05 Oct 2024
Multi-Modality Co-Learning for Efficient Skeleton-based Action
  Recognition
Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition
Jinfu Liu
C. L. P. Chen
Mengyuan Liu
37
11
0
22 Jul 2024
ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point
  Cloud Classification
ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification
Zhongbin Fang
Xia Li
Xiangtai Li
Shen Zhao
Mengyuan Liu
3DPC
23
4
0
16 Jan 2024
Dynamic Dense Graph Convolutional Network for Skeleton-based Human
  Motion Prediction
Dynamic Dense Graph Convolutional Network for Skeleton-based Human Motion Prediction
Xinshun Wang
Wanying Zhang
Can Wang
Yuan Gao
Mengyuan Liu
3DH
GNN
30
32
0
29 Nov 2023
Visual Language Maps for Robot Navigation
Visual Language Maps for Robot Navigation
Chen Huang
Oier Mees
Andy Zeng
Wolfram Burgard
LM&Ro
140
337
0
11 Oct 2022
Let Images Give You More:Point Cloud Cross-Modal Training for Shape
  Analysis
Let Images Give You More:Point Cloud Cross-Modal Training for Shape Analysis
Xu Yan
Heshen Zhan
Chaoda Zheng
Jiantao Gao
Ruimao Zhang
Shuguang Cui
Zhen Li
3DPC
46
30
0
09 Oct 2022
Open-vocabulary Queryable Scene Representations for Real World Planning
Open-vocabulary Queryable Scene Representations for Real World Planning
Boyuan Chen
F. Xia
Brian Ichter
Kanishka Rao
K. Gopalakrishnan
Michael S. Ryoo
Austin Stone
Daniel Kappler
LM&Ro
138
179
0
20 Sep 2022
MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point
  Cloud Action Recognition
MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition
Xiaodong Chen
Wu Liu
Xinchen Liu
Yongdong Zhang
Jungong Han
Tao Mei
3DPC
31
12
0
01 Sep 2022
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
Hehe Fan
Xin Yu
Yuhang Ding
Yi Yang
Mohan S. Kankanhalli
3DPC
101
108
0
27 May 2022
PointCLIP: Point Cloud Understanding by CLIP
PointCLIP: Point Cloud Understanding by CLIP
Renrui Zhang
Ziyu Guo
Wei Zhang
Kunchang Li
Xupeng Miao
Bin Cui
Yu Qiao
Peng Gao
Hongsheng Li
VLM
3DPC
158
428
0
04 Dec 2021
PointNet: Deep Learning on Point Sets for 3D Classification and
  Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
210
13,886
0
02 Dec 2016
1