arXiv:2407.07402

ActionVOS: Actions as Prompts for Video Object Segmentation

10 July 2024

Liangyang Ouyang

arXiv (abs) · PDF · HTML · GitHub (31★)

Papers citing "ActionVOS: Actions as Prompts for Video Object Segmentation"

6 / 6 papers shown

SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation

Liangyang Ouyang

99

1

0

22 Nov 2025

Multi-speaker Attention Alignment for Multimodal Social Interaction

Liangyang Ouyang

106

0

0

22 Nov 2025

Segment-to-Act: Label-Noise-Robust Action-Prompted Video Segmentation Towards Embodied Intelligence

126

1

0

20 Sep 2025

Multimodal Referring Segmentation: A Survey

378

11

0

01 Aug 2025

EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos

International Conference on Learning Representations (ICLR), 2025

277

16

0

16 Apr 2025

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

...

448

13

0

31 Dec 2024