Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2407.07402
Cited By
ActionVOS: Actions as Prompts for Video Object Segmentation
10 July 2024
Liangyang Ouyang
Ruicong Liu
Yifei Huang
Ryosuke Furuta
Yoichi Sato
VOS
Re-assign community
ArXiv (abs)
PDF
HTML
Github (31★)
Papers citing
"ActionVOS: Actions as Prompts for Video Object Segmentation"
6 / 6 papers shown
Title
SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation
Ruicong Liu
Yifei Huang
Liangyang Ouyang
Caixin Kang
Yoichi Sato
91
1
0
22 Nov 2025
Multi-speaker Attention Alignment for Multimodal Social Interaction
Liangyang Ouyang
Yifei Huang
Mingfang Zhang
Caixin Kang
Ryosuke Furuta
Yoichi Sato
102
0
0
22 Nov 2025
Segment-to-Act: Label-Noise-Robust Action-Prompted Video Segmentation Towards Embodied Intelligence
Wenxin Li
Kunyu Peng
Di Wen
Ruiping Liu
Mengfei Duan
Kai Luo
Kailun Yang
VLM
86
1
0
20 Sep 2025
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
362
10
0
01 Aug 2025
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos
International Conference on Learning Representations (ICLR), 2025
Jinfeng Xu
Yuanmin Huang
Baoqi Pei
Junlin Hou
Qingqiu Li
Guo Chen
Yuhui Zhang
Rui Feng
Weidi Xie
DiffM
253
16
0
16 Apr 2025
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Yuanmin Huang
Jilan Xu
Baoqi Pei
Yuping He
Guo Chen
...
Kunpeng Li
C. Yuan
Yidan Wang
Yu Qiao
L. Wang
420
13
0
31 Dec 2024
1