Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2204.02968
Cited By
Temporal Alignment Networks for Long-term Video
Computer Vision and Pattern Recognition (CVPR), 2022
6 April 2022
Tengda Han
Weidi Xie
Andrew Zisserman
AI4TS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Temporal Alignment Networks for Long-term Video"
23 / 73 papers shown
Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval
IEEE Transactions on Image Processing (IEEE TIP), 2023
Yabing Wang
Shuhui Wang
Hao Luo
Jianfeng Dong
F. Wang
Meng Han
Xun Wang
Meng Wang
207
13
0
11 Sep 2023
Opening the Vocabulary of Egocentric Actions
Neural Information Processing Systems (NeurIPS), 2023
Dibyadip Chatterjee
Fadime Sener
Shugao Ma
Angela Yao
VLM
303
23
0
22 Aug 2023
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding
Neural Information Processing Systems (NeurIPS), 2023
K. Mangalam
Raiymbek Akshulakov
Jitendra Malik
399
495
0
17 Aug 2023
Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Neural Information Processing Systems (NeurIPS), 2023
Kumar Ashutosh
Santhosh Kumar Ramakrishnan
Triantafyllos Afouras
Kristen Grauman
299
37
0
17 Jul 2023
Learning to Ground Instructional Articles in Videos through Narrations
IEEE International Conference on Computer Vision (ICCV), 2023
E. Mavroudi
Triantafyllos Afouras
Lorenzo Torresani
DiffM
217
27
0
06 Jun 2023
StepFormer: Self-supervised Step Discovery and Localization in Instructional Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Nikita Dvornik
Isma Hadji
Ran Zhang
Konstantinos G. Derpanis
Animesh Garg
Richard P. Wildes
Allan D. Jepson
192
38
0
26 Apr 2023
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision
International Conference on Learning Representations (ICLR), 2023
Jiani Huang
Ziyang Li
Mayur Naik
Ser-Nam Lim
667
9
0
15 Apr 2023
Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting
Computer Vision and Pattern Recognition (CVPR), 2023
Syed Talal Wasim
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
M. Shah
VLM
VPVLM
225
110
0
06 Apr 2023
Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Computer Vision and Pattern Recognition (CVPR), 2023
Yiwu Zhong
Licheng Yu
Yang Bai
Shangwen Li
Xueting Yan
Yin Li
AI4TS
236
46
0
31 Mar 2023
What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
Computer Vision and Pattern Recognition (CVPR), 2023
Brian Chen
Nina Shvetsova
Andrew Rouditchenko
D. Kondermann
Samuel Thomas
Shih-Fu Chang
Rogerio Feris
James R. Glass
Hilde Kuehne
351
9
0
29 Mar 2023
Aligning Step-by-Step Instructional Diagrams to Video Demonstrations
Computer Vision and Pattern Recognition (CVPR), 2023
Jiahao Zhang
A. Cherian
Yanbin Liu
Yizhak Ben-Shabat
Cristian Rodriguez-Opazo
Stephen Gould
224
11
0
24 Mar 2023
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Sixun Dong
Huazhang Hu
Dongze Lian
Weixin Luo
Yichen Qian
Shenghua Gao
ViT
AI4TS
274
18
0
22 Mar 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Computer Vision and Pattern Recognition (CVPR), 2023
Antoine Yang
Arsha Nagrani
Paul Hongsuck Seo
Antoine Miech
Jordi Pont-Tuset
Ivan Laptev
Josef Sivic
Cordelia Schmid
AI4TS
VLM
497
325
0
27 Feb 2023
What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Kumar Ashutosh
Rohit Girdhar
Lorenzo Torresani
Kristen Grauman
357
4
0
05 Jan 2023
Test of Time: Instilling Video-Language Models with a Sense of Time
Computer Vision and Pattern Recognition (CVPR), 2023
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
463
47
0
05 Jan 2023
Learning Video Representations from Large Language Models
Computer Vision and Pattern Recognition (CVPR), 2022
Yue Zhao
Ishan Misra
Philipp Krahenbuhl
Rohit Girdhar
VLM
AI4TS
306
229
0
08 Dec 2022
Temporal Action Segmentation: An Analysis of Modern Techniques
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Guodong Ding
Fadime Sener
Angela Yao
602
116
0
19 Oct 2022
Turbo Training with Token Dropout
British Machine Vision Conference (BMVC), 2022
Tengda Han
Weidi Xie
Andrew Zisserman
ViT
211
14
0
10 Oct 2022
Multimodal Learning with Transformers: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Peng Xu
Xiatian Zhu
David Clifton
ViT
529
836
0
13 Jun 2022
A CLIP-Hitchhiker's Guide to Long Video Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
CLIP
418
73
0
17 May 2022
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
362
459
0
08 Dec 2021
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
Mathilde Caron
Ishan Misra
Julien Mairal
Priya Goyal
Piotr Bojanowski
Armand Joulin
OCL
SSL
1.2K
4,653
0
17 Jun 2020
Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation
Computer Vision and Pattern Recognition (CVPR), 2020
Min-Hung Chen
Baopu Li
Sid Ying-Ze Bao
G. Al-Regib
Z. Kira
TTA
461
142
0
05 Mar 2020
Previous
1
2