ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02968
  4. Cited By
Temporal Alignment Networks for Long-term Video

Temporal Alignment Networks for Long-term Video

Computer Vision and Pattern Recognition (CVPR), 2022
6 April 2022
Tengda Han
Weidi Xie
Andrew Zisserman
    AI4TS
ArXiv (abs)PDFHTML

Papers citing "Temporal Alignment Networks for Long-term Video"

23 / 73 papers shown
Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal
  Retrieval
Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal RetrievalIEEE Transactions on Image Processing (IEEE TIP), 2023
Yabing Wang
Shuhui Wang
Hao Luo
Jianfeng Dong
F. Wang
Meng Han
Xun Wang
Meng Wang
207
13
0
11 Sep 2023
Opening the Vocabulary of Egocentric Actions
Opening the Vocabulary of Egocentric ActionsNeural Information Processing Systems (NeurIPS), 2023
Dibyadip Chatterjee
Fadime Sener
Shugao Ma
Angela Yao
VLM
303
23
0
22 Aug 2023
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language
  Understanding
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language UnderstandingNeural Information Processing Systems (NeurIPS), 2023
K. Mangalam
Raiymbek Akshulakov
Jitendra Malik
399
495
0
17 Aug 2023
Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Video-Mined Task Graphs for Keystep Recognition in Instructional VideosNeural Information Processing Systems (NeurIPS), 2023
Kumar Ashutosh
Santhosh Kumar Ramakrishnan
Triantafyllos Afouras
Kristen Grauman
299
37
0
17 Jul 2023
Learning to Ground Instructional Articles in Videos through Narrations
Learning to Ground Instructional Articles in Videos through NarrationsIEEE International Conference on Computer Vision (ICCV), 2023
E. Mavroudi
Triantafyllos Afouras
Lorenzo Torresani
DiffM
217
27
0
06 Jun 2023
StepFormer: Self-supervised Step Discovery and Localization in
  Instructional Videos
StepFormer: Self-supervised Step Discovery and Localization in Instructional VideosComputer Vision and Pattern Recognition (CVPR), 2023
Nikita Dvornik
Isma Hadji
Ran Zhang
Konstantinos G. Derpanis
Animesh Garg
Richard P. Wildes
Allan D. Jepson
192
38
0
26 Apr 2023
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak SupervisionInternational Conference on Learning Representations (ICLR), 2023
Jiani Huang
Ziyang Li
Mayur Naik
Ser-Nam Lim
667
9
0
15 Apr 2023
Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting
Vita-CLIP: Video and text adaptive CLIP via Multimodal PromptingComputer Vision and Pattern Recognition (CVPR), 2023
Syed Talal Wasim
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
M. Shah
VLMVPVLM
225
110
0
06 Apr 2023
Learning Procedure-aware Video Representation from Instructional Videos
  and Their Narrations
Learning Procedure-aware Video Representation from Instructional Videos and Their NarrationsComputer Vision and Pattern Recognition (CVPR), 2023
Yiwu Zhong
Licheng Yu
Yang Bai
Shangwen Li
Xueting Yan
Yin Li
AI4TS
236
46
0
31 Mar 2023
What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in
  Untrimmed Multi-Action Videos from Narrated Instructions
What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated InstructionsComputer Vision and Pattern Recognition (CVPR), 2023
Brian Chen
Nina Shvetsova
Andrew Rouditchenko
D. Kondermann
Samuel Thomas
Shih-Fu Chang
Rogerio Feris
James R. Glass
Hilde Kuehne
351
9
0
29 Mar 2023
Aligning Step-by-Step Instructional Diagrams to Video Demonstrations
Aligning Step-by-Step Instructional Diagrams to Video DemonstrationsComputer Vision and Pattern Recognition (CVPR), 2023
Jiahao Zhang
A. Cherian
Yanbin Liu
Yizhak Ben-Shabat
Cristian Rodriguez-Opazo
Stephen Gould
224
11
0
24 Mar 2023
Weakly Supervised Video Representation Learning with Unaligned Text for
  Sequential Videos
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential VideosComputer Vision and Pattern Recognition (CVPR), 2023
Sixun Dong
Huazhang Hu
Dongze Lian
Weixin Luo
Yichen Qian
Shenghua Gao
ViTAI4TS
274
18
0
22 Mar 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense
  Video Captioning
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video CaptioningComputer Vision and Pattern Recognition (CVPR), 2023
Antoine Yang
Arsha Nagrani
Paul Hongsuck Seo
Antoine Miech
Jordi Pont-Tuset
Ivan Laptev
Josef Sivic
Cordelia Schmid
AI4TSVLM
497
325
0
27 Feb 2023
What You Say Is What You Show: Visual Narration Detection in
  Instructional Videos
What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Kumar Ashutosh
Rohit Girdhar
Lorenzo Torresani
Kristen Grauman
357
4
0
05 Jan 2023
Test of Time: Instilling Video-Language Models with a Sense of Time
Test of Time: Instilling Video-Language Models with a Sense of TimeComputer Vision and Pattern Recognition (CVPR), 2023
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
463
47
0
05 Jan 2023
Learning Video Representations from Large Language Models
Learning Video Representations from Large Language ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Yue Zhao
Ishan Misra
Philipp Krahenbuhl
Rohit Girdhar
VLMAI4TS
306
229
0
08 Dec 2022
Temporal Action Segmentation: An Analysis of Modern Techniques
Temporal Action Segmentation: An Analysis of Modern TechniquesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Guodong Ding
Fadime Sener
Angela Yao
602
116
0
19 Oct 2022
Turbo Training with Token Dropout
Turbo Training with Token DropoutBritish Machine Vision Conference (BMVC), 2022
Tengda Han
Weidi Xie
Andrew Zisserman
ViT
211
14
0
10 Oct 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Peng Xu
Xiatian Zhu
David Clifton
ViT
529
836
0
13 Jun 2022
A CLIP-Hitchhiker's Guide to Long Video Retrieval
A CLIP-Hitchhiker's Guide to Long Video Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
CLIP
418
73
0
17 May 2022
Prompting Visual-Language Models for Efficient Video Understanding
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLMVLM
362
459
0
08 Dec 2021
Unsupervised Learning of Visual Features by Contrasting Cluster
  Assignments
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
Mathilde Caron
Ishan Misra
Julien Mairal
Priya Goyal
Piotr Bojanowski
Armand Joulin
OCLSSL
1.2K
4,653
0
17 Jun 2020
Action Segmentation with Joint Self-Supervised Temporal Domain
  Adaptation
Action Segmentation with Joint Self-Supervised Temporal Domain AdaptationComputer Vision and Pattern Recognition (CVPR), 2020
Min-Hung Chen
Baopu Li
Sid Ying-Ze Bao
G. Al-Regib
Z. Kira
TTA
461
142
0
05 Mar 2020
Previous
12