ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.07702
  4. Cited By
Audio-Visual Contrastive Learning with Temporal Self-Supervision

Audio-Visual Contrastive Learning with Temporal Self-Supervision

15 February 2023
Simon Jenni
Alexander Black
John Collomosse
    SSL
ArXivPDFHTML

Papers citing "Audio-Visual Contrastive Learning with Temporal Self-Supervision"

10 / 10 papers shown
Title
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
45
0
0
12 Sep 2024
Sequential Contrastive Audio-Visual Learning
Sequential Contrastive Audio-Visual Learning
Ioannis Tsiamas
Santiago Pascual
Chunghsin Yeh
Joan Serra
26
2
0
08 Jul 2024
Siamese Vision Transformers are Scalable Audio-visual Learners
Siamese Vision Transformers are Scalable Audio-visual Learners
Yan-Bo Lin
Gedas Bertasius
30
5
0
28 Mar 2024
VADER: Video Alignment Differencing and Retrieval
VADER: Video Alignment Differencing and Retrieval
Alexander Black
Simon Jenni
Tu Bui
Md. Mehrab Tanjim
Stefano Petrangeli
Ritwik Sinha
Viswanathan Swaminathan
John Collomosse
13
2
0
23 Mar 2023
Time-Equivariant Contrastive Video Representation Learning
Time-Equivariant Contrastive Video Representation Learning
Simon Jenni
Hailin Jin
SSL
AI4TS
118
58
0
07 Dec 2021
VPN: Video Provenance Network for Robust Content Attribution
VPN: Video Provenance Network for Robust Content Attribution
Alexander Black
Tu Bui
Simon Jenni
Vishy Swaminathan
John Collomosse
23
9
0
21 Sep 2021
With a Little Help from My Friends: Nearest-Neighbor Contrastive
  Learning of Visual Representations
With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations
Debidatta Dwibedi
Y. Aytar
Jonathan Tompson
P. Sermanet
Andrew Zisserman
SSL
183
450
0
29 Apr 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw
  Video, Audio and Text
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
231
573
0
22 Apr 2021
Video Representation Learning by Recognizing Temporal Transformations
Video Representation Learning by Recognizing Temporal Transformations
Simon Jenni
Givi Meishvili
Paolo Favaro
117
133
0
21 Jul 2020
Self-Supervised Feature Learning by Learning to Spot Artifacts
Self-Supervised Feature Learning by Learning to Spot Artifacts
Simon Jenni
Paolo Favaro
SSL
135
127
0
13 Jun 2018
1