ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.09013
  4. Cited By
Self-Supervised Audio-Visual Co-Segmentation

Self-Supervised Audio-Visual Co-Segmentation

18 April 2019
Andrew Rouditchenko
Hang Zhao
Chuang Gan
Josh H. McDermott
Antonio Torralba
    VLMSSL
ArXiv (abs)PDFHTML

Papers citing "Self-Supervised Audio-Visual Co-Segmentation"

18 / 68 papers shown
Title
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating
  Source Separation
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation
Hang Zhou
Xudong Xu
Dahua Lin
Xiaogang Wang
Ziwei Liu
DiffM
80
84
0
20 Jul 2020
Generating Visually Aligned Sound from Videos
Generating Visually Aligned Sound from Videos
Peihao Chen
Yang Zhang
Mingkui Tan
Hongdong Xiao
Deng Huang
Chuang Gan
VGen
114
97
0
14 Jul 2020
Multiple Sound Sources Localization from Coarse to Fine
Multiple Sound Sources Localization from Coarse to Fine
Rui Qian
Di Hu
Heinrich Dinkel
Mengyue Wu
N. Xu
Weiyao Lin
69
157
0
13 Jul 2020
Do We Need Sound for Sound Source Localization?
Do We Need Sound for Sound Source Localization?
Takashi Oya
Shohei Iwase
Ryota Natsume
Takahiro Itazuri
Shugo Yamaguchi
Shigeo Morishima
41
22
0
11 Jul 2020
Labelling unlabelled videos from scratch with multi-modal
  self-supervision
Labelling unlabelled videos from scratch with multi-modal self-supervision
Yuki M. Asano
Mandela Patrick
Christian Rupprecht
Andrea Vedaldi
SSL
122
133
0
24 Jun 2020
AVLnet: Learning Audio-Visual Language Representations from
  Instructional Videos
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Andrew Rouditchenko
Angie Boggust
David Harwath
Brian Chen
D. Joshi
...
Rogerio Feris
Brian Kingsbury
M. Picheny
Antonio Torralba
James R. Glass
SSL
88
142
0
16 Jun 2020
Telling Left from Right: Learning Spatial Correspondence of Sight and
  Sound
Telling Left from Right: Learning Spatial Correspondence of Sight and Sound
Karren D. Yang
Bryan C. Russell
Justin Salamon
SSL
103
76
0
11 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter
  Network
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Lingyu Zhu
Esa Rahtu
103
23
0
04 Jun 2020
Music Gesture for Visual Sound Separation
Music Gesture for Visual Sound Separation
Chuang Gan
Deng Huang
Hang Zhao
J. Tenenbaum
Antonio Torralba
97
205
0
20 Apr 2020
The Unreasonable Effectiveness of Deep Learning in Artificial
  Intelligence
The Unreasonable Effectiveness of Deep Learning in Artificial Intelligence
T. Sejnowski
60
298
0
12 Feb 2020
Deep Audio-Visual Learning: A Survey
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
Ran He
75
160
0
14 Jan 2020
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation
Chuang Gan
Yiwei Zhang
Jiajun Wu
Boqing Gong
J. Tenenbaum
82
139
0
25 Dec 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Guohao Li
Du Tran
SSL
160
433
0
28 Nov 2019
DEPA: Self-Supervised Audio Embedding for Depression Detection
DEPA: Self-Supervised Audio Embedding for Depression Detection
Pingyue Zhang
Mengyue Wu
Heinrich Dinkel
Kai Yu
67
57
0
29 Oct 2019
Self-supervised Moving Vehicle Tracking with Stereo Sound
Self-supervised Moving Vehicle Tracking with Stereo Sound
Chuang Gan
Hang Zhao
Peihao Chen
David D. Cox
Antonio Torralba
60
147
0
25 Oct 2019
Deep Bayesian Unsupervised Source Separation Based on a Complex Gaussian
  Mixture Model
Deep Bayesian Unsupervised Source Separation Based on a Complex Gaussian Mixture Model
Yoshiaki Bando
Y. Sasaki
Kazuyoshi Yoshii
BDL
52
9
0
29 Aug 2019
Cascade Attention Guided Residue Learning GAN for Cross-Modal
  Translation
Cascade Attention Guided Residue Learning GAN for Cross-Modal Translation
Bin Duan
Wei Wang
Hao Tang
Hugo Latapie
Yan Yan
117
35
0
03 Jul 2019
The Sound of Motions
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
88
254
0
11 Apr 2019
Previous
12