ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.05894
  4. Cited By
Coincidence, Categorization, and Consolidation: Learning to Recognize
  Sounds with Minimal Supervision

Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision

14 November 2019
A. Jansen
D. Ellis
Shawn Hershey
R. C. Moore
Manoj Plakal
Ashok Popat
Rif A. Saurous
    SSL
ArXiv (abs)PDFHTML

Papers citing "Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision"

24 / 24 papers shown
Title
A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning
Luis Vilaca
Yi Yu
Paula Vinan
183
0
0
24 Nov 2024
Identifying Spatio-Temporal Drivers of Extreme Events
Identifying Spatio-Temporal Drivers of Extreme Events
Mohamad Hakam Shams Eddin
Juergen Gall
AI4TS
105
0
0
31 Oct 2024
Image and Video Tokenization with Binary Spherical Quantization
Image and Video Tokenization with Binary Spherical Quantization
Yue Zhao
Yuanjun Xiong
Philipp Krahenbuhl
94
24
0
11 Jun 2024
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
126
325
0
09 Oct 2023
Human Activity Recognition Using Self-Supervised Representations of
  Wearable Data
Human Activity Recognition Using Self-Supervised Representations of Wearable Data
Maximilien Burq
Niranjan Sridhar
91
0
0
26 Apr 2023
Egocentric Auditory Attention Localization in Conversations
Egocentric Auditory Attention Localization in Conversations
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
65
16
0
28 Mar 2023
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated
  Open-Domain On-Screen Sound Separation
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation
Efthymios Tzinis
Scott Wisdom
Tal Remez
J. Hershey
111
30
0
20 Jul 2022
Multimodal Conversational AI: A Survey of Datasets and Approaches
Multimodal Conversational AI: A Survey of Datasets and Approaches
Anirudh S. Sundar
Larry Heck
100
30
0
13 May 2022
A Study on Robustness to Perturbations for Representations of
  Environmental Sound
A Study on Robustness to Perturbations for Representations of Environmental Sound
Sangeeta Srivastava
Ho-Hsiang Wu
Joao Rulff
Magdalena Fuentes
M. Cartwright
Claudio Silva
Anish Arora
J. P. Bello
62
5
0
20 Mar 2022
Audio Self-supervised Learning: A Survey
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
97
109
0
02 Mar 2022
Human Activity Recognition on wrist-worn accelerometers using
  self-supervised neural networks
Human Activity Recognition on wrist-worn accelerometers using self-supervised neural networks
Niranjan Sridhar
L. Myers
23
1
0
22 Dec 2021
Expedition: A System for the Unsupervised Learning of a Hierarchy of
  Concepts
Expedition: A System for the Unsupervised Learning of a Hierarchy of Concepts
Omid Madani
SSL
34
1
0
17 Dec 2021
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Sangeeta Srivastava
Yun Wang
Andros Tjandra
Anurag Kumar
Chunxi Liu
Kritika Singh
Yatharth Saraf
SSL
99
25
0
14 Oct 2021
Universal Paralinguistic Speech Representations Using Self-Supervised
  Conformers
Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
Joel Shor
A. Jansen
Wei Han
Daniel S. Park
Yu Zhang
SSLAI4TS
129
59
0
09 Oct 2021
Attention Bottlenecks for Multimodal Fusion
Attention Bottlenecks for Multimodal Fusion
Arsha Nagrani
Shan Yang
Anurag Arnab
A. Jansen
Cordelia Schmid
Chen Sun
111
574
0
30 Jun 2021
Improving On-Screen Sound Separation for Open-Domain Videos with
  Audio-Visual Self-Attention
Improving On-Screen Sound Separation for Open-Domain Videos with Audio-Visual Self-Attention
Efthymios Tzinis
Scott Wisdom
Tal Remez
J. Hershey
VLM
81
8
0
17 Jun 2021
Semi-Supervised Audio Representation Learning for Modeling Beehive
  Strengths
Semi-Supervised Audio Representation Learning for Modeling Beehive Strengths
Tony Zhang
Szymon Zmyslony
Sergei Nozdrenkov
Matthew Smith
Brandon Hopkins
SSL
20
15
0
21 May 2021
Self-Supervised Learning from Automatically Separated Sound Scenes
Self-Supervised Learning from Automatically Separated Sound Scenes
Eduardo Fonseca
A. Jansen
D. Ellis
Scott Wisdom
Marco Tagliasacchi
J. Hershey
Manoj Plakal
Shawn Hershey
R. C. Moore
Xavier Serra
SSL
81
13
0
05 May 2021
Multimodal Self-Supervised Learning of General Audio Representations
Multimodal Self-Supervised Learning of General Audio Representations
Luyu Wang
Pauline Luc
Adrià Recasens
Jean-Baptiste Alayrac
Aaron van den Oord
SSL
137
41
0
26 Apr 2021
Broaden Your Views for Self-Supervised Video Learning
Broaden Your Views for Self-Supervised Video Learning
Adrià Recasens
Pauline Luc
Jean-Baptiste Alayrac
Luyu Wang
Ross Hemsley
...
Florent Altché
M. Valko
Jean-Bastien Grill
Aaron van den Oord
Andrew Zisserman
SSLAI4TS
131
128
0
30 Mar 2021
Multi-Format Contrastive Learning of Audio Representations
Multi-Format Contrastive Learning of Audio Representations
Luyu Wang
Aaron van den Oord
95
59
0
11 Mar 2021
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of
  On-Screen Sounds
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds
Efthymios Tzinis
Scott Wisdom
A. Jansen
Shawn Hershey
Tal Remez
D. Ellis
J. Hershey
81
71
0
02 Nov 2020
A Framework for Generative and Contrastive Learning of Audio
  Representations
A Framework for Generative and Contrastive Learning of Audio Representations
Prateek Verma
J. Smith
SSL
59
18
0
22 Oct 2020
Self-Supervised MultiModal Versatile Networks
Self-Supervised MultiModal Versatile Networks
Jean-Baptiste Alayrac
Adrià Recasens
R. Schneider
Relja Arandjelović
Jason Ramapuram
J. Fauw
Lucas Smaira
Sander Dieleman
Andrew Zisserman
SSL
175
375
0
29 Jun 2020
1