ESResNet: Environmental Sound Classification Based on Visual Domain Models

International Conference on Pattern Recognition (ICPR), 2020
15 April 2020
A. Guzhov
Federico Raue
Jörn Hees
Andreas Dengel
    VLM

Papers citing "ESResNet: Environmental Sound Classification Based on Visual Domain Models"

38 / 38 papers shown
Spike Encoding for Environmental Sound: A Comparative Benchmark
Andres Larroza
Javier Naranjo-Alcazar
Vicent Ortiz Castelló
M. Cobos
P. Zuccarello
437
0
0
14 Mar 2025
Transfer Learning in Vocal Education: Technical Evaluation of Limited Samples Describing Mezzo-soprano
Zhenyi Hou
Xu Zhao
Kejie Ye
Xinyu Sheng
Shanggerile Jiang
...
Jiaxing Chen
Yan Zou
Yuchao Feng
Guangyu Fan
Xin Yuan
DiffM
199
2
0
30 Oct 2024
OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities
Hanane Azzag
M. Lebbah
ObjD
380
3
0
17 Sep 2024
Sound Tagging in Infant-centric Home Soundscapes
Mohammad Nur Hossain Khan
Jialu Li
Nancy L. McElwain
M. Hasegawa-Johnson
Bashima Islam
205
2
0
25 Jun 2024
Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion
Kiran Chhatre
Radek Daněček
Nikos Athanasiou
Giorgio Becherini
Christopher Peters
Michael J. Black
Timo Bolkart
552
45
0
07 Dec 2023
Spectro-ViT: A Vision Transformer Model for GABA-edited MRS Reconstruction Using Spectrograms
G. Dias
R. Berto
Mateus Oliveira
Lucas Ueda
S. Dertkigil
Paula D. P. Costa
Amirmohammad Shamaei
Roberto Souza
Ashley D. Harris
Letícia Rittner
196
0
0
26 Nov 2023
Formal Verification of Long Short-Term Memory based Audio Classifiers: A Star based Approach
Neelanjana Pal
Taylor T. Johnson
182
0
0
16 Nov 2023
AudRandAug: Random Image Augmentations for Audio Classification
Teerath Kumar
Muhammad Turab
Alessandra Mileo
Malika Bendechache
Takfarinas Saber
235
7
0
09 Sep 2023
Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
J. Wilkins
Justin Salamon
Magdalena Fuentes
J. P. Bello
Oriol Nieto
CLIP
158
7
0
17 Aug 2023
Accommodating Audio Modality in CLIP for Multimodal Processing
AAAI Conference on Artificial Intelligence (AAAI), 2023
Ludan Ruan
Anwen Hu
Yuqing Song
Liang Zhang
S. Zheng
Qin Jin
VLM
212
18
0
12 Mar 2023
Audiovisual Masked Autoencoders
IEEE International Conference on Computer Vision (ICCV), 2022
Mariana-Iuliana Georgescu
Eduardo Fonseca
Radu Tudor Ionescu
Mario Lucic
Cordelia Schmid
Anurag Arnab
SSL
368
58
0
09 Dec 2022
Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block
International Conference on Intelligent Computing (ICIC), 2022
Yunhao Chen
Yunjie Zhu
Zihui Yan
Yifan Huang
Zhen Ren
Jianlu Shen
Lifang Chen
417
11
0
05 Nov 2022
Language-based Audio Retrieval Task in DCASE 2022 Challenge
Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2022
Huang Xie
Samuel Lipping
Maria Sandsten
240
19
0
20 Sep 2022
SampleMatch: Drum Sample Retrieval by Musical Context
International Society for Music Information Retrieval Conference (ISMIR), 2022
Stefan Lattner
186
12
0
01 Aug 2022
GAFX: A General Audio Feature eXtractor
Zhaoyang Bu
Han Zhang
Xiaohu Zhu
157
1
0
19 Jul 2022
Feature Pyramid Attention based Residual Neural Network for Environmental Sound Classification
Liguang Zhou
Yuhongze Zhou
Xiaonan Qi
Junjie Hu
Tin Lun Lam
Yangsheng Xu
247
6
0
28 May 2022
Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification
Yikang Wang
Hiromitsu Nishizaki
288
1
0
30 Mar 2022
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning
Chen Chen
Nana Hou
Yuchen Hu
Heqing Zou
Xiaofeng Qi
Chng Eng Siong
VLM
227
24
0
29 Mar 2022
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Yuan Gong
Sameer Khurana
Andrew Rouditchenko
James R. Glass
VLM
223
33
0
13 Mar 2022
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Daniel C. Tompkins
Kshitiz Kumar
Jian Wu
290
6
0
07 Feb 2022
Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Yanpeng Zhao
Jack Hessel
Youngjae Yu
Ximing Lu
Rowan Zellers
Yejin Choi
396
30
0
16 Dec 2021
NeuroView: Explainable Deep Network Decision Making
C. Barberan
Randall Balestriero
Richard G. Baraniuk
FAtt
158
2
0
15 Oct 2021
Cross-domain Semi-Supervised Audio Event Classification Using Contrastive Regularization
Donmoon Lee
Kyogu Lee
218
3
0
29 Sep 2021
AudioCLIP: Extending CLIP to Image, Text and Audio
A. Guzhov
Federico Raue
Jörn Hees
Andreas Dengel
CLIP
VLM
664
515
0
24 Jun 2021
ERANNs: Efficient Residual Audio Neural Networks for Audio Pattern Recognition
Pattern Recognition Letters (PR), 2021
S. Verbitskiy
Vladimir Berikov
Viacheslav Vyshegorodtsev
365
90
0
03 Jun 2021
Unsupervised Discriminative Learning of Sounds for Audio Event Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Sascha Hornauer
Ke Li
Stella X. Yu
Shabnam Ghaffarzadegan
Liu Ren
SSL
163
5
0
19 May 2021
ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio
IEEE International Joint Conference on Neural Networks (IJCNN), 2021
A. Guzhov
Federico Raue
Jörn Hees
Andreas Dengel
219
41
0
23 Apr 2021
Detection of Audio-Video Synchronization Errors Via Event Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Joshua Peter Ebenezer
Yongjun Wu
Hai Wei
S. Sethuraman
Z. Liu
209
15
0
20 Apr 2021
AST: Audio Spectrogram Transformer
Interspeech (Interspeech), 2021
Yuan Gong
Yu-An Chung
James R. Glass
ViT
742
1,245
0
05 Apr 2021
Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices
Pattern Recognition (Pattern Recogn.), 2021
Md Mohaimenuzzaman
Christoph Bergmeir
I. West
B. Meyer
429
61
0
05 Mar 2021
Multi-view Audio and Music Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Huy P Phan
Huy Le Nguyen
Oliver Y. Chén
L. D. Pham
P. Koch
Ian Mcloughlin
Alfred Mertins
169
18
0
03 Mar 2021
SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification
Alireza Nasiri
Jianjun Hu
149
21
0
02 Mar 2021
Comparison of semi-supervised deep learning algorithms for audio classification
EURASIP Journal on Audio, Speech, and Music Processing (EURASIP J. Audio Speech Music Process), 2021
Léo Cances
Etienne Labbé
Thomas Pellegrini
198
22
0
16 Feb 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Yuan Gong
Yu-An Chung
James R. Glass
VLM
454
176
0
02 Feb 2021
Urban Sound Classification : striving towards a fair comparison
Augustin Arnault
Baptiste Hanssens
Nicolas Riche
158
12
0
22 Oct 2020
CLAR: Contrastive Learning of Auditory Representations
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Haider Al-Tahan
Y. Mohsenzadeh
SSL
449
67
0
19 Oct 2020
Rethinking CNN Models for Audio Classification
Kamalesh Palanisamy
Dipika Singhania
Angela Yao
SSL
248
170
0
22 Jul 2020
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition
Anurag Kumar
V. Ithapu
272
35
0
30 Jun 2020