Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1905.00078
Cited By
v1
v2 (latest)
Deep Learning for Audio Signal Processing
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2019
30 April 2019
Hendrik Purwins
Yue Liu
Maria Sandsten
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Learning for Audio Signal Processing"
50 / 115 papers shown
Title
MF-GCN: A Multi-Frequency Graph Convolutional Network for Tri-Modal Depression Detection Using Eye-Tracking, Facial, and Acoustic Features
Sejuti Rahman
Swakshar Deb
MD. Sameer Iqbal Chowdhury
MD. Jubair Ahmed Sourov
Mohammad Shamsuddin
80
0
0
19 Nov 2025
AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation
Yulin Sun
Qisheng Xu
Yi Su
Qian Zhu
Yong Dou
Xinwang Liu
Kele Xu
95
0
0
21 Aug 2025
CoughViT: A Self-Supervised Vision Transformer for Cough Audio Representation Learning
Justin Luong
Hao Xue
Flora D. Salim
ViT
116
0
0
04 Aug 2025
Improving Deep Learning-based Respiratory Sound Analysis with Frequency Selection and Attention Mechanism
Nouhaila Fraihi
Ouassim Karrakchou
Mounir Ghogho
175
0
0
26 Jul 2025
Local Equivariance Error-Based Metrics for Evaluating Sampling-Frequency-Independent Property of Neural Network
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
130
0
0
04 Jun 2025
Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers
Yuzhu Wang
Archontis Politis
Konstantinos Drossos
Maria Sandsten
156
1
0
22 May 2025
Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models
Riccardo Passoni
Francesca Ronchini
Luca Comanducci
Romain Serizel
Fabio Antonacci
DiffM
385
1
0
12 May 2025
MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices
Sijia Li
Young D. Kwon
Lik-Hang Lee
Pan Hui
255
0
0
31 Mar 2025
A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning
ACM Computing Surveys (ACM CSUR), 2024
Luis Vilaca
Yi Yu
Paula Vinan
442
2
0
24 Nov 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
Applied Soft Computing (Appl. Soft Comput.), 2024
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
211
3
0
24 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Chen Zhang
Conghui Tan
Rongzhong Lian
MoMe
269
1
0
21 Oct 2024
Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks
Amirmohammad Mohammadi
Irené Masabarakiza
Ethan Barnes
Davelle Carreiro
A. V. Dine
Joshua Peeples
134
1
0
20 Sep 2024
Energy Consumption Trends in Sound Event Detection Systems
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Constance Douwes
Romain Serizel
281
1
0
13 Sep 2024
Enhancing Human Action Recognition and Violence Detection Through Deep Learning Audiovisual Fusion
Pooya Janani
Amirabolfazl Suratgar
Afshin Taghvaeipour
143
6
0
04 Aug 2024
Integrating IP Broadcasting with Audio Tags: Workflow and Challenges
Rhys Burchett-Vass
Arshdeep Singh
Gabriel Bibbó
Mark D. Plumbley
226
0
0
22 Jul 2024
Graph in Graph Neural Network
Jiongshu Wang
Jing Yang
Jiankang Deng
Hatice Gunes
Siyang Song
GNN
192
2
0
30 Jun 2024
Characterizing Continual Learning Scenarios and Strategies for Audio Analysis
Ruchi Bhatt
Pratibha Kumari
Dwarikanath Mahapatra
Abdulmotaleb El Saddik
Mukesh Saini
CLL
316
5
0
29 Jun 2024
Multi-Epoch learning with Data Augmentation for Deep Click-Through Rate Prediction
Zhongxiang Fan
Zhaocheng Liu
Jian Liang
Dongying Kong
Han Li
Peng Jiang
Shuang Li
Kun Gai
223
1
0
27 Jun 2024
The Impact of Feature Representation on the Accuracy of Photonic Neural Networks
Mauricio Gomes de Queiroz
Paul Jiménez
Raphael Cardoso
Mateus Vidaletti da Costa
Mohab Abdalla
Ian O'Connor
A. Bosio
Fabio Pavanello
172
1
0
26 Jun 2024
A Survey of Deep Learning Audio Generation Methods
Matej Bozic
Marko Horvat
VLM
MedIm
267
8
0
31 May 2024
V
k
D
:
V_kD:
V
k
D
:
Improving Knowledge Distillation using Orthogonal Projections
Computer Vision and Pattern Recognition (CVPR), 2024
Roy Miles
Ismail Elezi
Jiankang Deng
274
20
0
10 Mar 2024
Cascaded Cross-Modal Transformer for Audio-Textual Classification
Artificial Intelligence Review (Artif Intell Rev), 2024
Nicolae-Cătălin Ristea
Andrei Anghel
Radu Tudor Ionescu
226
2
0
15 Jan 2024
Behavioural Cloning in VizDoom
Ryan Spick
Timothy Bradley
Ayush Raina
P. Amadori
Guy Moss
LM&Ro
147
2
0
08 Jan 2024
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios
Yuzhu Wang
Archontis Politis
Maria Sandsten
129
8
0
17 Dec 2023
BarraCUDA: GPUs do Leak DNN Weights
Péter Horváth
Lukasz Chmielewski
Léo Weissbart
L. Batina
Y. Yarom
249
0
0
12 Dec 2023
LifeLearner: Hardware-Aware Meta Continual Learning System for Embedded Computing Platforms
Young D. Kwon
Jagmohan Chauhan
Hong Jia
Stylianos I. Venieris
Cecilia Mascolo
198
20
0
19 Nov 2023
Multi-View Spectrogram Transformer for Respiratory Sound Classification
Wentao He
Yuchen Yan
Jianfeng Ren
Ruibin Bai
Xudong Jiang
MedIm
ViT
246
15
0
16 Nov 2023
TACNET: Temporal Audio Source Counting Network
Amirreza Ahmadnejad
Ahmad Mahmmodian Darviishani
Mohmmad Mehrdad Asadi
Sajjad Saffariyeh
Pedram Yousef
Emad Fatemizadeh
149
3
0
04 Nov 2023
GIST: Generated Inputs Sets Transferability in Deep Learning
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Florian Tambon
Foutse Khomh
G. Antoniol
AAML
392
1
0
01 Nov 2023
BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable Basis
Neural Information Processing Systems (NeurIPS), 2023
Zelin Ni
Hang Yu
Shizhan Liu
Jianguo Li
Weiyao Lin
AI4TS
218
64
0
31 Oct 2023
Single channel speech enhancement by colored spectrograms
Sania Gul
Muhammad Salman Khan
Muhammad Fazeel
81
2
0
26 Oct 2023
FOLEY-VAE: Generación de efectos de audio para cine con inteligencia artificial
Mateo Cámara
José-Luis Blanco
VGen
123
1
0
24 Oct 2023
Object Size-Driven Design of Convolutional Neural Networks: Virtual Axle Detection based on Raw Data
Engineering applications of artificial intelligence (Eng. Appl. Artif. Intell.), 2023
Henik Riedel
Robert Steven Lorenzen
Clemens Hubler
201
3
0
04 Sep 2023
Homological Convolutional Neural Networks
Antonio Briola
Yuanrong Wang
Silvia Bartolucci
T. Aste
LMTD
219
7
0
26 Aug 2023
Sparks of Large Audio Models: A Survey and Outlook
S. Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Yi Ren
...
Wenwu Wang
Xulong Zhang
Roberto Togneri
Xiaoshi Zhong
Björn W. Schuller
LM&MA
AuLLM
577
51
0
24 Aug 2023
Efficient Monaural Speech Enhancement using Spectrum Attention Fusion
Jinyu Long
Jetic Gū
Binhao Bai
Zhibo Yang
Pingsun Wei
Junli Li
155
0
0
04 Aug 2023
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
J. Barnett
231
47
0
07 Jul 2023
Reasoning over the Air: A Reasoning-based Implicit Semantic-Aware Communication Framework
IEEE Transactions on Wireless Communications (IEEE TWC), 2023
Yong Xiao
Yiwei Liao
Yingyu Li
Guangming Shi
H. Vincent Poor
Walid Saad
Merouane Debbah
M. Bennis
222
21
0
20 Jun 2023
SNeL: A Structured Neuro-Symbolic Language for Entity-Based Multimodal Scene Understanding
Silvan Ferreira
Allan Martins
Ivanovitch Silva
158
1
0
09 Jun 2023
ElectrodeNet -- A Deep Learning Based Sound Coding Strategy for Cochlear Implants
IEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2023
Enoch Hsin-Ho Huang
Rong-Yu Chao
Yu Tsao
Chao-Min Wu
193
11
0
26 May 2023
Robust and lightweight audio fingerprint for Automatic Content Recognition
Anoubhav Agarwaal
Prabhat Kanaujia
Sartaki Sinha Roy
Susmita Ghose
128
5
0
16 May 2023
Compressing audio CNNs with graph centrality based filter pruning
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
James A. King
Ashutosh Kumar Singh
Mark D. Plumbley
GNN
122
2
0
05 May 2023
Automatic breach detection during spine pedicle drilling based on vibroacoustic sensing
Aidana Massalimova
Maikel Timmermans
N. Cavalcanti
Daniel Suter
Matthias Seibold
...
C. Laux
R. Sutter
Mazda Farshad
Kathleen Denis
Philipp Fürnstahl
95
9
0
27 Mar 2023
On Neural Architectures for Deep Learning-based Source Separation of Co-Channel OFDM Signals
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Gary C. F. Lee
Amir Weiss
A. Lancho
Yury Polyanskiy
G. Wornell
AI4TS
244
7
0
11 Mar 2023
Explainable AI for Time Series via Virtual Inspection Layers
Pattern Recognition (Pattern Recogn.), 2023
Johanna Vielhaben
Sebastian Lapuschkin
G. Montavon
Wojciech Samek
XAI
AI4TS
227
41
0
11 Mar 2023
A Light Weight Model for Active Speaker Detection
Computer Vision and Pattern Recognition (CVPR), 2023
Junhua Liao
Haihan Duan
Kanghui Feng
Wanbing Zhao
Yanbing Yang
Liangyin Chen
200
61
0
08 Mar 2023
Hypernetworks build Implicit Neural Representations of Sounds
Filip Szatkowski
Karol J. Piczak
Przemtslaw Spurek
Jacek Tabor
Tomasz Trzciñski
448
15
0
09 Feb 2023
Efficient Domain Adaptation for Speech Foundation Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yue Liu
DongSeon Hwang
Zhouyuan Huo
Junwen Bai
Guru Prakash
...
K. Sim
Yu Zhang
Wei Han
Trevor Strohman
F. Beaufays
AI4CE
248
30
0
03 Feb 2023
Synthetic data generation method for data-free knowledge distillation in regression neural networks
Expert systems with applications (ESWA), 2023
Tianxun Zhou
K. Chiam
233
10
0
11 Jan 2023
ExploreADV: Towards exploratory attack for Neural Networks
Tianzuo Luo
Yuyi Zhong
S. Khoo
AAML
185
1
0
01 Jan 2023
1
2
3
Next