
Deep Learning for Audio Signal Processing

IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2019

30 April 2019

Papers citing "Deep Learning for Audio Signal Processing"

50 / 115 papers shown

MF-GCN: A Multi-Frequency Graph Convolutional Network for Tri-Modal Depression Detection Using Eye-Tracking, Facial, and Acoustic Features Sejuti Rahman Swakshar Deb MD. Sameer Iqbal Chowdhury MD. Jubair Ahmed Sourov Mohammad Shamsuddin 80 0 0 19 Nov 2025
AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation Yulin Sun Qisheng Xu Yi Su Qian Zhu Yong Dou Xinwang Liu Kele Xu 95 0 0 21 Aug 2025
CoughViT: A Self-Supervised Vision Transformer for Cough Audio Representation Learning Justin Luong Hao Xue Flora D. Salim ViT 116 0 0 04 Aug 2025
Improving Deep Learning-based Respiratory Sound Analysis with Frequency Selection and Attention Mechanism Nouhaila Fraihi Ouassim Karrakchou Mounir Ghogho 175 0 0 26 Jul 2025
Local Equivariance Error-Based Metrics for Evaluating Sampling-Frequency-Independent Property of Neural Network Kanami Imamura Tomohiko Nakamura Norihiro Takamune Kohei Yatabe Hiroshi Saruwatari 130 0 0 04 Jun 2025
Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers Yuzhu Wang Archontis Politis Konstantinos Drossos Maria Sandsten 156 1 0 22 May 2025
Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models Riccardo Passoni Francesca Ronchini Luca Comanducci Romain Serizel Fabio Antonacci DiffM 385 1 0 12 May 2025
MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices Sijia Li Young D. Kwon Lik-Hang Lee Pan Hui 255 0 0 31 Mar 2025
A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning ACM Computing Surveys (ACM CSUR), 2024 Luis Vilaca Yi Yu Paula Vinan 442 2 0 24 Nov 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques Applied Soft Computing (Appl. Soft Comput.), 2024 David Ortiz-Perez Manuel Benavent-Lledo José García Rodríguez David Tomás M. Flores Vizcaya-Moreno 211 3 0 24 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation Victor Junqiu Wei Weicheng Wang Chen Zhang Conghui Tan Rongzhong Lian MoMe 269 1 0 21 Oct 2024
Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks Amirmohammad Mohammadi Irené Masabarakiza Ethan Barnes Davelle Carreiro A. V. Dine Joshua Peeples 134 1 0 20 Sep 2024
Energy Consumption Trends in Sound Event Detection Systems IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 Constance Douwes Romain Serizel 281 1 0 13 Sep 2024
Enhancing Human Action Recognition and Violence Detection Through Deep Learning Audiovisual Fusion Pooya Janani Amirabolfazl Suratgar Afshin Taghvaeipour 143 6 0 04 Aug 2024
Integrating IP Broadcasting with Audio Tags: Workflow and Challenges Rhys Burchett-Vass Arshdeep Singh Gabriel Bibbó Mark D. Plumbley 226 0 0 22 Jul 2024
Graph in Graph Neural Network Jiongshu Wang Jing Yang Jiankang Deng Hatice Gunes Siyang Song GNN 192 2 0 30 Jun 2024
Characterizing Continual Learning Scenarios and Strategies for Audio Analysis Ruchi Bhatt Pratibha Kumari Dwarikanath Mahapatra Abdulmotaleb El Saddik Mukesh Saini CLL 316 5 0 29 Jun 2024
Multi-Epoch learning with Data Augmentation for Deep Click-Through Rate Prediction Zhongxiang Fan Zhaocheng Liu Jian Liang Dongying Kong Han Li Peng Jiang Shuang Li Kun Gai 223 1 0 27 Jun 2024
The Impact of Feature Representation on the Accuracy of Photonic Neural Networks Mauricio Gomes de Queiroz Paul Jiménez Raphael Cardoso Mateus Vidaletti da Costa Mohab Abdalla Ian O'Connor A. Bosio Fabio Pavanello 172 1 0 26 Jun 2024
A Survey of Deep Learning Audio Generation Methods Matej Bozic Marko Horvat VLM MedIm 267 8 0 31 May 2024
$V_kD:$ Improving Knowledge Distillation using Orthogonal Projections Computer Vision and Pattern Recognition (CVPR), 2024 Roy Miles Ismail Elezi Jiankang Deng 274 20 0 10 Mar 2024
Cascaded Cross-Modal Transformer for Audio-Textual Classification Artificial Intelligence Review (Artif Intell Rev), 2024 Nicolae-Cătălin Ristea Andrei Anghel Radu Tudor Ionescu 226 2 0 15 Jan 2024
Behavioural Cloning in VizDoom Ryan Spick Timothy Bradley Ayush Raina P. Amadori Guy Moss LM&Ro 147 2 0 08 Jan 2024
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios Yuzhu Wang Archontis Politis Maria Sandsten 129 8 0 17 Dec 2023
BarraCUDA: GPUs do Leak DNN Weights Péter Horváth Lukasz Chmielewski Léo Weissbart L. Batina Y. Yarom 249 0 0 12 Dec 2023
LifeLearner: Hardware-Aware Meta Continual Learning System for Embedded Computing Platforms Young D. Kwon Jagmohan Chauhan Hong Jia Stylianos I. Venieris Cecilia Mascolo 198 20 0 19 Nov 2023
Multi-View Spectrogram Transformer for Respiratory Sound Classification Wentao He Yuchen Yan Jianfeng Ren Ruibin Bai Xudong Jiang MedIm ViT 246 15 0 16 Nov 2023
TACNET: Temporal Audio Source Counting Network Amirreza Ahmadnejad Ahmad Mahmmodian Darviishani Mohmmad Mehrdad Asadi Sajjad Saffariyeh Pedram Yousef Emad Fatemizadeh 149 3 0 04 Nov 2023
GIST: Generated Inputs Sets Transferability in Deep Learning ACM Transactions on Software Engineering and Methodology (TOSEM), 2023 Florian Tambon Foutse Khomh G. Antoniol AAML 392 1 0 01 Nov 2023
BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable Basis Neural Information Processing Systems (NeurIPS), 2023 Zelin Ni Hang Yu Shizhan Liu Jianguo Li Weiyao Lin AI4TS 218 64 0 31 Oct 2023
Single channel speech enhancement by colored spectrograms Sania Gul Muhammad Salman Khan Muhammad Fazeel 81 2 0 26 Oct 2023
FOLEY-VAE: Generación de efectos de audio para cine con inteligencia artificial Mateo Cámara José-Luis Blanco VGen 123 1 0 24 Oct 2023
Object Size-Driven Design of Convolutional Neural Networks: Virtual Axle Detection based on Raw Data Engineering applications of artificial intelligence (Eng. Appl. Artif. Intell.), 2023 Henik Riedel Robert Steven Lorenzen Clemens Hubler 201 3 0 04 Sep 2023
Homological Convolutional Neural Networks Antonio Briola Yuanrong Wang Silvia Bartolucci T. Aste LMTD 219 7 0 26 Aug 2023
Sparks of Large Audio Models: A Survey and Outlook S. Latif Moazzam Shoukat Fahad Shamshad Muhammad Usama Yi Ren ... Wenwu Wang Xulong Zhang Roberto Togneri Xiaoshi Zhong Björn W. Schuller LM&MA AuLLM 577 51 0 24 Aug 2023
Efficient Monaural Speech Enhancement using Spectrum Attention Fusion Jinyu Long Jetic Gū Binhao Bai Zhibo Yang Pingsun Wei Junli Li 155 0 0 04 Aug 2023
The Ethical Implications of Generative Audio Models: A Systematic Literature Review AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023 J. Barnett 231 47 0 07 Jul 2023
Reasoning over the Air: A Reasoning-based Implicit Semantic-Aware Communication Framework IEEE Transactions on Wireless Communications (IEEE TWC), 2023 Yong Xiao Yiwei Liao Yingyu Li Guangming Shi H. Vincent Poor Walid Saad Merouane Debbah M. Bennis 222 21 0 20 Jun 2023
SNeL: A Structured Neuro-Symbolic Language for Entity-Based Multimodal Scene Understanding Silvan Ferreira Allan Martins Ivanovitch Silva 158 1 0 09 Jun 2023
ElectrodeNet -- A Deep Learning Based Sound Coding Strategy for Cochlear Implants IEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2023 Enoch Hsin-Ho Huang Rong-Yu Chao Yu Tsao Chao-Min Wu 193 11 0 26 May 2023
Robust and lightweight audio fingerprint for Automatic Content Recognition Anoubhav Agarwaal Prabhat Kanaujia Sartaki Sinha Roy Susmita Ghose 128 5 0 16 May 2023
Compressing audio CNNs with graph centrality based filter pruning IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023 James A. King Ashutosh Kumar Singh Mark D. Plumbley GNN 122 2 0 05 May 2023
Automatic breach detection during spine pedicle drilling based on vibroacoustic sensing Aidana Massalimova Maikel Timmermans N. Cavalcanti Daniel Suter Matthias Seibold ... C. Laux R. Sutter Mazda Farshad Kathleen Denis Philipp Fürnstahl 95 9 0 27 Mar 2023
On Neural Architectures for Deep Learning-based Source Separation of Co-Channel OFDM Signals IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Gary C. F. Lee Amir Weiss A. Lancho Yury Polyanskiy G. Wornell AI4TS 244 7 0 11 Mar 2023
Explainable AI for Time Series via Virtual Inspection Layers Pattern Recognition (Pattern Recogn.), 2023 Johanna Vielhaben Sebastian Lapuschkin G. Montavon Wojciech Samek XAI AI4TS 227 41 0 11 Mar 2023
A Light Weight Model for Active Speaker Detection Computer Vision and Pattern Recognition (CVPR), 2023 Junhua Liao Haihan Duan Kanghui Feng Wanbing Zhao Yanbing Yang Liangyin Chen 200 61 0 08 Mar 2023
Hypernetworks build Implicit Neural Representations of Sounds Filip Szatkowski Karol J. Piczak Przemysław Spurek Jacek Tabor Tomasz Trzciński 448 15 0 09 Feb 2023
Efficient Domain Adaptation for Speech Foundation Models IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Yue Liu DongSeon Hwang Zhouyuan Huo Junwen Bai Guru Prakash ... K. Sim Yu Zhang Wei Han Trevor Strohman F. Beaufays AI4CE 248 30 0 03 Feb 2023
Synthetic data generation method for data-free knowledge distillation in regression neural networks Expert systems with applications (ESWA), 2023 Tianxun Zhou K. Chiam 233 10 0 11 Jan 2023
ExploreADV: Towards exploratory attack for Neural Networks Tianzuo Luo Yuyi Zhong S. Khoo AAML 185 1 0 01 Jan 2023