ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.09581
  4. Cited By
SepTr: Separable Transformer for Audio Spectrogram Processing
v1v2v3 (latest)

SepTr: Separable Transformer for Audio Spectrogram Processing

Interspeech (Interspeech), 2022
17 March 2022
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Fahad Shahbaz Khan
    ViT
ArXiv (abs)PDFHTMLGithub (28★)

Papers citing "SepTr: Separable Transformer for Audio Spectrogram Processing"

13 / 13 papers shown
XMAD-Bench: Cross-Domain Multilingual Audio Deepfake Benchmark
XMAD-Bench: Cross-Domain Multilingual Audio Deepfake Benchmark
Ioan-Paul Ciobanu
Andrei Iulian Hiji
Nicolae-Cătălin Ristea
Paul Irofti
Cristian Rusu
Radu Tudor Ionescu
161
0
0
31 May 2025
Spatio-Temporal Fuzzy-oriented Multi-Modal Meta-Learning for Fine-grained Emotion Recognition
Spatio-Temporal Fuzzy-oriented Multi-Modal Meta-Learning for Fine-grained Emotion Recognition
Wenwen Qiang
Yuxuan Yang
Jingyao Wang
Changwen Zheng
493
0
0
18 Dec 2024
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision TransformersIEEE Robotics and Automation Letters (RA-L), 2024
Ruchik Mishra
Andrew Frye
M. M. Rayguru
Dan O. Popa
483
2
0
16 Sep 2024
Accuracy enhancement method for speech emotion recognition from
  spectrogram using temporal frequency correlation and positional information
  learning through knowledge transfer
Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer
Jeongho Kim
Seung-Ho Lee
172
10
0
26 Mar 2024
Cascaded Cross-Modal Transformer for Audio-Textual Classification
Cascaded Cross-Modal Transformer for Audio-Textual ClassificationArtificial Intelligence Review (Artif Intell Rev), 2024
Nicolae-Cătălin Ristea
Andrei Anghel
Radu Tudor Ionescu
248
3
0
15 Jan 2024
RoDia: A New Dataset for Romanian Dialect Identification from Speech
RoDia: A New Dataset for Romanian Dialect Identification from Speech
Codrut Rotaru
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
365
6
0
06 Sep 2023
Cascaded Cross-Modal Transformer for Request and Complaint Detection
Cascaded Cross-Modal Transformer for Request and Complaint DetectionACM Multimedia (ACM MM), 2023
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
250
3
0
27 Jul 2023
Transformer-based Sequence Labeling for Audio Classification based on MFCCs
C. Sonali
S. ChinmayiB
A. Balasubramanian
310
0
0
30 Apr 2023
SemanticAC: Semantics-Assisted Framework for Audio Classification
SemanticAC: Semantics-Assisted Framework for Audio ClassificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yicheng Xiao
Yue Ma
Shuyan Li
Hantao Zhou
Ran Liao
Xiu Li
121
11
0
12 Feb 2023
Topological Data Analysis for Speech Processing
Topological Data Analysis for Speech ProcessingInterspeech (Interspeech), 2022
Eduard Tulchinskii
Kristian Kuznetsov
Laida Kushnareva
D. Cherniavskii
S. Barannikov
Irina Piontkovskaya
Sergey I. Nikolenko
Evgeny Burnaev
230
6
0
30 Nov 2022
AHD ConvNet for Speech Emotion Classification
Asfand Ali
Danial Nasir
Mohammad Hassan Jawad
110
0
0
10 Jun 2022
Learning Rate Curriculum
Learning Rate CurriculumInternational Journal of Computer Vision (IJCV), 2022
Florinel-Alin Croitoru
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Andrii Zadaianchuk
239
24
0
18 May 2022
Non-linear Neurons with Human-like Apical Dendrite Activations
Non-linear Neurons with Human-like Apical Dendrite Activations
Mariana-Iuliana Georgescu
Radu Tudor Ionescu
Nicolae-Cătălin Ristea
Andrii Zadaianchuk
445
23
0
02 Feb 2020
1