ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.09126
  4. Cited By
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes
  with Spatiotemporal Annotations of Sound Events

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

15 June 2023
Kazuki Shimada
A. Politis
Parthasaarathy Sudarsanam
D. Krause
Kengo Uchida
Sharath Adavanne
Aapo Hakala
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Tuomas Virtanen
Yuki Mitsufuji
ArXivPDFHTML

Papers citing "STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events"

9 / 9 papers shown
Title
OmniAudio: Generating Spatial Audio from 360-Degree Video
OmniAudio: Generating Spatial Audio from 360-Degree Video
Huadai Liu
Tianyi Luo
Qikai Jiang
Kaicheng Luo
Peiwen Sun
...
X. Li
Shiliang Zhang
Zhijie Yan
Zhou Zhao
Wei Xue
VGen
51
0
0
21 Apr 2025
Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization
Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization
Xueping Zhang
Yaxiong Chen
Ruilin Yao
Yunfei Zi
Shengwu Xiong
21
0
0
11 Apr 2025
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent
  Approach
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
Rory Young
Nicolas Pugeault
AAML
54
3
0
14 Oct 2024
SELD-Mamba: Selective State-Space Model for Sound Event Localization and
  Detection with Source Distance Estimation
SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Da Mu
Zhicheng Zhang
Haobo Yue
Zehao Wang
Jin Tang
Jianqin Yin
Mamba
35
1
0
09 Aug 2024
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for
  Dynamic Speech Enhancement and Localization
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Bing Yang
Changsheng Quan
Yabo Wang
Pengyu Wang
Yujie Yang
Ying Fang
Nian Shao
Hui Bu
Xin Xu
Xiaofei Li
30
5
0
28 Jun 2024
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
Swapnil Bhosale
Haosen Yang
Diptesh Kanojia
Jiankang Deng
Xiatian Zhu
38
5
0
13 Jun 2024
End-to-end Audio-visual Speech Recognition with Conformers
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
221
0
12 Feb 2021
Sound event localization and detection based on crnn using rectangular
  filters and channel rotation data augmentation
Sound event localization and detection based on crnn using rectangular filters and channel rotation data augmentation
Francesca Ronchini
Daniel Arteaga
Andrés Pérez-López
6
8
0
13 Oct 2020
SMOTE: Synthetic Minority Over-sampling Technique
SMOTE: Synthetic Minority Over-sampling Technique
Nitesh V. Chawla
Kevin W. Bowyer
Lawrence Hall
W. Kegelmeyer
AI4TS
160
25,150
0
09 Jun 2011
1