Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.03433
Cited By
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
6 May 2022
Yuan Gong
Jingbo Yu
James R. Glass
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition"
9 / 9 papers shown
Title
Kimi-Audio Technical Report
KimiTeam
Ding Ding
Zeqian Ju
Yichong Leng
S. Liu
...
Z. Yang
Aoxiong Yin
Ruibin Yuan
Y. Zhang
Zaida Zhou
AuLLM
VLM
108
5
0
25 Apr 2025
Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning
Chun-Yi Kuan
Hung-yi Lee
AuLLM
LRM
70
1
0
03 Jan 2025
OmniBench: Towards The Future of Universal Omni-Language Models
Yizhi Li
Ge Zhang
Yinghao Ma
Ruibin Yuan
Kang Zhu
...
Zhaoxiang Zhang
Zachary Liu
Emmanouil Benetos
Wenhao Huang
Chenghua Lin
LRM
44
11
0
23 Sep 2024
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Yunfei Chu
Jin Xu
Xiaohuan Zhou
Qian Yang
Shiliang Zhang
Zhijie Yan
Chang Zhou
Jingren Zhou
AuLLM
28
267
0
14 Nov 2023
Active Learning of Non-semantic Speech Tasks with Pretrained Models
Harlin Lee
Aaqib Saeed
Andrea L. Bertozzi
VLM
14
2
0
31 Oct 2022
On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors
Z. Bukhsh
Aaqib Saeed
OODD
35
9
0
27 Oct 2022
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource Devices
Harlin Lee
Aaqib Saeed
19
2
0
12 Jul 2022
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates
Björn W. Schuller
A. Batliner
Christian Bergler
Cecilia Mascolo
Jing Han
...
Pietro Cicuta
L. Rothkrantz
J. Zwerts
Jelle Treep
Casper S. Kaandorp
52
109
0
24 Feb 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
101
144
0
02 Feb 2021
1