Clotho: An Audio Captioning Dataset

21 October 2019
K. Drossos
Samuel Lipping
Tuomas Virtanen

Papers citing "Clotho: An Audio Captioning Dataset"

9 / 259 papers shown
Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning
Daiki Takeuchi
Yuma Koizumi
Yasunori Ohishi
N. Harada
K. Kashino
6
26
0
24 Sep 2020
RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis
Yuki Okamoto
Keisuke Imoto
Shinnosuke Takamichi
Ryosuke Yamanishi
Takahiro Fukumori
Y. Yamashita
6
5
0
09 Jul 2020
Multi-task Regularization Based on Infrequent Classes for Audio Captioning
Emre Çakir
K. Drossos
Tuomas Virtanen
15
17
0
09 Jul 2020
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
Shijie Geng
Peng Gao
Moitreya Chatterjee
Chiori Hori
Jonathan Le Roux
Yongfeng Zhang
Hongsheng Li
A. Cherian
21
11
0
08 Jul 2020
Temporal Sub-sampling of Audio Feature Sequences for Automated Audio Captioning
K. Nguyen
K. Drossos
Tuomas Virtanen
15
12
0
06 Jul 2020
The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation
Yuma Koizumi
Daiki Takeuchi
Yasunori Ohishi
N. Harada
K. Kashino
19
22
0
01 Jul 2020
A Transformer-based Audio Captioning Model with Keyword Estimation
Yuma Koizumi
Ryo Masumura
Kyosuke Nishida
Masahiro Yasuda
Shoichiro Saito
11
54
0
01 Jul 2020
Listen carefully and tell: an audio captioning system based on residual learning and gammatone audio representation
Sergi Perez-Castanos
Javier Naranjo-Alcazar
P. Zuccarello
M. Cobos
16
11
0
27 Jun 2020
Audio Captioning using Gated Recurrent Units
Aysegül Özkaya Eren
M. Sert
14
10
0
05 Jun 2020