ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.17143
  4. Cited By
Exploring Train and Test-Time Augmentations for Audio-Language Learning
v1v2 (latest)

Exploring Train and Test-Time Augmentations for Audio-Language Learning

31 October 2022
Eungbeom Kim
Jinhee Kim
Yoori Oh
Kyungsu Kim
Minju Park
Jaeheon Sim
J. Lee
Kyogu Lee
ArXiv (abs)PDFHTML

Papers citing "Exploring Train and Test-Time Augmentations for Audio-Language Learning"

11 / 11 papers shown
Thinking While Listening: Simple Test Time Scaling For Audio Classification
Thinking While Listening: Simple Test Time Scaling For Audio Classification
Prateek Verma
Mert Pilanci
LRM
80
0
0
24 Sep 2025
From Contrast to Commonality: Audio Commonality Captioning for Enhanced Audio-Text Cross-modal Understanding in Multimodal LLMs
From Contrast to Commonality: Audio Commonality Captioning for Enhanced Audio-Text Cross-modal Understanding in Multimodal LLMs
Yuhang Jia
Xu Zhang
Yong Qin
Yang Chen
Shiwan Zhao
VLM
187
0
0
03 Aug 2025
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning
Manh Luong
Khai Nguyen
Dinh Q. Phung
Gholamreza Haffari
Zhuang Li
OT
271
0
0
08 Feb 2025
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
Zhisheng Zhong
Chengyao Wang
Yuqi Liu
Senqiao Yang
Longxiang Tang
...
Shaozuo Yu
Sitong Wu
Eric Lo
Shu Liu
Jiaya Jia
AuLLM
280
18
0
12 Dec 2024
EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio
  Captioning Performance
EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance
Jaeyeon Kim
Minjeon Jeon
Jaeyoon Jung
Sang Hoon Woo
Jinjoo Lee
192
3
0
02 Sep 2024
EDTC: enhance depth of text comprehension in automated audio captioning
EDTC: enhance depth of text comprehension in automated audio captioning
Liwen Tan
Yin Cao
Yi Zhou
199
0
0
27 Feb 2024
EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for
  Automated Audio Captioning
EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning
Jaeyeon Kim
Jaeyoon Jung
Jinjoo Lee
Sang Hoon Woo
CLIPVLM
198
41
0
31 Jan 2024
Zero-shot audio captioning with audio-language model guidance and audio
  context keywords
Zero-shot audio captioning with audio-language model guidance and audio context keywords
Leonard Salewski
Stefan Fauth
A. Sophia Koepke
Zeynep Akata
191
15
0
14 Nov 2023
Audio Difference Learning for Audio Captioning
Audio Difference Learning for Audio CaptioningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Tatsuya Komatsu
Yusuke Fujita
K. Takeda
Tomoki Toda
155
7
0
15 Sep 2023
Multilingual Audio Captioning using machine translated data
Multilingual Audio Captioning using machine translated data
Matéo Cousin
Etienne Labbé
Thomas Pellegrini
158
4
0
14 Sep 2023
Killing two birds with one stone: Can an audio captioning system also be
  used for audio-text retrieval?
Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval?
Etienne Labbé
Thomas Pellegrini
J. Pinquier
152
5
0
29 Aug 2023
1