ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.06702
  4. Cited By
Temporally Aligning Long Audio Interviews with Questions: A Case Study
  in Multimodal Data Integration

Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration

10 October 2023
Piyush Singh Pasi
Karthikeya Battepati
P. Jyothi
Ganesh Ramakrishnan
T. Mahapatra
Manoj Singh
ArXivPDFHTML

Papers citing "Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration"

4 / 4 papers shown
Title
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language
  Processing
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
110
192
0
14 Oct 2021
Deep Bregman Divergence for Contrastive Learning of Visual
  Representations
Deep Bregman Divergence for Contrastive Learning of Visual Representations
Mina Rezaei
Farzin Soleymani
B. Bischl
Shekoofeh Azizi
SSL
34
16
0
15 Sep 2021
CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
Anirudh Gupta
Harveen Singh Chadha
Priyanshi Shah
Neeraj Chimmwal
Ankur Dhuriya
Rishabh Gaur
Vivek Raghavan
24
36
0
15 Jul 2021
pyannote.audio: neural building blocks for speaker diarization
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
177
307
0
04 Nov 2019
1