Answering Diverse Questions via Text Attached with Key Audio-Visual Clues

11 March 2024

Papers citing "Answering Diverse Questions via Text Attached with Key Audio-Visual Clues"

2 / 2 papers shown

Title
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding Hu Xu Gargi Ghosh Po-Yao (Bernie) Huang Dmytro Okhonko Armen Aghajanyan Florian Metze Luke Zettlemoyer Florian Metze Luke Zettlemoyer Christoph Feichtenhofer CLIP VLM 245 554 0 28 Sep 2021
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting Xingjian Shi Zhourong Chen Hao Wang Dit-Yan Yeung W. Wong W. Woo 201 7,816 0 13 Jun 2015