
Title |
|---|
Temporal Query Networks for Fine-grained Video Understanding. Computer Vision and Pattern Recognition (CVPR), 2021 |
A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions. Conference on Computational Natural Language Learning (CoNLL), 2019 |