arXiv: 1811.05014
NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification

12 November 2018
Rongcheng Lin
Jing Xiao
Jianping Fan
    VLM
arXiv (abs) · PDF · HTML · GitHub (207★)

Papers citing "NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification"

34 / 34 papers shown
Deciphering the complaint aspects: Towards an aspect-based complaint identification model with video complaint dataset in finance
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Sarmistha Das
Basha Mujavarsheik
R E Zera Lyngkhoi
Sriparna Saha
Alka Maurya
121
0
0
26 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
289
0
0
11 Feb 2025
Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor
IEEE International Symposium on Biomedical Imaging (ISBI), 2025
Jiaqi Guo
Yunnan Wu
E. Kaimakamis
Georgios Petmezas
Vasileios E. Papageorgiou
N. Maglaveras
Aggelos K. Katsaggelos
329
2
0
21 Jan 2025
VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition
European Conference on Computer Vision (ECCV), 2024
Ahmad Khaliq
Ming Xu
Stephen Hausler
Michael Milford
Sourav Garg
CoGe
196
5
0
28 Sep 2024
CardioSyntax: end-to-end SYNTAX score prediction -- dataset, benchmark and method
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Alexander Ponomarchuk
Ivan Kruzhilov
Galina Zubkova
Artem Shadrin
Ruslan Utegenov
Ivan Bessonov
Pavel Blinov
205
0
0
29 Jul 2024
NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
Xiaohong Liu
Xiongkuo Min
Guangtao Zhai
Chunyi Li
Tengchuan Kou
...
Qi Yan
Youran Qu
Xiaohui Zeng
Lele Wang
Renjie Liao
290
43
0
25 Apr 2024
Beyond still images: Temporal features and input variance resilience
Scientific Reports (Sci Rep), 2023
AmirHosein Fadaei
M. Dehaqani
214
0
0
01 Nov 2023
MM-AU: Towards Multimodal Understanding of Advertisement Videos
ACM Multimedia (ACM MM), 2023
Digbalay Bose
Rajat Hebbar
Tiantian Feng
Krishna Somandepalli
Anfeng Xu
Shrikanth Narayanan
99
12
0
27 Aug 2023
M3PT: A Multi-Modal Model for POI Tagging
Knowledge Discovery and Data Mining (KDD), 2023
Jingsong Yang
Guanzhou Han
Deqing Yang
Jingping Liu
Yanghua Xiao
Xiang Xu
Baohua Wu
Shenghua Ni
180
4
0
16 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
131
12
0
06 Jun 2023
Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation
ACM Multimedia (ACM MM), 2022
Xiao Wang
Tian Gan
Yin-wei Wei
Yue Yu
Dai Meng
Liqiang Nie
103
10
0
15 Mar 2023
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
IEEE Access, 2022
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
158
3
0
09 Dec 2022
Alignment-guided Temporal Attention for Video Action Recognition
Neural Information Processing Systems (NeurIPS), 2022
Yizhou Zhao
Zhenyang Li
Xun Guo
Yan Lu
135
19
0
30 Sep 2022
Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward
Asian Conference on Computer Vision (ACCV), 2022
Yunlong Tang
Siting Xu
Teng Wang
Qin Lin
Qinglin Lu
Feng Zheng
VOS
230
12
0
25 Sep 2022
Computational Sarcasm Analysis on Social Media: A Systematic Review
Faria Binte Kader
Nafisa Hossain Nujat
Tasmia Binte Sogir
Mohsinul Kabir
H. Mahmud
Md. Kamrul Hasan
160
7
0
13 Sep 2022
Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification
Computer Vision and Pattern Recognition (CVPR), 2022
Shi Pu
Kaili Zhao
Mao Zheng
VLM
200
26
0
29 Mar 2022
A Survey on Automated Sarcasm Detection on Twitter
Bleau Moores
Vijay K. Mago
179
19
0
05 Feb 2022
Temporal-attentive Covariance Pooling Networks for Video Recognition
Zilin Gao
Qilong Wang
Bingbing Zhang
Q. Hu
P. Li
264
28
0
27 Oct 2021
Overview of Tencent Multi-modal Ads Video Understanding Challenge
Zhenzhi Wang
Liyu Wu
Zhimin Li
Jiangfeng Xiong
Qinglin Lu
124
5
0
16 Sep 2021
A Multimodal Framework for Video Ads Understanding
ACM Multimedia (ACM MM), 2021
Zejia Weng
Lingjiang Meng
Rui Wang
Zuxuan Wu
Yu-Gang Jiang
92
2
0
29 Aug 2021
Social Fabric: Tubelet Compositions for Video Relation Detection
Shuo Chen
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
ViT
149
25
0
18 Aug 2021
Frame Aggregation and Multi-Modal Fusion Framework for Video-Based Person Recognition
Conference on Multimedia Modeling (MMM), 2020
Fangtao Li
Wenzhe Wang
Zihe Liu
Haoran Wang
C. Yan
Bin Wu
146
4
0
19 Oct 2020
Collaborative Distillation in the Parameter and Spectrum Domains for Video Action Recognition
Haisheng Su
Jing Su
Dongliang Wang
Weihao Gan
Wei Wu
Mengmeng Wang
Junjie Yan
Yu Qiao
108
8
0
15 Sep 2020
Temporal Context Aggregation for Video Retrieval with Contrastive Learning
Jie Shao
Xin Wen
Bingchen Zhao
Xiangyang Xue
AI4TS
150
4
0
04 Aug 2020
Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context
Hankyol Lee
Youngjae Yu
Gunhee Kim
130
25
0
11 Jun 2020
Temporal Aggregate Representations for Long-Range Video Understanding
Fadime Sener
Dipika Singhania
Angela Yao
AI4TS
158
7
0
01 Jun 2020
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
125
63
0
12 May 2020
Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
AAAI Conference on Artificial Intelligence (AAAI), 2020
Kyoung-Woon On
Eun-Sol Kim
Y. Heo
Byoung-Tak Zhang
BDL
115
7
0
17 Jan 2020
BERT for Large-scale Video Segment Classification with Test-time Augmentation
Tianqi Liu
Qizhan Shao
119
4
0
02 Dec 2019
Multi-Label Classification with Label Graph Superimposing
AAAI Conference on Artificial Intelligence (AAAI), 2019
Ya Wang
Dongliang He
Fu Li
Xiang Long
Zhichao Zhou
Jinwen Ma
Shilei Wen
169
180
0
21 Nov 2019
MOD: A Deep Mixture Model with Online Knowledge Distillation for Large Scale Video Temporal Concept Localization
Rongcheng Lin
Jing Xiao
Jianping Fan
76
4
0
27 Oct 2019
Hallucinating Optical Flow Features for Video Classification
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Yongyi Tang
Lin Ma
Lianqiang Zhou
149
21
0
28 May 2019
EnsembleNet: End-to-End Optimization of Multi-headed Models
Hanhan Li
Joe Yue-Hei Ng
Apostol Natsev
135
17
0
24 May 2019
Efficient Video Classification Using Fewer Frames
Computer Vision and Pattern Recognition (CVPR), 2019
S. Bhardwaj
Mukundhan Srinivasan
Mitesh M. Khapra
143
92
0
27 Feb 2019