论文阅读：Learnable pooling with Context Gating for video classification

这篇论文是2016年Google Cloud & YouTube-8M Video Understanding Challenge比赛中冠军得主的论文。
文章的两点贡献：

融合了VLAD, bag-of-visual-words和Fisher Vector三种编码方式，并且每个都做了一定程度的调整。其中，VLAD改为NetRVLAD, bag-of-visual-words改为Soft-DBoW, Fisher Vector改为NetFV。
提出了一个新的非线性的单元 Context Gating (CG)。CG可以捕获特征之间或者标签之间的依赖性。具体的还要再看一下再补充。

论文框架：

实验结果：

代码： https://github.com/antoine77340/Youtube-8M-WILLOW
工具：https://github.com/antoine77340/LOUPE.

论文阅读：Learnable pooling with Context Gating for video classification相关推荐

多模态 —— Learnable pooling with Context Gating for video classification
前言论文地址:arxiv 代码地址:github 这是视频理解的一篇paper,说是多模态的原因主要是该结构结合了视频embedding,音频embedding等特征做视频分类,可以说就是多模态融合 ...
【2017】Learnable pooling with Context Gating for videoclassification借助Context Gating进行可学习的池化以进行视频分类
intro: CVPR17 Youtube 8M workshop. Kaggle 1st place arxiv: https://arxiv.org/abs/1706.06905 github: ...
论文阅读：Target Adaptive Context Aggregation for Video Scene Graph Generation
Target Adaptive Context Aggregation for Video Scene Graph Generation 视频场景图中的目标自适应上下文聚合论文地址:https:// ...
【论文阅读】Rethinking Spatiotemporal Feature Learning For Video Understanding
[论文阅读]Rethinking Spatiotemporal Feature Learning For Video Understanding 这是一篇google的论文,它和之前介绍的一篇face ...
论文阅读：Volumetric and Multi-View CNNs for Object Classification on 3D Data
Preface 最近由于要做正颌手术中术后变形预测的问题,要处理三维数据,所以在研究三维卷积,三维分类的问题. 今天阅读一篇CVPR2016的论文:<Volumetric and Mul ...
【VideoQA最新论文阅读】第一篇视频问答综述Video Question Answering: a Survey of Models and Datasets
Video Question Answering: a Survey of Models and Datasets 长文预警!!! p.s.此篇文章于2021年1月25日新鲜出炉,在Springer需 ...
【论文阅读+翻译】Context-Aware Residual Module for Image Classification
如有侵权,联系删除 [2021ICPR] Context-Aware Residual Module for Image Classification 用于图像分类的上下文感知残差模块论文链接:ht ...
论文阅读 TSM: Temporal Shift Module for Efficient Video Understanding
TSM: Temporal Shift Module for Efficient Video Understanding Computer Vision and Pattern Recognition ...
论文阅读-Combining EfficientNet and Vision Transformers for Video Deepfake Detection（深度鉴伪）
一.论文信息论文名称:Combining EfficientNet and Vision Transformers for Video Deepfake Detection 论文代码:https:/ ...

论文阅读：Learnable pooling with Context Gating for video classification

论文阅读：Learnable pooling with Context Gating for video classification相关推荐

最新文章

热门文章