文章目录

  • 1、论文总述
  • 2、Regressing Mask-level Instances时与Deep Mask的不同
  • 3、分类stage的一个细节
  • 4、试用一款公式神器(MathpixSnip):
  • 5、Differentiable(可微的) RoI Warping Layers.
  • 6、Ablation experiments
  • 7、与其他方法的性能比较
  • 8、作者的一点建议
  • 参考文献

1、论文总述

这篇实力分割的paper的想法符合初学者的思路,即要想实现实例分割(检测+分割),需要级联(这里是串联,MaskRCNN是并联)各个网络:1、 先RPN找出box-level的proposals,2、 然后接全连接层对这些ROI进行像素级别的分割,提取出目标的前景,进而变成mask-level的proposals,3、 然后第三阶段再接全连接层,对这些proposals进行分类,到此就完成了实例分割,当然论文里也提到可以加到5个阶段,即再进行一次分割和分类。

注:串联操作也导致各个stage的损失不是独立,需要根据前一阶段的参数来BP,所以为了达到end_to_end的训练,论文中将ROI pooling那个操作变成了ROI wrapping操作,详见论文。

在多任务学习还不那么热的2015年,如果想的话也一般是这种思路,这个也是coco2015年实例分割的冠军,后来被coco2016年的冠军FCIS(下一篇博客写这篇的总结)吊打,然后MNC和FCIS一起被coco2017的冠军MaskRCNN吊打。。

In this work, we address instance-aware semantic segmentation solely based on CNNs, without using external
modules (e.g., [1]). We observe that the instance-aware semantic segmentation task can be decomposed into three different and related sub-tasks. ,❤️) Differentiating instances. In
this sub-task, the instances can be represented by bounding
boxes that are class-agnostic. ❤️) Estimating masks. In this
sub-task, a pixel-level mask is predicted for each instance.
❤️) Categorizing objects. In this sub-task, the category-wise
label is predicted for each mask-level instance. We expect
that each sub-task is simpler than the original instance segmentation task, and is more easily addressed by convolutional networks.

2、Regressing Mask-level Instances时与Deep Mask的不同

As a related method, DeepMask [25] also regresses discretized masks. DeepMask applies the regression layers
to dense sliding windows (fully-convolutionally), but our
method only regresses masks from a few proposed boxes(RPN)
and so reduces computational cost. Moreover, mask regression is only one stage in our network cascade that shares
features among multiple stages, so the marginal cost of the
mask regression layers is very small.

3、分类stage的一个细节

Following [13], we also use another
box-based pathway, where the RoI pooled features directly
fed into two 4096-d fc layers (this pathway is not illustrated
in Fig. 2). The mask-based and box-based pathways are
concatenated. On top of the concatenation, a softmax classifier of N+1 ways is used for predicting N categories plus
one background category. The box-level pathway may address the cases when the feature is mostly masked out by
the mask-level pathway (e.g., on background).

进到stage3的不仅有从stage2过来的,也有从stage1ROIpooling过来的,提高鲁棒性。

4、试用一款公式神器(MathpixSnip):

The loss term L3 of stage 3 exhibits the following form:

L3=L3(C(Θ)∣B(Θ),M(Θ))L_{3}=L_{3}(C(\Theta) | B(\Theta), M(\Theta)) L3​=L3​(C(Θ)∣B(Θ),M(Θ))

5、Differentiable(可微的) RoI Warping Layers.

The RoI pooling
layer [9, 15] performs max pooling on a discrete grid based
on a box. To derive a form that is differentiable w.r.t. the
box position, we perform RoI pooling by a differentiable
RoI warping layer followed by standard max pooling.

6、Ablation experiments

This baseline has an mAPr of
60.2% using VGG-16. We note that this baseline result is
competitive (see also Table 2), suggesting that decomposing
the task into three sub-tasks is an effective solution.

7、与其他方法的性能比较


注:这篇论文里写的推理时间为0.36秒,但在FCIS与它比较时写的是1.37秒,可能是由于用的basenet不一样,这里是VGG,FCIS中用的是ResNet。

8、作者的一点建议

Our method is designed with fast inference in mind, and
is orthogonal to some other successful strategies developed
previously for semantic segmentation. For example, one
may consider exploiting a CRF [5] to refine the boundaries
of the instance masks. This is beyond the scope of this paper
and will be investigated in the future

注:FCIS论文中说了这篇论文的3个缺点,并且改进了,具体的在FCIS博客中再写。

参考文献

1、实例分割初探

2、MSRA instance-aware semantic segmentation的思路线

论文阅读:MNC:Instance-aware Semantic Segmentation via Multi-task Network Cascades相关推荐

  1. 【论文阅读】Weakly Supervised Semantic Segmentation using Out-of-Distribution Data

    一篇弱监督分割领域的论文,发表在CVPR2022上: 论文标题: Weakly Supervised Semantic Segmentation using Out-of-Distribution D ...

  2. 论文阅读 CVPR2022《Rethinking Semantic Segmentation:A Prototype View》

    回顾一下语义分割,其实可以分为两种.一种是参数化的softmax投射,另一种是部分transformer的方法--query based的方法. 再回到论文的题目 关于prototype的类似定义,可 ...

  3. JSNet: Joint Instance and Semantic Segmentation of 3D Point Clouds

    Abstract 在本文中,我们提出了一种新的联合实例和语义分割方法,称为JSNet,以同时解决3D点云的实例和语义分割.首先,我们建立了一个有效的骨干网络来从原始点云中提取鲁棒的特征.其次,为了获得 ...

  4. ECCV2020语义分割——Self-Prediction for Joint Instance and Semantic Segmentation of Point Clouds

    Self-Prediction for Joint Instance and Semantic Segmentation of Point Clouds Abstract (一) Introducti ...

  5. 语义分割--Large Kernel Matters--Improve Semantic Segmentation by Global Convolutional Network

    Large Kernel Matters–Improve Semantic Segmentation by Global Convolutional Network https://arxiv.org ...

  6. 【论文阅读】[CVPR 2018] PU-Net: Point Cloud Upsampling Network. [tensorflow] [rec. oth.]

    文章目录 [论文阅读][CVPR 2018] PU-Net: Point Cloud Upsampling Network. [tensorflow] [rec. oth.]

  7. 论文阅读:Overview of the NLPCC 2018 Shared Task: Grammatical Error Correction

    论文阅读:Overview of the NLPCC 2018 Shared Task: Grammatical Error Correction 1. 引言 2. 任务定义 3. 数据 3.1 训练 ...

  8. Robust semantic segmentation by dense fusion network on blurred vhr remote sensing images

    Robust semantic segmentation by dense fusion network on blurred vhr remote sensing images 说明se的设计还是很 ...

  9. 论文阅读笔记(2):Learning a Self-Expressive Network for Subspace Clustering,SENet,用于大规模子空间聚类的自表达网络

    论文阅读笔记(2):Learning a Self-Expressive Network for Subspace Clustering. SENet--用于大规模子空间聚类的自表达网络 前言 摘要 ...

  10. Semi-supervised Semantic Segmentation with Error Localization Network(基于误差定位网络的半监督语义分割 )

    Semi-supervised Semantic Segmentation with Error Localization Network(基于误差定位网络的半监督语义分割 ) Abstract 本文 ...

最新文章

  1. Spark SQL Catalyst源代码分析之TreeNode Library
  2. C语言易错题集 第一部
  3. linux opendir php,php opendir()列出目录下所有文件的实例代码
  4. go语言一天入门(上)
  5. 2019全球PostgreSQL生态报告出炉,PG为何从RDBMS中脱颖而出?丨文末送书
  6. 要理解递归,得先理解递归--用Java语言由浅入深讲解汉诺塔游戏
  7. paip.输入法编程--英文ati化By音标原理与中文atiEn处理流程 python 代码为例
  8. 车机常用adb 命令总结
  9. AMI编码规则与HDB3编码规则详解
  10. 比Excel还简单的SQL语句查询
  11. Java实现图片上传到服务器,并把上传的图片读取出来
  12. [转载]胡永恒:《最深刻地影响我的人》
  13. “贵阳发生了什么”大数据主题报告会在京举行
  14. Javase杂谈(四)
  15. dp交换机命令_DP 交换机
  16. AtCoder Beginner Contest 214(补题)
  17. 天池-金融风控训练营-task5-模型融合
  18. php汽车找车位,六款停车APP吐血实测 竟能找到免费停车场?
  19. 针对HFS 2.3漏洞进行渗透测试提权(HFS:HttpFileServer)
  20. java meta_java面试:你真的知道metaspace??

热门文章

  1. fpc:lazarus 安装 Indy 的正确方法
  2. 送给即将进入大学升造侄子的寄语
  3. android电池电量监控,android电量监控 Battery Widget Reborn
  4. HCIA脱产班 学习笔记2
  5. ie的java打不开,ie浏览器无法打开链接
  6. Palm OS 5开发概述
  7. 3月21日互联互通社区科技沙龙
  8. 已解决ValueError: Shape of passed values is (6, 3), indices imply (4, 3)
  9. 【机器学习】生成对抗网络(GAN)
  10. 分子模拟-学习笔记(二)