学习 TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation 医学分割
Fusing Transformers and CNNs for Medical Image Segmentation
文章目录
- Fusing Transformers and CNNs for Medical Image Segmentation
- Abstract
- Introduction
- CNN
- Transformer
- TransFuse
- Proposed Method
- two parallel branches
- Transformer branch
- CNN branch
- BiFusion Module
- Loss Function
- Experiments and Results
- Data Acquisition
- Implementation Details
- Evaluation Results
- Conclusion
- 评审意见
Abstract
Medical image segmentation - the prerequisite of numerous clinical needs - has been significantly prospered by recent advances in convolutional neural networks (CNNs). However, it exhibits general limitations on modeling explicit long-range relation, and existing cures, resorting to building deep encoders along with aggressive downsampling operations, leads to redundant deepened networks and loss of localized details. Hence, the segmentation task awaits a better solution to improve the efficiency of modeling global contexts while maintaining a strong grasp of low-level details. In this paper, we propose a novel parallel-in-branch architecture, TransFuse, to address this challenge. TransFuse combines Transformers and CNNs in a parallel style, where both global dependency and low-level spatial details can be efficiently captured in a much shallower manner. Besides, a novel fusion technique - BiFusion module is created to efficiently fuse the multi-level features from both branches. Extensive experiments demonstrate that TransFuse achieves the newest state-of-the-art results on both 2D and 3D medical image sets including polyp, skin lesion, hip, and prostate segmentation, with significant parameter decrease and inference speed improvement
Introduction
CNN
- CNN在诸多医学图像分割任务中取得了优异性能,如多器官分割、肝部病变分割、脑补分割等,显示出CNN在建模特定任务特征表示的强大能力
- 缺点:CNN的一大问题是在捕获全文上下文信息方面缺乏效率,如果通过堆叠、扩大感受野的方式则需要连续的下采样-卷积操作,使得网络结构十分深,这一过程也会导致局部信息的丢失,而拒不细节信息对于密集预测任务也十分重要
Transformer
- 缺点:transformer也有自身局限,就是无法较好的建模细粒度特征,尤其是对于医学图像,细节特征十分重要,其在建模局部信息时缺少空间的归纳偏执
TransFuse
- 可以有效的捕获低层次空间特征和高级语义特征
- 无需搭建深层次网络就可以减轻梯度消失、特征无法有效复用等问题
- 模型效率和推理速度有了很大的提升,并且部署在云端或终端的效率也有很大提升
Proposed Method
two parallel branches
CNN branch
- 增加感受野,将特征从局部到全局
Transformer branch
- 从全局自注意力开始,最后恢复局部细节
two benefits
- 有效的发挥CNN和Transformer各自的优势,在不需要搭建深层次网络的同时,有效捕获全局信息,还能保持精确的low-level信息
- BIfusion在提取特征的过程中,同时利用了CNN和Transformer的不同性质,从而融合的更好
Transformer branch
HXWX3的输入切成patch(patch number= 16);随后patch经过线性映射并展平,在送入Transformer之前还会嵌入可训练的位置信息。
Transformer中包含L层MSA和MLP
- SA(zi)=softmax(qikTDh)vSA(z_i) = softmax(\frac{q_ik^T}{\sqrt{D_h}})v SA(zi)=softmax(DhqikT)v
Transformer encoder处理后的结果会送到Decoder,在解码器部分则采用渐进上采样的方式(PUP),类似SETR的操作
首先会将Z^L的输出reshape到原始二维维度,拥有D0个通道;然后使用两个连续的上采样-卷积层来恢复空间分辨率,最终获得不同大小的上采样结果,与CNN的feature map进行特征融合
CNN branch
- 使用了ResNet作为CNN的分支
- 保留前4层的输出,然后将他们与Transformer的结果融合获得融合后的特征提取
BiFusion Module
融合CNN和Transformer提取的特征
通道注意力
t^i=ChannelAttn(ti)\widehat{t}^i = ChannelAttn(t^i) ti=ChannelAttn(ti)
SE-Block
空间注意力
g^i=ChannelAttn(gi)\widehat{g}^i = ChannelAttn(g^i) gi=ChannelAttn(gi)
CBAM block 作为空间滤波器,增强局部细节,抑制无关区域,低层次的CNN特征会存在噪声
3x3卷积
b^i=Conv(tiW1i⋅giW2i)\widehat{b}^i = Conv(t^iW^i_1 \cdot g^iW^i_2) bi=Conv(tiW1i⋅giW2i)
Hadamard product,矩阵点乘,对两个分支的特征之间的细粒度交互进行建模
残差连接
- fi=Residual([bi,ti,gi])f^i = Residual([b^i,t^i,g^i]) fi=Residual([bi,ti,gi])
利用attention-gate(AG)来生成最终的分割结果
- fi+1=Conv([Up(fi),AG(fi+1,Up(fi))])f^{i+1} = Conv([Up(f^i), AG(f^{i+1}, Up(f^i))]) fi+1=Conv([Up(fi),AG(fi+1,Up(fi))])
Loss Function
- 加权mIoU损失
- BCE损失
Experiments and Results
Data Acquisition
息肉分割
Kvasir , CVC-ClinicDB , CVC-ColonDB ,EndoScene and ETIS
- 352×352
皮肤损伤
2017 International Skin Imaging Collaboration skin lesion segmentation dataset (ISIC2017)
- 192×256
髋关节分割
前列腺分割
volumetric Prostate Multi-modality MRIs from the Medical Segmentation Decathlon
- 320 × 320
Implementation Details
TransFuse-S
- ResNet-34 (R34) and 8-layer DeiT-Small (DeiT-S)
TransFuse-L
- Res2Net-50 and 10-layer DeiT-Base (DeiT-B)
TransFuse-L*
- ResNetV2-50 and ViT-B
Evaluation Results
Results of Polyp Segmentation
mean Dice
- Dice系数是一种集合相似度度量函数,通常用于计算两个样本的相似度,取值为[0,1]。
mIOU
Table1展示了息肉分割的对比结果,TransFuse-S/L与其他CNN网络相比均达到了SOTA,而且参数量比PraNet等减少了20%,实时性也更好
而预训练过的TransFuse-L*的性能也优于SETR和TransUNet
Results of Skin Lesion Segmentation
- 在皮肤损伤分割实验中性能指标是Jaccard指数和Dice系数以及逐像素点的accuracy。
- Table 2中,TransFuse性能优于UNet++,同时不需要任何预处理或后处理,而UNet++需要使用预训练的R34作为主干
Results of Hip Segmentation
- Hausdorff Distance (HD)
- Average Surface Distance (ASD)
- Table 3:髋关节分割的对比结果,主要需要分割腓骨、左股骨、右股骨。与UNet++和HRNet相比,TransFuse在HD和ASD这两个指标参数上均更加优秀,充分证明了本文提出的TransFuse可以有效捕获精细的结构,生成更加清晰准确的轮廓
Results of Prostate Segmentation
- nnUNet是目前在前列腺分割排名第一的分割网络
- Table 4:TransFuse与nnUMet的对比结果。可以看到与nnUNet-3d相比,TransFuse-S不仅性能更好,而且参数量减少了41%,吞吐率增加了50%。
Ablation Study
- Table 5:并行分支的消融实验,Table 6展示的是BiFusion的消融实验。
- 可以看到两个分支选择CNN搭配Transformer性能是最佳的,BiFusion结合空间注意力、通道注意力、计算内积都会提升性能。
Conclusion
评审意见
官方评审意见:https://miccai2021.org/openaccess/paperlinks/2021/09/01/496-Paper0016.html
评分:6,6,6
参考:http://t.csdn.cn/d2JR8
学习 TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation 医学分割相关推荐
- [Transformer]TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation
TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation Abstract Section I Introducti ...
- [论文阅读] TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation
论文地址:https://arxiv.org/abs/2102.08005 发表于:MICCAI'21 Abstract 医学图像分割,这一众多临床需求的先决条件–已经被卷积神经网络(CNN)的最新进 ...
- TransFuse:Fusing Transformerd and CNNs for Medical Image Segmentation
TransFuse:Fusing Transformerd and CNNs for Medical Image Segmentation Yundong Zhang, Huiye Liu, Qian ...
- 深度学习论文: U-Net and its variants for Medical Image Segmentation : A short review
深度学习论文: U-Net and its variants for Medical Image Segmentation : A short review U-Net and its variant ...
- TransUNet: Transformers Make StrongEncoders for Medical Image Segmentation文章详解(结合代码)
1.摘要 医学图像分割是开发医疗保健系统,特别是疾病诊断和治疗计划的必要前提.在各种医学图像分割任务中,U形架构(也称为U-Net)已成为事实上的标准,并取得了巨大的成功.然而,由于卷积运算的内在局部 ...
- [深度学习论文笔记]Pairwise Learning for Medical Image Segmentation
[深度学习论文笔记]Pairwise Learning for Medical Image Segmentation 医学图像分割的成对学习 Published: October 2020 Publi ...
- [深度学习论文笔记]UNETR: Transformers for 3D Medical Image Segmentation
UNETR: Transformers for 3D Medical Image Segmentation UNETR:用于三维医学图像分割的Transformer Published: Oct 20 ...
- 深度学习论文: EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers及其PyTorch实现
深度学习论文: EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers及其PyTorch实现 ...
- [深度学习论文笔记]TransBTSV2: Wider Instead of Deeper Transformer for Medical Image Segmentation
TransBTSV2: Wider Instead of Deeper Transformer for Medical Image Segmentation TransBTSV2:用于医学图像分割的宽 ...
最新文章
- VIM打开shell脚本中文乱码解决
- maven nexus memory optimization
- 【Python基础教程】for循环用法详解
- 首个金融领域的开源中文预训练语言模型FinBERT了解下
- Python中的包以及跨目录访问变量和函数(同级目录)
- Windows内存管理机密+揭穿内存优化工具的骗局
- 【转】Javascript面向对象编程(二):构造函数的继承
- [MySQL] 分组排序取前N条记录以及生成自动数字序列,类似group by后 limit
- [视频演示].NET Core开发的iNeuOS物联网平台,实现从设备PLC、云平台、移动APP数据链路闭环...
- 算法的时间复杂度[ZT]
- 开源爬虫框架各有什么优缺点?
- 20套初级javaweb项目开发案例(适合小白)
- PyCharm的安装和破解
- 如何高效的自学编程(新手篇)
- Error: ErrorCodeERRPS008:SubStatusES0001:Error: Could not read installation path from registry.
- 【资源分享(免积分)】增长黑客_创业公司的用户与收入增长秘籍 - 范冰(高清版免费).pdf
- storm apache java_Apache Storm 示例 Java 拓扑 - Azure HDInsight | Microsoft Docs
- 对y_pred强制二分类
- 金融数据挖掘(一):A股上市公司2021年年报
- 《乘风破浪的姐姐》的观后感,有钱真好!
热门文章
- Meteinfo结合Arcgis制作风矢量图
- 如何突破卫星影像建模难点?重建大师这样做!
- Python实战---使用正则表达式爬取古诗文网
- phpyun 打开调试开关显示错误信息
- PPT背景图片怎么设置?4个详细教程在这!
- 2019寒假作业2 编程总结
- CREO:CREO软件新建之草绘(.sec)、零件(.prt)、组件(.asm)、制造(.mfg)、绘图(.drw)、格式(.frm)、报表(.rep)、图表(.dgm)、布局简介、使用方法之详细攻略
- 基于 Python 制作吃豆豆小游戏(文档和源码~)
- HTML5里video标签支持哪些格式的视频文件?
- 【转】python问题 Traceback (most recent call last)