u-net语义分割

Semantic segmentation is a computer vision problem where we try to assign a class to each pixel . Unlike the classic image classification task where only one class value is predicted(assuming single label classification), in this problem we look for class value for each pixel. The application of image segmentation is predominantly seen in the medical field. However now this is being applied in other domains also e.g self driving car.

语义分割是一个计算机视觉问题，我们尝试为每个像素分配一个类。与经典图像分类任务不同，在传统图像分类任务中，仅预测一个类别值( 假设使用单个标签分类 )，在此问题中，我们为每个像素寻找类别值。图像分割的应用主要在医学领域。但是现在，这也被应用于其他领域，例如自动驾驶汽车。

In case of image classification we are particularly interested to know what is there in the image. Semantic segmentation comes with two wh questions which is what and where.

在图像分类的情况下，我们特别想知道图像中有什么。语义分割带有两个wh问题，即什么地方。

什么是U-net： (What Is U-net:)

U-Net is the most popular model for semantic segmentation task. Though we have other models to accomplish this task U-Net is widely accepted as the de-facto standard for this task. A typical U-Net architecture has two parts: Encoder and Decoder.

U-Net是最受欢迎的语义分割任务模型。尽管我们还有其他模型可以完成此任务，但U-Net已被广泛接受为该任务的实际标准。典型的U-Net架构包含两个部分：编码器和解码器。

编码器： (Encoder:)

The job of the encoder is same as any convolutional neural network,which is basically to determine the first wh question what. However when we downsample the image like a typical convnet we tend to lose the information regarding the localization of the segmented objects. The feature maps of the cnn would have learned what is there in the image without any idea of where it is. In the original implementation of the U-Net a 128*18*1 image is taken where the encoder is able to output a 8*8*256 shape.

编码器的工作与任何卷积神经网络相同，基本上是确定第一个问题是什么 。但是，当像典型的卷积网络那样对图像进行降采样时，我们往往会丢失有关分割对象定位的信息。 cnn的特征图将了解图像中的内容，而无需知道其位置。在U-Net的原始实现中，会拍摄128 * 18 * 1的图像，其中编码器能够输出8 * 8 * 256的形状。

解码器： (Decoder:)

Decoder tries to recover the lost information during the encoder’s operation on the image. To do so it applies a skip connection which provides the spatial information that was lost during the downsampling of the image. Also the decoder uses transposed convolution which converts the a small image to a larger one. In the decoder size of the image increases from 8*8*256 to 128*128 *1.

解码器尝试在图像上对编码器进行操作期间恢复丢失的信息。为此，它应用了一个跳过连接，该连接提供了在图像降采样期间丢失的空间信息。解码器也使用转置卷积，将小图像转换为大图像。在解码器中，图像的大小从8 * 8 * 256增加到128 * 128 * 1。

U-Net的变化： (Variations In The U-Net:)

We can find variety of implementation of the U-Net architecture. Instead of transposed convolution we can also apply the bilinear sampling method. Similarly if we can replace the encoder convolutional neural network by any popular network like ResNet or VGG-Net. We may or may not choose to use the pretrained weight.

我们可以找到U-Net架构的各种实现。除了转置卷积，我们还可以应用双线性采样方法。同样，如果我们可以用任何流行的网络(例如ResNet或VGG-Net)代替编码器卷积神经网络。我们可能会或可能不会选择使用预先训练的体重。

This was a theoretical overview of the U-Net model using semantic segmentation. In the next blog we can use this model to do salt identification and do the practical implementation of it.

这是使用语义分段的U-Net模型的理论概述。在下一个博客中，我们可以使用此模型进行盐识别并进行实际实现。

翻译自: https://medium.com/swlh/semantic-segmentation-using-u-net-e0f34e27724f

u-net语义分割

查看全文

http://www.taodudu.cc/news/show-863877.html

地理空间数据
嵌入式系统分类及其应用场景_词嵌入及其应用简介
hotelling变换_基于Hotelling-T²的偏最小二乘（PLS）中的变量选择
命名实体识别实体抽取_您的公司为什么要关心命名实体的识别
机器学习异常值检测_异常值是否会破坏您的机器学习预测？寻找最佳解决方案
yolov3算法优点缺点_优点缺点
主成分分析具体解释_主成分分析-现在用您自己的术语解释
netflix 数据科学家_数据科学和机器学习在Netflix中的应用
python画交互式地图_使用Python构建交互式地图-入门指南
大疆机器学习实习生_我们的数据科学机器人实习生
ai人工智能的本质和未来_人工智能的未来在于模型压缩
tableau使用_使用Tableau探索墨尔本房地产市场
谷歌云请更正这张卡片的信息_如何识别和更正Google Analytics（分析）报告中的（未设置）值
科技情报研究所工资_我们所说的情报是什么？
手语识别_使用深度学习进行手语识别
数据科学的5种基本的面向业务的批判性思维技能
大数据技术学习之旅_数据-数据科学之旅的起点
编写分段函数子函数_编写自己的函数
打破学习的玻璃墙_打破Google背后的创新深度学习
向量矩阵张量_张量，矩阵和向量有什么区别？
monk js_使用Monk AI进行手语分类
辍学的名人_辍学效果如此出色的5个观点
强化学习-动态规划_强化学习-第5部分
查看-增强会话_会话式人工智能-关键技术和挑战-第2部分
我从未看过荒原写作背景_您从未听说过的最佳数据科学认证
nlp算法文本向量化_NLP中的标记化算法概述
数据科学与大数据排名思考题_排名前5位的数据科学课程
《成为一名机器学习工程师》_如何在2020年成为机器学习工程师
打开应用蜂窝移动数据就关闭_基于移动应用行为数据的客户流失预测
端到端机器学习_端到端机器学习项目：评论分类

u-net语义分割_使用U-Net的语义分割相关推荐

语义分割和实例分割_一文读懂语义分割与实例分割
以人工智能为导向的现代计算机视觉技术,在过去的十年中发生了巨大的变化.今天,它被广泛用于图像分类.人脸识别.物体检测.视频分析以及机器人及自动驾驶汽车中的图像处理等领域.图像分割技术是目前预测图像领域 ...
enet分割_论文阅读 | CVPR 2016 | 语义分割论文: ENet
论文源址:https://arxiv.org/abs/1606.02147 tensorflow github: https://github.com/kwotsin/TensorFlow-ENet ...
python实现语义分割_如何用PyTorch进行语义分割？一文搞定
很久没给大家带来教程资源啦. 正值PyTorch 1.7更新,那么我们这次便给大家带来一个PyTorch简单实用的教程资源:用PyTorch进行语义分割. 图源:stanford 该教程是基于2020 ...
基于bert的语义匹配_构建基于BERT的语义搜索系统…针对“星际迷航”
基于bert的语义匹配 If you read my previous article on Towards Data Science you'll know I'm a bit of a Star ...
python语义网络图_知识图谱语义网络，语义网，链接数据和知识图谱（二）－－基础篇...
知识图谱语义网络,语义网,链接数据和知识图谱 (二)--基础篇发布时间:2018-05-14 16:10, 浏览次数:370 一.语义网络(Semantic Network) 对于初学者来讲,这个 ...
halcon粘连字符分割_机器视觉—字符识别之粘连字符分割识别
粘连字符的分割有很多种方法: 第一种方法:利用阈值法提取联通区域,再利用形态学分离字符: 第二种方法:每个字符定义一个感兴趣区域: 第三种方法: 计算区域每列像素数目,由于不同字符之间连接部分非常狭窄 ...
python人像精细分割_人像抠图 - 发丝级人像分割 - 照片人物特效 - 极链科技
利用人工智能深度网络技术训练出的模型,可以模拟 Photoshop 一键完成人像抠图.可应用于证件照合成,人像照片虚化背景,照片背景替换,特效制作,弹幕隐身等多重功能. 请求参数(Body): { & ...
单眼测试_单眼鸟瞰自动驾驶语义分割
单眼测试 Autonomous driving requires an accurate representation of the environment around the ego vehicl ...
语义分割和实例分割_语义分割入门的一点总结
点击上方"CVer",选择加"星标"或"置顶" 重磅干货,第一时间送达作者:Yanpeng Sun https://zhuanlan.zh ...
mask rcnn实例分割_使用Mask-RCNN的实例分割
mask rcnn实例分割 In this article, I will be creating my own trained model for detecting potholes. For d ...

u-net语义分割_使用U-Net的语义分割

什么是U-net： (What Is U-net:)

编码器： (Encoder:)

解码器： (Decoder:)

U-Net的变化： (Variations In The U-Net:)

相关文章：

u-net语义分割_使用U-Net的语义分割相关推荐

最新文章

热门文章