InceptionV3代码解析

参考书籍：需要电子版的可以去CSDN搜索

参考博文：https://blog.csdn.net/superman_xxx/article/details/65451916

读了Google的GoogleNet以及InceptionV3的论文，决定把它实现一下，尽管很难，但是网上有不少资源，就一条一条的写完了，对于网络的解析都在代码里面了，是在原博主的基础上进行修改的，添加了更多的细节，以及自己的理解。总之，是更详细更啰嗦的一个版本，适合初学者。

import tensorflow as tf
from datetime import datetime
import math
import time##参考tensorflow实战书籍+博客https://blog.csdn.net/superman_xxx/article/details/65451916，不过丰富了很多细节
##适合像我一样的初学者
slim = tf.contrib.slim
#Slim is an interface to contrib functions, examples and models.
#只是一个接口作用
trunc_normal = lambda stddev: tf.truncated_normal_initializer(0.0, stddev)
#匿名函数 lambda x: x * x  实际上就是：返回x的平方
# tf.truncated_normal_initializer产生截断的正态分布########定义函数生成网络中经常用到的函数的默认参数########
# 默认参数：卷积的激活函数、权重初始化方式、标准化器等
def inception_v3_arg_scope(weight_decay=0.00004,  # 设置L2正则的weight_decaystddev=0.1, # 标准差默认值0.1batch_norm_var_collection='moving_vars'):# 定义batch normalization（批量标准化/归一化）的参数字典batch_norm_params = {'decay': 0.9997,  # 定义参数衰减系数'epsilon': 0.001,'updates_collections': tf.GraphKeys.UPDATE_OPS,'variables_collections': {'beta': None,'gamma': None,'moving_mean': [batch_norm_var_collection],'moving_variance': [batch_norm_var_collection],#值就是前面设置的batch_norm_var_collection='moving_vars'}}# 给函数的参数自动赋予某些默认值
# slim.arg_scope常用于为tensorflow里的layer函数提供默认值以使构建模型的代码更加紧凑苗条(slim):with slim.arg_scope([slim.conv2d, slim.fully_connected],weights_regularizer=slim.l2_regularizer(weight_decay)):# 对[slim.conv2d, slim.fully_connected]自动赋值，可以是列表或元组# 使用slim.arg_scope后就不需要每次都重复设置参数了，只需要在有修改时设置with slim.arg_scope( # 嵌套一个slim.arg_scope对卷积层生成函数slim.conv2d的几个参数赋予默认值[slim.conv2d],weights_initializer=trunc_normal(stddev), # 权重初始化器activation_fn=tf.nn.relu, # 激活函数normalizer_fn=slim.batch_norm, # 标准化器normalizer_params=batch_norm_params) as sc: # 标准化器的参数设置为前面定义的batch_norm_paramsreturn sc # 最后返回定义好的scope########定义函数可以生成Inception V3网络的卷积部分########
#########Inception V3架构见TENSORFLOW实战书-黄文坚 p124-p125页
def inception_v3_base(inputs, scope=None):'''Args:inputs：输入的tensorscope：包含了函数默认参数的环境'''end_points = {} # 定义一个字典表保存某些关键节点供之后使用with tf.variable_scope(scope, 'InceptionV3', [inputs]):with slim.arg_scope([slim.conv2d, slim.max_pool2d, slim.avg_pool2d], # 对三个参数设置默认值stride=1, padding='VALID'):# 正式定义Inception V3的网络结构。首先是前面的非Inception Module的卷积层#输入图像尺寸299 x 299 x 3# slim.conv2d的第一个参数为输入的tensor，第二个是输出的通道数，卷积核尺寸，步长stride，padding模式net = slim.conv2d(inputs, 32, [3, 3], stride=2, scope='Conv2d_1a_3x3')# 输出尺寸149 x 149 x 32'''因为使用了slim以及slim.arg_scope，我们一行代码就可以定义好一个卷积层相比AlexNet使用好几行代码定义一个卷积层，或是VGGNet中专门写一个函数定义卷积层，都更加方便'''net = slim.conv2d(net, 32, [3, 3], scope='Conv2d_2a_3x3')# 输出尺寸147 x 147 x 32net = slim.conv2d(net, 64, [3, 3], padding='SAME', scope='Conv2d_2b_3x3')# 输出尺寸147 x 147 x 64net = slim.max_pool2d(net, [3, 3], stride=2, scope='MaxPool_3a_3x3')# 输出尺寸73 x 73 x 64net = slim.conv2d(net, 80, [1, 1], scope='Conv2d_3b_1x1')#输出尺寸 73 x 73 x 80.net = slim.conv2d(net, 192, [3, 3], scope='Conv2d_4a_3x3')#输出尺寸 71 x 71 x 192.net = slim.max_pool2d(net, [3, 3], stride=2, scope='MaxPool_5a_3x3')# 输出尺寸35 x 35 x 192.'''上面部分代码一共有5个卷积层，2个池化层，实现了对图片数据的尺寸压缩，并对图片特征进行了抽象有个疑问是框架例里给出的表格中是6个卷积和一个池化，并没有1x1的卷积，为什么要这么做，以及scope后面的名字为什么要这样叫。''''''接下来就是三个连续的Inception模块组，三个Inception模块组中各自分别有多个Inception Module，这部分是Inception Module V3的精华所在。每个Inception模块组内部的几个Inception Mdoule结构非常相似，但是存在一些细节的不同'''# Inception blockswith slim.arg_scope([slim.conv2d, slim.max_pool2d, slim.avg_pool2d],  # 设置所有模块组的默认参数stride=1, padding='SAME'):  # 将所有卷积层、最大池化、平均池化层步长都设置为1#注意这个模块已经统一指定了padding='SAME'，后面不用再说明# 第一个模块组包含了三个结构类似的Inception Module# 第一个模块组第一个Inception Module,Mixed_5bwith tf.variable_scope('Mixed_5b'):  # 第一个Inception Module名称。Inception Module有四个分支with tf.variable_scope('Branch_0'):  # 第一个分支64通道的1*1卷积branch_0 = slim.conv2d(net, 64, [1, 1], scope='Conv2d_0a_1x1')#输出尺寸35*35*64with tf.variable_scope('Branch_1'):  # 第二个分支48通道1*1卷积后一层链接一个64通道的5*5卷积branch_1 = slim.conv2d(net, 48, [1, 1], scope='Conv2d_0a_1x1')#输出尺寸35*35*48branch_1 = slim.conv2d(branch_1, 64, [5, 5], scope='Conv2d_0b_5x5')#输出尺寸35*35*64with tf.variable_scope('Branch_2'): #第三个分支64通道1*1卷积后一层链接2个96通道的5*5卷积branch_2 = slim.conv2d(net, 64, [1, 1], scope='Conv2d_0a_1x1')#输出尺寸35*35*64branch_2 = slim.conv2d(branch_2, 96, [3, 3], scope='Conv2d_0b_3x3')#输出尺寸35*35*96branch_2 = slim.conv2d(branch_2, 96, [3, 3], scope='Conv2d_0c_3x3')# 输出尺寸35*35*96with tf.variable_scope('Branch_3'):  # 第四个分支为3*3的平均池化后一层连接32通道的1*1卷积branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')# 输出尺寸35*35*192branch_3 = slim.conv2d(branch_3, 32, [1, 1], scope='Conv2d_0b_1x1')# 输出尺寸35*35*32net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 将四个分支的输出合并在一起（第三个维度合并，即输出通道上合并）64+64+96+32=256个通道# 输出尺寸35*35*256# 第一个模块组第二个Inception Module 名称是：Mixed_5cwith tf.variable_scope('Mixed_5c'):   #同样有4个分支，唯一不同的是第4个分支最后接的是64输出通道with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 64, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸35*35*64with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 48, [1, 1], scope='Conv2d_0b_1x1')# 输出尺寸35*35*48branch_1 = slim.conv2d(branch_1, 64, [5, 5], scope='Conv_1_0c_5x5')# 输出尺寸35*35*64with tf.variable_scope('Branch_2'):branch_2 = slim.conv2d(net, 64, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸35*35*64branch_2 = slim.conv2d(branch_2, 96, [3, 3], scope='Conv2d_0b_3x3')# 输出尺寸35*35*96branch_2 = slim.conv2d(branch_2, 96, [3, 3], scope='Conv2d_0c_3x3')# 输出尺寸35*35*96with tf.variable_scope('Branch_3'):branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')# 输出尺寸35*35*192branch_3 = slim.conv2d(branch_3, 64, [1, 1], scope='Conv2d_0b_1x1')# 输出尺寸35*35*64net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 将四个分支的输出合并在一起（第三个维度合并，即输出通道上合并）64+64+96+64=288个通道# 输出尺寸35*35*288# 第一个模块组第3个Inception Module 名称是：Mixed_5dwith tf.variable_scope('Mixed_5d'):with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 64, [1, 1], scope='Conv2d_0a_1x1')with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 48, [1, 1], scope='Conv2d_0a_1x1')branch_1 = slim.conv2d(branch_1, 64, [5, 5], scope='Conv2d_0b_5x5')with tf.variable_scope('Branch_2'):branch_2 = slim.conv2d(net, 64, [1, 1], scope='Conv2d_0a_1x1')branch_2 = slim.conv2d(branch_2, 96, [3, 3], scope='Conv2d_0b_3x3')branch_2 = slim.conv2d(branch_2, 96, [3, 3], scope='Conv2d_0c_3x3')with tf.variable_scope('Branch_3'):branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')branch_3 = slim.conv2d(branch_3, 64, [1, 1], scope='Conv2d_0b_1x1')net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 将四个分支的输出合并在一起（第三个维度合并，即输出通道上合并）64+64+96+64=288个通道# 输出尺寸35*35*288# 第二个Inception模块组是一个非常大的模块组，包含了5个Inception Mdoule，2-5个Inception Mdoule结构非常相似#第二个模块组第一个Inception Module 名称是：Mixed_6a#输入是35*35*288with tf.variable_scope('Mixed_6a'):  #包含3个分支with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 384, [3, 3], stride=2,padding='VALID', scope='Conv2d_1a_1x1')# padding='VALID'图片尺寸会被压缩，通道数增加# 输出尺寸17*17*384with tf.variable_scope('Branch_1'): #64通道的1*1加2个96通道的3*3卷积branch_1 = slim.conv2d(net, 64, [1, 1], scope='Conv2d_0a_1x1')#输出尺寸35 * 35 * 64branch_1 = slim.conv2d(branch_1, 96, [3, 3], scope='Conv2d_0b_3x3')#输出尺寸35*35*96branch_1 = slim.conv2d(branch_1, 96, [3, 3], stride=2,padding='VALID', scope='Conv2d_1a_1x1')# 图片被压缩/输出尺寸17*17*96with tf.variable_scope('Branch_2'):branch_2 = slim.max_pool2d(net, [3, 3], stride=2, padding='VALID',scope='MaxPool_1a_3x3')#输出尺寸17 * 17 * 288（这里大家注意，书上还有很多博客上都是384+96+256=736并不是768，所以最后应该加288）net = tf.concat([branch_0, branch_1, branch_2], 3)# 输出尺寸定格在17 x 17 x 768# 第二个模块组第二个Inception Module 名称是：Mixed_6bwith tf.variable_scope('Mixed_6b'):  #4个分支with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸17*17*192with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 128, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸17 * 17 * 128branch_1 = slim.conv2d(branch_1, 128, [1, 7],scope='Conv2d_0b_1x7')# 输出尺寸17 * 17 * 128# 串联1*7卷积和7*1卷积合成7*7卷积，减少了参数，减轻了过拟合branch_1 = slim.conv2d(branch_1, 192, [7, 1], scope='Conv2d_0c_7x1')# 输出尺寸17 * 17 * 192with tf.variable_scope('Branch_2'):branch_2 = slim.conv2d(net, 128, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸17 * 17 * 128# 反复将7*7卷积拆分branch_2 = slim.conv2d(branch_2, 128, [7, 1], scope='Conv2d_0b_7x1')branch_2 = slim.conv2d(branch_2, 128, [1, 7], scope='Conv2d_0c_1x7')branch_2 = slim.conv2d(branch_2, 128, [7, 1], scope='Conv2d_0d_7x1')branch_2 = slim.conv2d(branch_2, 192, [1, 7], scope='Conv2d_0e_1x7')# 这种方法算是利用Factorization into small convolutions 的典范with tf.variable_scope('Branch_3'):  #3*3的平均池化branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')# 输出尺寸17 * 17 * 768branch_3 = slim.conv2d(branch_3, 192, [1, 1], scope='Conv2d_0b_1x1')# 输出尺寸17 * 17 * 192net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 输出尺寸定格在17 x 17 x （192*4）=17*17*768# 第二个模块组第三个Inception Module 名称是：Mixed_6c# 同前面一个相似，第二个分支和第三个分支的前几层通道由120升为160with tf.variable_scope('Mixed_6c'):with tf.variable_scope('Branch_0'):'''我们的网络每经过一个inception module，即使输出尺寸不变，但是特征都相当于被重新精炼了一遍，其中丰富的卷积和非线性化对提升网络性能帮助很大。'''branch_0 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 160, [1, 1], scope='Conv2d_0a_1x1')branch_1 = slim.conv2d(branch_1, 160, [1, 7], scope='Conv2d_0b_1x7')branch_1 = slim.conv2d(branch_1, 192, [7, 1], scope='Conv2d_0c_7x1')with tf.variable_scope('Branch_2'):branch_2 = slim.conv2d(net, 160, [1, 1], scope='Conv2d_0a_1x1')branch_2 = slim.conv2d(branch_2, 160, [7, 1], scope='Conv2d_0b_7x1')branch_2 = slim.conv2d(branch_2, 160, [1, 7], scope='Conv2d_0c_1x7')branch_2 = slim.conv2d(branch_2, 160, [7, 1], scope='Conv2d_0d_7x1')branch_2 = slim.conv2d(branch_2, 192, [1, 7], scope='Conv2d_0e_1x7')with tf.variable_scope('Branch_3'):branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')branch_3 = slim.conv2d(branch_3, 192, [1, 1], scope='Conv2d_0b_1x1')net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 输出尺寸定格在17 x 17 x （192*4）=17*17*768# 第二个模块组第四个Inception Module 名称是：Mixed_6d# 和前面一个完全一样，增加卷积和非线性，提炼特征with tf.variable_scope('Mixed_6d'):with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 160, [1, 1], scope='Conv2d_0a_1x1')branch_1 = slim.conv2d(branch_1, 160, [1, 7], scope='Conv2d_0b_1x7')branch_1 = slim.conv2d(branch_1, 192, [7, 1], scope='Conv2d_0c_7x1')with tf.variable_scope('Branch_2'):branch_2 = slim.conv2d(net, 160, [1, 1], scope='Conv2d_0a_1x1')branch_2 = slim.conv2d(branch_2, 160, [7, 1], scope='Conv2d_0b_7x1')branch_2 = slim.conv2d(branch_2, 160, [1, 7], scope='Conv2d_0c_1x7')branch_2 = slim.conv2d(branch_2, 160, [7, 1], scope='Conv2d_0d_7x1')branch_2 = slim.conv2d(branch_2, 192, [1, 7], scope='Conv2d_0e_1x7')with tf.variable_scope('Branch_3'):branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')branch_3 = slim.conv2d(branch_3, 192, [1, 1], scope='Conv2d_0b_1x1')net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 输出尺寸定格在17 x 17 x （192*4）=17*17*768# 第二个模块组第五个Inception Module 名称是：Mixed_6e# 也同前面一样，通道数变为192，但是要将Mixed_6e存储在end_points中with tf.variable_scope('Mixed_6e'):with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')branch_1 = slim.conv2d(branch_1, 192, [1, 7], scope='Conv2d_0b_1x7')branch_1 = slim.conv2d(branch_1, 192, [7, 1], scope='Conv2d_0c_7x1')with tf.variable_scope('Branch_2'):branch_2 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')branch_2 = slim.conv2d(branch_2, 192, [7, 1], scope='Conv2d_0b_7x1')branch_2 = slim.conv2d(branch_2, 192, [1, 7], scope='Conv2d_0c_1x7')branch_2 = slim.conv2d(branch_2, 192, [7, 1], scope='Conv2d_0d_7x1')branch_2 = slim.conv2d(branch_2, 192, [1, 7], scope='Conv2d_0e_1x7')with tf.variable_scope('Branch_3'):branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')branch_3 = slim.conv2d(branch_3, 192, [1, 1], scope='Conv2d_0b_1x1')net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 输出尺寸定格在17 x 17 x （192*4）=17*17*768end_points['Mixed_6e'] = net# 将Mixed_6e存储于end_points中，作为Auxiliary Classifier辅助模型的分类# 第三个Inception模块包含了3个Inception Mdoule，后两个个Inception Mdoule结构非常相似# 第三个模块组第一个Inception Module 名称是：Mixed_7awith tf.variable_scope('Mixed_7a'):  # 3个分支with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸17*17*192branch_0 = slim.conv2d(branch_0, 320, [3, 3], stride=2,padding='VALID', scope='Conv2d_1a_3x3')# 压缩图片# 输出尺寸8*8*320with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 192, [1, 1], scope='Conv2d_0a_1x1')branch_1 = slim.conv2d(branch_1, 192, [1, 7], scope='Conv2d_0b_1x7')branch_1 = slim.conv2d(branch_1, 192, [7, 1], scope='Conv2d_0c_7x1')branch_1 = slim.conv2d(branch_1, 192, [3, 3], stride=2,padding='VALID', scope='Conv2d_1a_3x3')# 输出尺寸8*8*192with tf.variable_scope('Branch_2'):  # 池化层不会对输出通道数产生改变branch_2 = slim.max_pool2d(net, [3, 3], stride=2, padding='VALID',scope='MaxPool_1a_3x3')# 输出尺寸8*8*768net = tf.concat([branch_0, branch_1, branch_2], 3)# 输出图片尺寸被缩小，通道数增加，tensor的总size在持续下降中# 输出尺寸8*8*（320+192+768）=8*8*1280# 第三个模块组第二个Inception Module 名称是：Mixed_7b'''这个模块最大的区别是分支内又有分支，network in network in network'''with tf.variable_scope('Mixed_7b'): # 4 个分支with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 320, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸8*8*320with tf.variable_scope('Branch_1'): #第二个分支里还有分支branch_1 = slim.conv2d(net, 384, [1, 1], scope='Conv2d_0a_1x1')# 输出尺寸8*8*384branch_1 = tf.concat([slim.conv2d(branch_1, 384, [1, 3], scope='Conv2d_0b_1x3'),slim.conv2d(branch_1, 384, [3, 1], scope='Conv2d_0b_3x1')], 3)# 输出尺寸8 * 8 * （384+384）=8*8*768with tf.variable_scope('Branch_2'): #这个分支更复杂：1*1>3*3>1*3+3*1,总共有三层branch_2 = slim.conv2d(net, 448, [1, 1], scope='Conv2d_0a_1x1')branch_2 = slim.conv2d(branch_2, 384, [3, 3], scope='Conv2d_0b_3x3')branch_2 = tf.concat([slim.conv2d(branch_2, 384, [1, 3], scope='Conv2d_0c_1x3'),slim.conv2d(branch_2, 384, [3, 1], scope='Conv2d_0d_3x1')], 3)# 输出尺寸8 * 8 * （384+384）=8*8*768with tf.variable_scope('Branch_3'):branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')branch_3 = slim.conv2d(branch_3, 192, [1, 1], scope='Conv2d_0b_1x1')# 输出尺寸8 * 8 * （384+384）=8*8*192net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 输出通道数增加到2048 # 输出尺寸8 * 8 * （320+768+768+192）=8*8*2048# 第三个模块组第三个Inception Module 名称是：Mixed_7c#同前一个一样with tf.variable_scope('Mixed_7c'):with tf.variable_scope('Branch_0'):branch_0 = slim.conv2d(net, 320, [1, 1], scope='Conv2d_0a_1x1')with tf.variable_scope('Branch_1'):branch_1 = slim.conv2d(net, 384, [1, 1], scope='Conv2d_0a_1x1')branch_1 = tf.concat([slim.conv2d(branch_1, 384, [1, 3], scope='Conv2d_0b_1x3'),slim.conv2d(branch_1, 384, [3, 1], scope='Conv2d_0c_3x1')], 3)with tf.variable_scope('Branch_2'):branch_2 = slim.conv2d(net, 448, [1, 1], scope='Conv2d_0a_1x1')branch_2 = slim.conv2d(branch_2, 384, [3, 3], scope='Conv2d_0b_3x3')branch_2 = tf.concat([slim.conv2d(branch_2, 384, [1, 3], scope='Conv2d_0c_1x3'),slim.conv2d(branch_2, 384, [3, 1], scope='Conv2d_0d_3x1')], 3)with tf.variable_scope('Branch_3'):branch_3 = slim.avg_pool2d(net, [3, 3], scope='AvgPool_0a_3x3')branch_3 = slim.conv2d(branch_3, 192, [1, 1], scope='Conv2d_0b_1x1')net = tf.concat([branch_0, branch_1, branch_2, branch_3], 3)# 输出尺寸8 * 8 * （320+768+768+192）=8*8*2048return net, end_points#Inception V3网络的核心部分，即卷积层部分就完成了'''设计inception net的重要原则是图片尺寸不断缩小，inception模块组的目的都是将空间结构简化，同时将空间信息转化为高阶抽象的特征信息，即将空间维度转为通道的维度。降低了计算量。Inception Module是通过组合比较简单的特征抽象（分支1）、比较比较复杂的特征抽象（分支2和分支3）和一个简化结构的池化层（分支4），一共四种不同程度的特征抽象和变换来有选择地保留不同层次的高阶特征，这样最大程度地丰富网络的表达能力。'''########全局平均池化、Softmax和Auxiliary Logits(之前6e模块的辅助分类节点)########
def inception_v3(inputs,num_classes=1000, # 最后需要分类的数量（比赛数据集的种类数）is_training=True, # 标志是否为训练过程，只有在训练时Batch normalization和Dropout才会启用dropout_keep_prob=0.8, # 节点保留比率prediction_fn=slim.softmax, # 最后用来分类的函数spatial_squeeze=True, # 参数标志是否对输出进行squeeze操作（去除维度数为1的维度，比如5*3*1转为5*3）reuse=None, # 是否对网络和Variable进行重复使用scope='InceptionV3'): # 包含函数默认参数的环境with tf.variable_scope(scope, 'InceptionV3', [inputs, num_classes], # 定义参数默认值reuse=reuse) as scope:#'InceptionV3'是命名空间with slim.arg_scope([slim.batch_norm, slim.dropout], # 定义标志默认值is_training=is_training):# 拿到最后一层的输出net和重要节点的字典表end_pointsnet, end_points = inception_v3_base(inputs, scope=scope) # 用定义好的函数构筑整个网络的卷积部分# Auxiliary Head logits作为辅助分类的节点，对分类结果预测有很大帮助，# 对end_points的结界做平均池化、卷积最后通过1*1的卷积将通道变为1000with slim.arg_scope([slim.conv2d, slim.max_pool2d, slim.avg_pool2d],stride=1, padding='SAME'): # 将卷积、最大池化、平均池化步长设置为1aux_logits = end_points['Mixed_6e'] # 通过end_points取到Mixed_6ewith tf.variable_scope('AuxLogits'):aux_logits = slim.avg_pool2d(aux_logits, [5, 5], stride=3, padding='VALID', # 在Mixed_6e之后接平均池化。压缩图像尺寸scope='AvgPool_1a_5x5')# 输入图像尺寸17*17*768，输出5*5*768aux_logits = slim.conv2d(aux_logits, 128, [1, 1], # 卷积。压缩图像尺寸。scope='Conv2d_1b_1x1')# 输出图像尺寸5*5*128# Shape of feature map before the final layer.aux_logits = slim.conv2d(aux_logits, 768, [5,5],weights_initializer=trunc_normal(0.01), # 权重初始化方式重设为标准差为0.01的正态分布padding='VALID', scope='Conv2d_2a_5x5')# 输出图像尺寸1*1*768aux_logits = slim.conv2d(aux_logits, num_classes, [1, 1], activation_fn=None,normalizer_fn=None, weights_initializer=trunc_normal(0.001),scope='Conv2d_2b_1x1')# 输出变为1*1*1000，这里的num_classes表示输出通道数，不用激活和标准化，权重重设为0.001的正态分布if spatial_squeeze: # tf.squeeze消除tensor中前两个为1的维度。aux_logits = tf.squeeze(aux_logits, [1, 2], name='SpatialSqueeze')#  这里非常值得注意，tf.squeeze(aux_logits, [1, 2]）为什么是[1, 2]而不是[0,1],括号里表示为1的维度
#因为训练的时候是输入的批次，第一维不是1，[32,1,1,1000].end_points['AuxLogits'] = aux_logits# 最后将辅助分类节点的输出aux_logits储存到字典表end_points中# 处理正常的分类预测逻辑# Final pooling and prediction# 这一过程的主要步骤：对Mixed_7c的输出进行8*8的全局平均池化>Dropout>1*1*1000的卷积>除去维数为1>softmax分类with tf.variable_scope('Logits'):net = slim.avg_pool2d(net, [8, 8], padding='VALID',scope='AvgPool_1a_8x8')#输入为8*8*2048 输出为1 x 1 x 2048net = slim.dropout(net, keep_prob=dropout_keep_prob, scope='Dropout_1b')end_points['PreLogits'] = net# 1*1*2048logits = slim.conv2d(net, num_classes, [1, 1], activation_fn=None,normalizer_fn=None, scope='Conv2d_1c_1x1')# 激活函数和规范化函数设为空 # 输出通道数1*1*1000if spatial_squeeze: # tf.squeeze去除输出tensor中维度为1的节点logits = tf.squeeze(logits, [1, 2], name='SpatialSqueeze')end_points['Logits'] = logitsend_points['Predictions'] = prediction_fn(logits, scope='Predictions')# Softmax对结果进行分类预测return logits, end_points # 最后返回logits和包含辅助节点的end_points
#end_points里面有'AuxLogits'、'Logits'、'Predictions'分别是辅助分类的输出，主线的输出以及经过softmax后的预测输出'''
到这里，前向传播已经写完，对其进行运算性能测试
'''
########评估网络每轮计算时间########
def time_tensorflow_run(session, target, info_string):# Args:# session:the TensorFlow session to run the computation under.# target:需要评测的运算算子。# info_string:测试名称。num_steps_burn_in = 10# 先定义预热轮数（头几轮跌代有显存加载、cache命中等问题因此可以跳过，只考量10轮迭代之后的计算时间）total_duration = 0.0 # 记录总时间total_duration_squared = 0.0 # 总时间平方和  -----用来后面计算方差#迭代计算时间for i in range(num_batches + num_steps_burn_in): # 迭代轮数start_time = time.time() # 记录时间_ = session.run(target) # 每次迭代通过session.run(target)duration = time.time() - start_time#每十轮输出一次if i >= num_steps_burn_in:if not i % 10:print ('%s: step %d, duration = %.3f' %(datetime.now(), i - num_steps_burn_in, duration))total_duration += duration  # 累加便于后面计算每轮耗时的均值和标准差total_duration_squared += duration * durationmn = total_duration / num_batches # 每轮迭代的平均耗时vr = total_duration_squared / num_batches - mn * mn# 方差，是把一般的方差公式进行化解之后的结果，值得 借鉴sd = math.sqrt(vr) # 标准差print ('%s: %s across %d steps, %.3f +/- %.3f sec / batch' %(datetime.now(), info_string, num_batches, mn, sd))#输出的时间是处理一批次的平均时间加减标准差# 测试前向传播性能
batch_size = 32 # 因为网络结构较大依然设置为32，以免GPU显存不够
height, width = 299, 299 # 图片尺寸
# 随机生成图片数据作为input
inputs = tf.random_uniform((batch_size, height, width, 3))with slim.arg_scope(inception_v3_arg_scope()):# scope中包含了batch normalization默认参数，激活函数和参数初始化方式的默认值logits, end_points = inception_v3(inputs, is_training=False)# inception_v3中传入inputs获取里logits和end_pointsinit = tf.global_variables_initializer() # 初始化全部模型参数
sess = tf.Session() # 创建session
sess.run(init)
num_batches=100 # 测试的batch数量
time_tensorflow_run(sess, logits, "Forward")
'''
虽然输入图片比VGGNet的224*224大了78%，但是forward速度却比VGGNet更快。
这主要归功于其较小的参数量，inception V3参数量比inception V1的700万
多了很多，不过仍然不到AlexNet的6000万参数量的一半。相比VGGNet的1.4
亿参数量就更少了。整个网络的浮点计算量为50亿次，比inception V1的15亿
次大了不少，但是相比VGGNet来说不算大。因此较少的计算量让inception V3
网络变得非常实用，可以轻松地移植到普通服务器上提供快速响应服务，甚至
移植到手机上进行实时的图像识别。
'''
#分析结果：前面的输出是当前时间下每10步的计算时间，最后输出的是当前时间下前向传播的总批次以及平均时间+-标准差

最后附上我的运行结果：

我也有很多不懂得地方，毕竟也是初学者，欢迎交流讨论

InceptionV3代码解析相关推荐

matrix_multiply代码解析
matrix_multiply代码解析关于matrix_multiply 程序执行代码里两个矩阵的乘法,并将相乘结果打印在屏幕上. 示例的主要目的是展现怎么实现一个自定义CPU计算任务. 参考:ht ...
CornerNet代码解析——损失函数
CornerNet代码解析--损失函数文章目录 CornerNet代码解析--损失函数前言总体损失 1.Heatmap的损失 2.Embedding的损失 3.Offset的损失前言今天要解 ...
视觉SLAM开源算法ORB-SLAM3 原理与代码解析
来源:深蓝学院,文稿整理者:何常鑫,审核&修改:刘国庆本文总结于上交感知与导航研究所科研助理--刘国庆关于[视觉SLAM开源算法ORB-SLAM3 原理与代码解析]的公开课. ORB-SLA ...
java获取object属性值_java反射获取一个object属性值代码解析
有些时候你明明知道这个object里面是什么,但是因为种种原因,你不能将它转化成一个对象,只是想单纯地提取出这个object里的一些东西,这个时候就需要用反射了. 假如你这个类是这样的: privat ...
python中的doc_基于Python获取docx/doc文件内容代码解析
这篇文章主要介绍了基于Python获取docx/doc文件内容代码解析,文中通过示例代码介绍的非常详细,对大家的学习或者工作具有一定的参考学习价值,需要的朋友可以参考下整体思路: 下载文件并修改后缀 ...
mongoose框架示例代码解析（一）
mongoose框架示例代码解析(一) 参考: Mongoose Networking Library Documentation(Server) Mongoose Networking Librar ...
ViBe算法原理和代码解析
ViBe - a powerful technique for background detection and subtraction in video sequences 算法官网:http:// ...
【Android 逆向】使用 Python 代码解析 ELF 文件 ( PyCharm 中进行断点调试 | ELFFile 实例对象分析 )
文章目录一.PyCharm 中进行断点调试二.ELFFile 实例对象分析一.PyCharm 中进行断点调试在上一篇博客 [Android 逆向]使用 Python 代码解析 ELF 文件 ( ...
密码算法中iv值是什么_?标检测中的?极?值抑制算法(nms)：python代码解析
⾮极⼤值抑制(Non-Maximum Suppression)原理⾮极⼤值抑制,顾名思义,找出极⼤值,抑制⾮极⼤值.这种思路和算法在各个领域中应⽤⼴泛,⽐如边缘检测算法canny算⼦中就使⽤了该⽅法 ...

InceptionV3代码解析

InceptionV3代码解析相关推荐

最新文章

热门文章