Keras - Model Compilation

Previously, we studied the basics of how to create models using the Sequential and Functional APIs. This chapter explains how to compile a model. Compilation is the final step in creating a model. Once compilation is done, we can move on to the training phase.

Let us learn a few concepts required to better understand the compilation process.

Loss

In machine learning, a loss function is used to measure the error or deviation in the learning process. Keras requires a loss function during the model compilation process.

Keras provides quite a few loss functions in the losses module, and they are as follows −

  • mean_squared_error
  • mean_absolute_error
  • mean_absolute_percentage_error
  • mean_squared_logarithmic_error
  • squared_hinge
  • hinge
  • categorical_hinge
  • logcosh
  • huber_loss
  • categorical_crossentropy
  • sparse_categorical_crossentropy
  • binary_crossentropy
  • kullback_leibler_divergence
  • poisson
  • cosine_proximity
  • is_categorical_crossentropy

All of the above loss functions accept two arguments −

  • y_true − true labels as tensors

  • y_pred − predictions with the same shape as y_true

Import the losses module before using a loss function, as specified below −


from keras import losses
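
A loss function can also be called directly on tensors, which is handy for checking what it computes. Below is a minimal sketch, assuming a standalone Keras 2.x installation; the sample tensors are made up for illustration −


from keras import backend as K
from keras import losses

# hypothetical labels and predictions, two samples of two values each
y_true = K.constant([[0.0, 1.0], [1.0, 1.0]])
y_pred = K.constant([[0.1, 0.9], [0.8, 1.0]])

# mean_squared_error returns one loss value per sample
print(K.eval(losses.mean_squared_error(y_true, y_pred)))   # [0.01 0.02]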

Optimizer

In machine learning, optimization is an important process which optimizes the input weights by comparing the prediction and the loss function. Keras provides quite a few optimizers as a module, optimizers, and they are as follows −

SGD − Stochastic gradient descent optimizer.


keras.optimizers.SGD(learning_rate = 0.01, momentum = 0.0, nesterov = False)

RMSprop − RMSProp optimizer.


keras.optimizers.RMSprop(learning_rate = 0.001, rho = 0.9)

Adagrad − Adagrad optimizer.


keras.optimizers.Adagrad(learning_rate = 0.01)

Adadelta − Adadelta optimizer.


keras.optimizers.Adadelta(learning_rate = 1.0, rho = 0.95)

Adam − Adam optimizer.


keras.optimizers.Adam(
    learning_rate = 0.001, beta_1 = 0.9, beta_2 = 0.999, amsgrad = False
)

Adamax − Adamax optimizer, a variant of Adam.


keras.optimizers.Adamax(learning_rate = 0.002, beta_1 = 0.9, beta_2 = 0.999)

Nadam − Nesterov Adam optimizer.


keras.optimizers.Nadam(learning_rate = 0.002, beta_1 = 0.9, beta_2 = 0.999)

Import the optimizers module before using an optimizer, as specified below −


from keras import optimizers
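
An optimizer can be instantiated with custom settings and then passed to compile(). A minimal sketch, assuming Keras 2.3+ (where the learning_rate argument is accepted) and that a model is already defined −


from keras import optimizers

# SGD with momentum and Nesterov acceleration; the settings are illustrative
sgd = optimizers.SGD(learning_rate = 0.01, momentum = 0.9, nesterov = True)

# the instance can then be passed to compile()
# model.compile(optimizer = sgd, loss = 'mean_squared_error')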

Metrics

In machine learning, metrics are used to evaluate the performance of your model. They are similar to loss functions, but are not used in the training process. Keras provides quite a few metrics as a module, metrics, and they are as follows −

  • accuracy
  • binary_accuracy
  • categorical_accuracy
  • sparse_categorical_accuracy
  • top_k_categorical_accuracy
  • sparse_top_k_categorical_accuracy
  • cosine_proximity
  • clone_metric

Similar to loss functions, metrics also accept the below two arguments −

  • y_true − true labels as tensors

  • y_pred − predictions with the same shape as y_true

Import the metrics module before using metrics, as specified below −


from keras import metrics
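
Like a loss function, a metric can be evaluated directly on tensors. A minimal sketch, assuming a standalone Keras 2.x installation; the labels and predictions are made up for illustration −


from keras import backend as K
from keras import metrics

# hypothetical one-hot labels and predicted probabilities
y_true = K.constant([[0.0, 0.0, 1.0], [0.0, 1.0, 0.0]])
y_pred = K.constant([[0.1, 0.1, 0.8], [0.6, 0.3, 0.1]])

# categorical_accuracy checks whether the argmax of y_pred matches y_true
print(K.eval(metrics.categorical_accuracy(y_true, y_pred)))   # [1. 0.]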

Compile the model

Keras models provide a method, compile(), to compile the model. The arguments and default values of the compile() method are as follows −


compile(
    optimizer,
    loss = None,
    metrics = None,
    loss_weights = None,
    sample_weight_mode = None,
    weighted_metrics = None,
    target_tensors = None
)

The important arguments are as follows −

  • loss function
  • optimizer
  • metrics

A sample code to compile the model is as follows −


from keras import losses
from keras import optimizers
from keras import metrics
model.compile(loss = 'mean_squared_error',
    optimizer = 'sgd', metrics = [metrics.categorical_accuracy])

where,

  • loss function is set as mean_squared_error

  • optimizer is set as sgd

  • metrics is set as metrics.categorical_accuracy

Model Training

Models are trained on NumPy arrays using fit(). The main purpose of the fit function is to train and evaluate your model on the training data. It can also be used for graphing model performance. It has the following syntax −


model.fit(X, y, epochs = ..., batch_size = ...)

Here,

  • X, y − the input data and its labels, given as NumPy arrays.

  • epochs − the number of times the model iterates over the entire training data.

  • batch_size − the number of training instances processed before the weights are updated.

Let us take a simple example using NumPy random data to apply this concept.

Create data

Let us create random data for x and y using NumPy with the help of the command mentioned below −


import numpy as np
x_train = np.random.random((100,4,8))
y_train = np.random.random((100,10))

Now, create random validation data,


x_val = np.random.random((100,4,8))
y_val = np.random.random((100,10))

Create model

Let us create a simple sequential model −


from keras.models import Sequential

model = Sequential()

Add layers

Create layers and add them to the model −


from keras.layers import LSTM, Dense

# an LSTM over sequences of 4 timesteps with 8 features each; without
# return_sequences it emits one 16-dimensional vector per sample, so the
# final Dense output matches the (100, 10) labels created above
model.add(LSTM(16, input_shape = (4, 8)))
model.add(Dense(10, activation = 'softmax'))

Compile model

Now the model is defined. You can compile it using the below command −


model.compile(
    loss = 'categorical_crossentropy', optimizer = 'sgd', metrics = ['accuracy']
)

Apply fit()

Now we apply the fit() function to train the model on our data −


model.fit(x_train, y_train, batch_size = 32, epochs = 5, validation_data = (x_val, y_val))

Create a Multi-Layer Perceptron ANN

We have learned to create, compile and train the Keras models.

Let us apply our learning and create a simple MLP-based ANN.

Dataset module

Before creating a model, we need to choose a problem, collect the required data and convert the data to NumPy arrays. Once data is collected, we can prepare the model and train it by using the collected data. Data collection is one of the most difficult phases of machine learning. Keras provides a special module, datasets, to download online machine learning data for training purposes. It fetches the data from an online server, processes the data and returns the data as training and test sets. Let us check the data provided by the Keras dataset module. The data available in the module are as follows,

  • CIFAR10 small image classification
  • CIFAR100 small image classification
  • IMDB Movie reviews sentiment classification
  • Reuters newswire topics classification
  • MNIST database of handwritten digits
  • Fashion-MNIST database of fashion articles
  • Boston housing price regression dataset

Let us use the MNIST database of handwritten digits (mnist) as our input. mnist is a collection of 60,000 28x28 grayscale images of the 10 digits. It also contains 10,000 test images.

The below code can be used to load the dataset −


from keras.datasets import mnist
(x_train, y_train), (x_test, y_test) = mnist.load_data()

where

  • Line 1 imports mnist from the keras dataset module.

  • Line 2 calls the load_data function, which fetches the data from the online server and returns it as two tuples. The first tuple, (x_train, y_train), represents the training data with shape (number_samples, 28, 28) and its digit labels with shape (number_samples, ). The second tuple, (x_test, y_test), represents the test data with the same shapes, as the quick check below confirms.
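
The shapes can be confirmed right after loading −


print(x_train.shape, y_train.shape)   # (60000, 28, 28) (60000,)
print(x_test.shape, y_test.shape)     # (10000, 28, 28) (10000,)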

Other datasets can also be fetched using a similar API, and every API returns similar data as well, except for the shape of the data. The shape of the data depends on the type of data.
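
For example, the Boston housing dataset returns tabular data rather than images. A minimal sketch (the shapes shown reflect the default train/test split; variable names are kept distinct from the mnist arrays above) −


from keras.datasets import boston_housing

(xb_train, yb_train), (xb_test, yb_test) = boston_housing.load_data()
print(xb_train.shape)   # (404, 13) - 13 numeric features per sample
print(yb_train.shape)   # (404,) - house price targets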

Create a model

Let us choose a simple multi-layer perceptron (MLP) as represented below and try to create the model using Keras.

The core features of the model are as follows −

  • Input layer consists of 784 values (28 x 28 = 784).

  • First hidden layer, Dense, consists of 512 neurons and the ‘relu’ activation function.

  • Second hidden layer, Dropout, has 0.2 as its value.

  • Third hidden layer, again Dense, consists of 512 neurons and the ‘relu’ activation function.

  • Fourth hidden layer, Dropout, has 0.2 as its value.

  • Fifth and final layer consists of 10 neurons and the ‘softmax’ activation function.

  • Use categorical_crossentropy as the loss function.

  • Use RMSprop() as the optimizer.

  • Use accuracy as the metric.

  • Use 128 as the batch size.

  • Use 20 as the number of epochs.

Step 1 − Import the modules

Let us import the necessary modules.


import keras
from keras.datasets import mnist
from keras.models import Sequential
from keras.layers import Dense, Dropout
from keras.optimizers import RMSprop
import numpy as np

Step 2 − Load data

Let us import the mnist dataset.


(x_train, y_train), (x_test, y_test) = mnist.load_data()

Step 3 − Process the data

Let us change the dataset according to our model, so that it can be fed into our model.


x_train = x_train.reshape(60000, 784)
x_test = x_test.reshape(10000, 784)
x_train = x_train.astype('float32')
x_test = x_test.astype('float32')
x_train /= 255
x_test /= 255
y_train = keras.utils.to_categorical(y_train, 10)
y_test = keras.utils.to_categorical(y_test, 10)

Where

  • reshape is used to reshape the input from the (28, 28) tuple to (784, ).

  • to_categorical is used to convert the label vector into a binary class matrix (one-hot encoding), as illustrated below.
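
A quick illustration of to_categorical on a small, made-up label vector −


from keras.utils import to_categorical

# each integer label becomes a one-hot row of length num_classes
print(to_categorical([0, 2, 1], num_classes = 3))
# [[1. 0. 0.]
#  [0. 0. 1.]
#  [0. 1. 0.]]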

Step 4 − Create the model

Let us create the actual model.


model = Sequential()
model.add(Dense(512, activation = 'relu', input_shape = (784,)))
model.add(Dropout(0.2))
model.add(Dense(512, activation = 'relu'))
model.add(Dropout(0.2))
model.add(Dense(10, activation = 'softmax'))
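
Optionally, model.summary() can be called here to verify the architecture; it prints each layer with its output shape and parameter count −


model.summary()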

Step 5 − Compile the model

Let us compile the model using the selected loss function, optimizer and metrics.


model.compile(loss = 'categorical_crossentropy',
    optimizer = RMSprop(),
    metrics = ['accuracy'])

Step 6 − Train the model

Let us train the model using the fit() method.


history = model.fit(
    x_train, y_train,
    batch_size = 128,
    epochs = 20,
    verbose = 1,
    validation_data = (x_test, y_test)
)
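
The fit() method returns a History object; its history attribute maps each metric name to a list of per-epoch values, which is what makes graphing model performance possible. A minimal sketch (key names such as 'acc' follow Keras 2.x conventions) −


print(history.history.keys())    # e.g. dict_keys(['val_loss', 'val_acc', 'loss', 'acc'])
print(history.history['acc'])    # training accuracy for each of the 20 epochs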

Final thoughts

We have created the model, loaded the data and trained the model on the data. We still need to evaluate the model and predict outputs for unknown inputs, which we will learn in the upcoming chapter. For reference, the complete code is listed below −


import keras
from keras.datasets import mnist
from keras.models import Sequential
from keras.layers import Dense, Dropout
from keras.optimizers import RMSprop
import numpy as np
(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train = x_train.reshape(60000, 784)
x_test = x_test.reshape(10000, 784)
x_train = x_train.astype('float32')
x_test = x_test.astype('float32')
x_train /= 255
x_test /= 255
y_train = keras.utils.to_categorical(y_train, 10)
y_test = keras.utils.to_categorical(y_test, 10)
model = Sequential()
model.add(Dense(512, activation = 'relu', input_shape = (784,)))
model.add(Dropout(0.2))
model.add(Dense(512, activation = 'relu'))
model.add(Dropout(0.2))
model.add(Dense(10, activation = 'softmax'))
model.compile(loss = 'categorical_crossentropy',
    optimizer = RMSprop(),
    metrics = ['accuracy'])
history = model.fit(x_train, y_train,
    batch_size = 128, epochs = 20, verbose = 1, validation_data = (x_test, y_test))

Executing the application will give the below content as output −


Train on 60000 samples, validate on 10000 samples
Epoch 1/20
60000/60000 [==============================] - 7s 118us/step - loss: 0.2453 - acc: 0.9236 - val_loss: 0.1004 - val_acc: 0.9675
Epoch 2/20
60000/60000 [==============================] - 7s 110us/step - loss: 0.1023 - acc: 0.9693 - val_loss: 0.0797 - val_acc: 0.9761
Epoch 3/20
60000/60000 [==============================] - 7s 110us/step - loss: 0.0744 - acc: 0.9770 - val_loss: 0.0727 - val_acc: 0.9791
Epoch 4/20
60000/60000 [==============================] - 7s 110us/step - loss: 0.0599 - acc: 0.9823 - val_loss: 0.0704 - val_acc: 0.9801
Epoch 5/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0504 - acc: 0.9853 - val_loss: 0.0714 - val_acc: 0.9817
Epoch 6/20
60000/60000 [==============================] - 7s 111us/step - loss: 0.0438 - acc: 0.9868 - val_loss: 0.0845 - val_acc: 0.9809
Epoch 7/20
60000/60000 [==============================] - 7s 114us/step - loss: 0.0391 - acc: 0.9887 - val_loss: 0.0823 - val_acc: 0.9802
Epoch 8/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0364 - acc: 0.9892 - val_loss: 0.0818 - val_acc: 0.9830
Epoch 9/20
60000/60000 [==============================] - 7s 113us/step - loss: 0.0308 - acc: 0.9905 - val_loss: 0.0833 - val_acc: 0.9829
Epoch 10/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0289 - acc: 0.9917 - val_loss: 0.0947 - val_acc: 0.9815
Epoch 11/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0279 - acc: 0.9921 - val_loss: 0.0818 - val_acc: 0.9831
Epoch 12/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0260 - acc: 0.9927 - val_loss: 0.0945 - val_acc: 0.9819
Epoch 13/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0257 - acc: 0.9931 - val_loss: 0.0952 - val_acc: 0.9836
Epoch 14/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0229 - acc: 0.9937 - val_loss: 0.0924 - val_acc: 0.9832
Epoch 15/20
60000/60000 [==============================] - 7s 115us/step - loss: 0.0235 - acc: 0.9937 - val_loss: 0.1004 - val_acc: 0.9823
Epoch 16/20
60000/60000 [==============================] - 7s 113us/step - loss: 0.0214 - acc: 0.9941 - val_loss: 0.0991 - val_acc: 0.9847
Epoch 17/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0219 - acc: 0.9943 - val_loss: 0.1044 - val_acc: 0.9837
Epoch 18/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0190 - acc: 0.9952 - val_loss: 0.1129 - val_acc: 0.9836
Epoch 19/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0197 - acc: 0.9953 - val_loss: 0.0981 - val_acc: 0.9841
Epoch 20/20
60000/60000 [==============================] - 7s 112us/step - loss: 0.0198 - acc: 0.9950 - val_loss: 0.1215 - val_acc: 0.9828

Translated from: https://www.tutorialspoint.com/keras/keras_model_compilation.htm
