学习笔记,仅供参考,有错必纠

PS : 本BLOG采用中英混合模式,有些英文下有中文翻译(并不是博主翻译的)


非线性回归模型

神经网络

Neural networks (Bishop 1995; Ripley 1996; Titterington 2010) are powerful nonlinear regression techniques inspired by theories about how the brain works.

The outcome is modeled by an intermediary set of unobserved variables (called hidden variables or hidden units here).

  • 翻译

结果变量利用一系列中间层的非观测变量(在此称为隐藏变量或隐藏元)进行建模。

These hidden units are linear combinations of the original predictors, but, they are not estimated in a hierarchical fashion(层级的方式).

As previously stated, each hidden unit is a linear combination of some or all of the predictor variables. However, this linear combination is typically transformed by a nonlinear function g(⋅)g(\cdot)g(⋅),such as the logistic function:
hk(x)=g(β0k+∑j=1Pxjβjk)g(u)=11+e−uh_k(x)=g\left( \beta_{0k}+ \sum_{j=1}^P x_j \beta_{jk} \right) \\g(u) = \frac{1}{1+e^{-u}} hk​(x)=g(β0k​+j=1∑P​xj​βjk​)g(u)=1+e−u1​
The β\betaβ coefficients are similar to regression coefficients; coefficient βjk\beta_{jk}βjk​ is the
effect of the jthj thjth predictor on the kthk thkth hidden unit. A neural network model usually involves multiple hidden units to model the outcome.

There are no constraints that help define these linear combinations. Because of this, there is little likelihood that the coefficients in each unit represent some coherent piece of information.

  • 翻译

在这里讨线性组合的形式没有任何约束。由于这一点,每个隐藏元上的系数可能不会反映出一致的信息。

Once the number of hidden units is defined, each unit must be related to the outcome. Another linear combination connects the hidden units to the outcome:
f(x)=γ0+∑k=1Hγkhkf(x)=\gamma_0 + \sum_{k=1}^H \gamma_k h_k f(x)=γ0​+k=1∑H​γk​hk​

For this type of network model and P predictors, there are a total of H(P+1)+H+1H (P +1) + H + 1H(P+1)+H+1 total parameters being estimated, which quickly becomes large as P increases.

Treating this model as a nonlinear regression model, the parameters are usually optimized to minimize the sum of the squared residuals.

  • 翻译

如果把这一模型作为一个非线性回归来看待,那么参数将要最小化残差平方和。

This can be a challenging numerical optimization problem (recall that there are no constraints on the parameters of this complex nonlinear model).

The parameters are usually initialized to random values and then specialized algorithms for solving the equations are used. The back-propagation algorithm (逆向传播算法) is a highly efficient methodology that works with derivatives to find the optimal parameters. However, it is common that a solution to this equation is not a global solution, meaning that we cannot guarantee that the resulting set of parameters are uniformly better than any other set.

Also, neural networks have a tendency to over-fit the relationship between the predictors and the response due to the large number of regression coefficients.

  • 翻译

此外,神经网络倾向于过度拟合预测变量与响应变量之间的关系,原因是待估参数过多。

To combat this issue, several different approaches have been proposed.

First, the iterative algorithms for solving for the regression equations can be prematurely halted(求解回归方程的迭代算法可以提前被中断) . This approach is referred to as early stopping (提前停止)and would stop the optimization procedure when some estimate of the error rate starts to increase.

Another approach to moderating over-fitting is to use weight decay(权重衰减), a penalization method to regularize the model(正则化模型) similar to ridge regression(岭回归).

The structure of the model described here is the simplest neural network architecture: a single-layer feed-forward network(单层前馈神经网络). There are many other kinds, such as models where there are more than one layer of hidden units (i.e., there is a layer of hidden units that models the other hidden units). Also, other model architectures have loops going both directions between layers.

Given the challenge of estimating a large number of parameters, the fitted model finds parameter estimates that are locally optimal(局部最优); that is, the algorithm converges(算法收敛), but the resulting parameter estimates are unlikely to be the globally optimal estimates.

Very often, different locally optimal solutions can produce models that are very different but have nearly equivalent performance.

This model instability can sometimes hinder this model(这种模型的不稳定性往往会制约神经网络的使用).

As an alternative, several models can be created using different starting values and averaging the results of these model to produce a more stable prediction

These models are often adversely affected by high correlation among the predictor variables.

Two approaches for mitigating this issue is to pre-filter the predictors to remove the predictorsthat are associated with high correlations (移除高相关性变量). Alternatively a feature extraction technique(特征提取技术), such as principal component analysis(PCA), can be used prior to modeling to eliminate correlations(减缓相关性).

非线性回归模型(part1)--神经网络相关推荐

  1. c++ 三次多项式拟合_非线性回归模型(一)--多项式回归

    在许多实际问题分析中,回归分析的应用十分广泛,它是处理变量之间相关关系最常用的一种统计方法.回归分析可分为线性回归和非线性回归. 线性回归分析相信大家都已经非常熟悉了,它主要分析有线性回归趋势的两个变 ...

  2. python非线性回归分析_sklearn实现非线性回归模型

    sklearn实现非线性回归模型 前言: sklearn实现非线性回归模型的本质是通过线性模型实现非线性模型,如何实现呢?sklearn就是先将非线性模型转换为线性模型,再利用线性模型的算法进行训练模 ...

  3. 非线性回归模型(part3)--K近邻

    学习笔记,仅供参考,有错必纠 PS : 本BLOG采用中英混合模式 非线性回归模型 k近邻 The KNN approach simply predicts a new sample using th ...

  4. 非线性回归模型(part2)--支持向量机

    学习笔记,仅供参考,有错必纠 PS : 本BLOG采用中英混合模式 非线性回归模型 支持向量机 SVMs are a class of powerful, highly flexible modelin ...

  5. 机器学习算法终极对比:树模型VS神经网络

    点击上方"小白学视觉",选择加"星标"或"置顶" 重磅干货,第一时间送达 选自towardsdatascience 作者:Andre Ye ...

  6. R语言经典模型BP神经网络

    你好,我是个会计学渣,我的毕业论文里有关于bp神经网络的内容,但是我不会 大概就是运用经典模型BP神经网络去模拟某公司会计盈余质量是好还是不好. 1.黄标为负相关 2.表格上半输入部分为财务数据,中间 ...

  7. matlab实验报告李琼指数函数,基于MATLAB的多元非线性回归模型

    第29卷第2期2009年3月 云南师范大学学报 JournalofYunnanNormalUniversity V01.29No.2 Mar.2009基于MATLAB的多元非线性回归模型' 董大校 ( ...

  8. matlab中非线性回归标准误,SPSS—非线性回归(模型表达式)案例解析_spss培训

    SPSS-非线性回归(模型表达式)案例解析 由简单到复杂,人生有下坡就必有上坡,有低潮就必有高潮的迭起,随着SPSS的深入学习,已经逐渐开始走向复杂,今天跟大家交流一下,SPSS非线性回归,希望大家能 ...

  9. 非线性回归模型的原理及评估——解决行星轨道的拟合问题

    非线性回归模型--行星轨道 概述 回归统计 普通和加权最小二乘法 LogisticLogisticLogistic回归 对数几率分布公式 拟合一个行星轨道 问题一 思路及解答 最小二乘法 问题二 解答 ...

最新文章

  1. HDU2066(SPFA算法)
  2. 《看透SpringMVC》第十二章 HandlerMapping
  3. bs4爬取的时候有两个标签相同_python爬虫初体验,爬取中国最好大学网大学名次...
  4. 从 npm 迁移至yarn
  5. Go 开发关键技术指南 | 为什么你要选择 GO?(内含超全知识大图)
  6. 10 年深度学习顶级论文和代码精选,请务必收藏!
  7. python中classmethod的用法_Python中的@classmethod是如何使用的?
  8. Android WebView 开发详解(三)
  9. DHTMLX 前端框架 建立你的一个应用程序教程(二)--设置布局
  10. windows和linux互传文件,用户配置文件和密码配置文件,用户和组管理
  11. php date当天,php5中date()获得的时间不是当前时间的解决方法
  12. php对接海康视频教程_手把手教你php对接海康api
  13. m2增长率曲线_中国通胀率(中国m2历年数据曲线图)
  14. wnmp php7,wnmp环境php7与其他php版本共存
  15. hp 服务器硬盘背板 供电线,100元淘了一个HP 6位硬盘笼+背板+线,准备用旧机改一个DSM。...
  16. 如何在安卓手机上编辑Excel表格?
  17. .net Application.DoEvents()
  18. word2010添加论文引用
  19. windows网络服务
  20. 警告:计算出的值未被使用warning: value computed is not used [-Wunused-value]

热门文章

  1. 【机器学习】数据挖掘算法——关联规则(一),相关概念,评价指标
  2. cmd弄电脑滚绿代码_windows10点击“此电脑”——“管理”会出现管理员已阻止你运行此应用。有关详细信息。。。...
  3. mybatis和hibernate的区别---Mybatis的学习笔记(四)
  4. 国内视频云市场转入整合阶段
  5. 阿里开源Euler:国内首个工业级图表征学习框架
  6. SpaceEmacs Rock Day5 学习笔记
  7. 重新审视自己和自己的目标
  8. MariaDB 加密特性及使用方法
  9. index作为key是反模式
  10. JFinal model简单包装,版本2