在机器学习中,导致overfitting的原因之一是noise,这个noise可以分为两种,即stochastic noise,随机噪声来自数据产生过程,比如测量误差等,和deterministic noise,确定性噪声来自added complexity,即model too complex。这两种类型的造成来源不同,但是对于学习的影响是相似的,large noise总会导致overfitting。


This is a very subtle question!

The most important thing to realize is that in learning, H is fixed and D is given, and so can be assumed fixed. Now we can ask, what is going on in this learning scenario. Here is what we can say:

i) If there is stochastic noise with ‘magnitude’ σ2, then you are in trouble.

ii) If there deterministic noise then you are in trouble.

The stochastic noise can be viewed as one part of the data generation process (eg. measurement errors). The deterministic noise can similarly be viewed as another part of the data generation process, namely f. The deterministic and stochastic noise are fixed. In your analogy, you can increase the stochastic noise by increasing the noise variance and you get into deeper trouble. Similarly, you can increase the deterministic noise by making f more complex and you will get into deeper trouble.

I just need to tell you what ‘trouble’ means. Well, we actually use another word instead of ‘trouble’ - overfitting.

This means you may be likely to make an inferior choice over the superior choice because the inferior choice has lower in-sample error. Doing stuff that looks good in-sample that leads to disasters out-of-sample is the essence of overfitting. An example of this is trying to choose the regularization parameter. If you pick a lower regularization parameter, then you have lower in-sample error, but it leads to higher out-of-sample error - you picked the λ with lowerEinbut it gave higher Eout. We call that overfitting. Underfitting is just the name we give to the opposite process in the context of picking the regularization parameter. Once the regularization parameter gets too high, as you pick a higher λ you get both higher Einand higher Eout. It also turns out that this means you over regularized and obtained an over-simplistic g - i.e. you ‘underfitted’, you didn’t fit the data enough. The underfitting and overfitting are just terms. The substance of what is going on under the hood is how the deterministic and stochastic noise are affecting what you should and should not do in-sample.
Now let’s get back to the subtle part of your question. There is actually another way to decrease the deterministic noise - increase the complexity of H (the other way is to decrease the complexity of f which we discussed above). Now is where the difference with stochastic noise pops up. With stochastic noise, it either goes up or down; if down, then things get better. With deterministic noise, if you just tell me that it went down, I need to ask you how. Did your target function get simpler - if yes, then great, it is just as if the stochastic noise went down. If it is that your H got more complicated, then things get interesting. To understand what is going on, the Bias Variance decomposition helps (bottom of page 125 in the textbook).

Eout=σ2+bias+var

σ2is the direct impact of the stochastic noise. bias is the direct impact of the deterministic noise. The var term is interesting and is the indirect impact of the noise, through H. The var term is mostly controlled by the size of H in relation to the number of data points. So getting back to the point, if you make H more complex, you will decrease the det. noise (bias) but you will increase the var (its indirect impact). Usually the latter dominates (overfitting, not because of the direct impact of the noise, but because of its indirect impact) … unless you are in the underfitting regime when the former dominates.

上面一段主要摘自《learning from data》一书,主要说明的内容是overfitting的含义以及noise对于overfitting的效用。
下面是对overfitting的很好的总结:
VC维大=>模型复杂度高=>error in sample 小=>模型不够平滑=>generalization能力弱=>error out of sample大=>overfitting=>模型并没有卵用。

总的来说,deterministic noise是由于你选择的H中的最好的hypothesis h∗对于不在H中的function f进行估计时的差。在给定x后,这个deterministic noise就确定了。
deterministic function可用来生成伪随机数(pseudo-random generator)。
详细的论述可以参看《learning from data》


2015-8-27
艺少

转载于:https://www.cnblogs.com/huty/p/8519205.html

stochastic noise and deterministic noise相关推荐

  1. 使用Transient noise和ac noise仿真晶体管噪声

    目录 前言 热噪声(thermal noise) 闪烁噪声(flick noise) 栅感应噪声(induced gate noise) 参考文献 前言 For the two types of no ...

  2. SmoothGrad: removing noise by adding noise

    SmoothGrad: removing noise by adding noise 文章目录 **SmoothGrad: removing noise by adding noise** 背景知识 ...

  3. SmoothGrad: removing noise by adding noise论文解读

    Abstract 解释神经网络的输出一直是一个非常困难的任务.对于一个图片分类器而言,一种类型的解释是分别那些对最终结果有很大影响的像素.这种策略的一个starting point is the gr ...

  4. 台湾大学林轩田机器学习基石课程学习笔记13 -- Hazard of Overfitting

    红色石头的个人网站:redstonewill.com 上节课我们主要介绍了非线性分类模型,通过非线性变换,将非线性模型映射到另一个空间,转换为线性模型,再来进行分类,分析了非线性变换可能会使计算复杂度 ...

  5. 机器学习基石13-Hazard of Overfitting

    注: 文章中所有的图片均来自台湾大学林轩田<机器学习基石>课程. 笔记原作者:红色石头 微信公众号:AI有道 上节课主要介绍了非线性分类模型,通过非线性变换,将非线性模型映射到另一个空间, ...

  6. Active Noise Cancelling-主动噪声消除

    ABSTRACT 我们生活中充斥着各种噪声,而随着科技的进步,各种音频降噪技术也不断涌现.主动降噪(Active Noise Cancelling)技术属于降噪技术的其中一种,其基本原理是通过播放&q ...

  7. 噪音 - Perlin Noise

    转载自:http://www.cnblogs.com/babyrender/archive/2008/10/27/BabyRender.html 说起perlin noise, 最初也就是在课上大概了 ...

  8. DND(darmstadt noise dataset)数据集的坑

    忙活了好久,以为可以使用DND进行训练模型,以为它和SIDD一样有input和groundtruth,这样可以当成训练集去训练去噪模型 其实并不是! 这个DND数据集,并不能提供训练,也就是说它并不能 ...

  9. 【Cinemachine智能相机教程】VirtualCamera(四):Noise属性

    摘要:[长文预警,先收藏后品尝]如何让Unity中的相机模拟现实世界手持拍摄的晃动效果呢?Cinemachine的Noise属性可以让你在5分钟内解决这个问题. 洪流学堂,让你快人几步.你好,我是郑洪 ...

  10. 愁绪千万端,扰乱不成眠——如何修复Noise?

    今天我们来介绍一下如何修复ECO阶段的Noise Violation. 首先,我们也来介绍一下什么是Noise. 网上资料很多,大致整理一下,noise(噪声)也称为crosstalk(串扰),sig ...

最新文章

  1. Mysql基础运维及复制架构——PRIT非完整恢复
  2. Understanding Ethereum Smart Contracts
  3. 什么样的项目经历会让面试官眼前一亮
  4. python调用jar 性能_亲自有效---python 调用jar
  5. 转: 自适应css布局—-流动布局新时代(译文)
  6. linux 查看文件工具,lsof---Linux查看文件信息的强大工具
  7. 反射(3)—动态、静态代理
  8. 总结div里面水平垂直居中的实现方法
  9. uniapp debug能打开release不行_盘点在 Release 下由循环体优化所产生的不确定性 Bug...
  10. .net抽象类和抽象函数abstract
  11. centos6.8 hugepage设置
  12. php嗅探视频地址,用PHP嗅探youku视频的真实地址!
  13. python提取qq群成员代码_Python提取QQ群成员QQ号及昵称
  14. AI智能语音系统源码搭建
  15. Mysql如何跨库查询数据?
  16. 数字电路设计的基本方法有哪些
  17. 诛仙手游-各属性道法换算
  18. 后端自我介绍_java开发自我介绍3篇
  19. vue-动态加载图片
  20. 六、字体样式和文本样式

热门文章

  1. GitHub上下载源代码的方法
  2. 华为中低端路由器清除密码方法
  3. InnoDB与Myisam的六大区别
  4. Mac安装和配置Tomcat的教程
  5. 求助!妹子一个rm -rf把公司服务器数据删没了,我该怎么办
  6. Kotlin 或将取代 Java —— 《Java 编程思想》作者 Bruce Eckel
  7. 揭秘2017双11背后的网络-双11的网络产品和技术概览
  8. 性能优化的常见模式及趋势
  9. 《Linux就该这么学》修正已知全部勘误,免费下载啦!
  10. 福利 | 限时免费体验 Google 官方 Android 课程