1. Introduction: Underfitting and Overfitting

[Figure: the same dataset fit three ways — a straight line (left), a quadratic (middle), and a fifth-order polynomial (right).]

  The leftmost figure shows the result of fitting $y = \theta_0 + \theta_1 x_1$ to a dataset. We see that the data doesn't really lie on a straight line, so the fit is not very good. This is called underfitting: with only one feature, the hypothesis is too simple to capture the structure in the data.

So, we add an extra feature $x_1^2$ and fit $y = \theta_0 + \theta_1 x_1 + \theta_2 x_2$, where $x_2 = x_1^2$. The middle figure shows the resulting fit, which is noticeably better.

The rightmost figure is the result of fitting a fifth-order polynomial. Even though the curve passes through the training data almost perfectly, it is a poor model of the underlying relationship. This is called overfitting: there are too many features relative to the amount of data.
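A minimal sketch of the three fits described above. The dataset here is synthetic (the seed, sample size, and true curve are assumptions for illustration, not the data from the figure):

```python
import numpy as np

# Synthetic data: 8 points from an assumed true curve plus noise.
rng = np.random.default_rng(0)
x = np.sort(rng.uniform(0.0, 1.0, 8))
y = np.sqrt(x) + 0.05 * rng.standard_normal(x.shape)

for degree in (1, 2, 5):
    # Design matrix [1, x, x^2, ..., x^degree]; least squares fits theta.
    X = np.vander(x, degree + 1, increasing=True)
    theta, *_ = np.linalg.lstsq(X, y, rcond=None)
    train_err = np.sum((X @ theta - y) ** 2)
    print(f"degree {degree}: training error = {train_err:.5f}")
```

The training error shrinks monotonically as the degree grows, and with degree 5 (six parameters for eight points) it is nearly zero, even though the high-degree curve generalizes worst — the same pattern the three panels illustrate.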

  As discussed previously, and as shown in the example above, the choice of features is important for ensuring good performance of a learning algorithm.

2. Locally Weighted Linear Regression (LWR)

  Locally weighted linear regression, assuming there is sufficient training data, makes the choice of features less critical.

  In the original linear regression algorithm, to make a prediction at a query point x (i.e., to evaluate h(x)), we would:

  a. Fit $\theta$ to minimize $\sum_i (y^{(i)} - \theta^T x^{(i)})^2$.

  b. Output $\theta^T x$.

  In contrast, the locally weighted linear regression algorithm does the following:

  a. Fit $\theta$ to minimize $\sum_i w^{(i)} (y^{(i)} - \theta^T x^{(i)})^2$.

  b. Output $\theta^T x$.

  Here, the $w^{(i)}$'s are non-negative valued weights. Intuitively, if $w^{(i)}$ is large for a particular value of $i$, then in picking $\theta$, we'll try hard to make $(y^{(i)} - \theta^T x^{(i)})^2$ small. If $w^{(i)}$ is small, then the $(y^{(i)} - \theta^T x^{(i)})^2$ error term will be pretty much ignored in the fit.
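Concretely, writing $W = \mathrm{diag}(w^{(1)}, \dots, w^{(m)})$, the weighted objective above is minimized in closed form by (a standard weighted least-squares identity, not spelled out in the passage itself):

$$\theta = (X^T W X)^{-1} X^T W y$$

where $X$ is the design matrix whose rows are the $x^{(i)T}$'s and $y$ is the vector of targets. Setting every $w^{(i)} = 1$ recovers the ordinary normal equations.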

  A fairly standard choice for the weights is:

$$w^{(i)} = \exp\left(-\frac{(x^{(i)} - x)^2}{2\tau^2}\right)$$

  P.S. If $x$ is vector-valued, this is generalized to $w^{(i)} = \exp\left(-\frac{(x^{(i)} - x)^T (x^{(i)} - x)}{2\tau^2}\right)$, or $w^{(i)} = \exp\left(-\frac{(x^{(i)} - x)^T \Sigma^{-1} (x^{(i)} - x)}{2}\right)$, for an appropriate choice of $\tau$ or $\Sigma$.

  Note that the weights depend on the particular point $x$ at which we're trying to evaluate $h(x)$. Moreover, if $|x^{(i)} - x|$ is small, then $w^{(i)}$ is close to 1; and if $|x^{(i)} - x|$ is large, then $w^{(i)}$ is small. Hence, $\theta$ is chosen giving a much higher "weight" to the (errors on) training examples close to the query point $x$: when $|x^{(i)} - x|$ is large, $w^{(i)}$ is small, so that example's $(y^{(i)} - \theta^T x^{(i)})^2$ error term is pretty much ignored in the fit. (Note also that while the formula for the weights takes a form that is cosmetically similar to the density of a Gaussian distribution, the $w^{(i)}$'s do not directly have anything to do with Gaussians, and in particular the $w^{(i)}$ are not random variables, normally distributed or otherwise.)

  The parameter $\tau$ controls how quickly the weight of a training example falls off with the distance of its $x^{(i)}$ from the query point $x$; $\tau$ is called the bandwidth parameter.
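Putting the pieces together, here is a minimal sketch of an LWR prediction at a single query point. The function name, the toy data, and the choice of $\tau$ are assumptions for illustration; the algorithm (Gaussian-shaped weights plus the weighted normal equations) is the one described above:

```python
import numpy as np

def lwr_predict(X, y, x_query, tau):
    """Locally weighted linear regression prediction at one query point.

    X: (m, n) design matrix with the intercept column already included.
    y: (m,) targets.  x_query: (n,) query point.  tau: bandwidth.
    A sketch only; solves the weighted normal equations directly.
    """
    # Gaussian-shaped weights: nearby points get weight ~1, far points ~0.
    diff = X - x_query
    w = np.exp(-np.sum(diff ** 2, axis=1) / (2.0 * tau ** 2))
    W = np.diag(w)
    # theta = (X^T W X)^{-1} X^T W y  (solve rather than invert, for stability)
    theta = np.linalg.solve(X.T @ W @ X, X.T @ W @ y)
    return x_query @ theta

# Example usage with made-up data (an assumption for illustration):
rng = np.random.default_rng(1)
x = np.linspace(0.0, 3.0, 50)
X = np.column_stack([np.ones_like(x), x])   # intercept feature + raw input
y = np.sin(x) + 0.1 * rng.standard_normal(x.shape)
x_q = np.array([1.0, 1.5])                  # query at x = 1.5
print(lwr_predict(X, y, x_q, tau=0.3))      # close to sin(1.5) ~ 0.997
```

Note that $\theta$ is re-fit from scratch for every query point: a small $\tau$ makes the fit very local (and wigglier), while a large $\tau$ approaches ordinary unweighted linear regression.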

  Locally weighted linear regression is the first example we're seeing of a non-parametric algorithm. The (unweighted) linear regression algorithm we saw earlier is known as a parametric learning algorithm, because it has a fixed, finite number of parameters (the $\theta_i$'s), which are fit to the data. Once we've fit the $\theta_i$'s and stored them away, we no longer need to keep the training data around to make future predictions. In contrast, to make predictions using locally weighted linear regression, we need to keep the entire training set around. The term "non-parametric" (roughly) refers to the fact that the amount of stuff we need to keep in order to represent the hypothesis $h$ grows linearly with the size of the training set (we must store the whole dataset).
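The trade-off shows up directly at prediction time. A hedged sketch, reusing the hypothetical lwr_predict and the toy data (X, y, x_q) from the previous example:

```python
# Parametric: fit once, discard the training data, predict from theta alone.
theta, *_ = np.linalg.lstsq(X, y, rcond=None)   # one global straight-line fit
y_hat_parametric = x_q @ theta                  # O(n) work per query

# Non-parametric: every query re-solves a locally weighted fit, so the
# full training set (X, y) must be kept around.
y_hat_lwr = lwr_predict(X, y, x_q, tau=0.3)     # roughly O(m*n^2) per query
```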

    

Reposted from: https://www.cnblogs.com/ustccjw/archive/2013/04/13/3017815.html
