[关闭]
@nrailgun 2015-10-02T13:04:08.000000Z 字数 2049 阅读 1548

统计学习方法概论

机器学习


Statistical learning

Also unknown as Statistical machine learning.

3 Keys in machine learning

Model

Denotes hypothesis space as

F={f|Y=fθ(X),θRn},

where X is input variable in input space X and Y is output variable in output space Y, Rn vector is parameter from n dimensional parameter space.

Similarily, we can define hypothesis space in probability view as

F={P|P(Y|X)}.

Strategy

Loss function

Definie expectation of loss as

Rexp(f)=x×yL(y,f(x))P(x,y)dxdy.

But you actually need empirical loss:
Remp(f)=1Ni=1NL(yi,f(xi)).

ERM & SRM

ERM means empirical risk minimization (minimizing empirical loss function). But it causes over-fitting problem when trainning set is not large enough.

SRM (structural risk minimization) is a strategy proposed against over-fitting problem. Structural risk minimization is equivalent to regularization. SRM is defined as:

Rsrm(f)=1Ni=1NL(yi,f(xi))+λJ(f),

where J(f) is the complexity of the model. You target is to minimizing SRM.

Algorithm

The approach to find optimized model. I guess SGD is most popular in practice.

Model Evaluation and Selection

Too complicated model causes over-fitting problems.

Regularization and Cross validation

Regularization

Simply add regularizer (or penalty term):

L(w)=1Ni=1N(f(xi;w)yi)2+λ2w2,

L1 norm works as well:
L(w)=1Ni=1N(f(xi;w)yi)2+λ2w1.

Occam's razor: The simpler the model is, the better the model is.

Cross validation

S-fold cross validation is the most often used cross validation method. Split dataset into s subsets, uses s1 of them for training and 1 for validation.

Generalization ability

Generalization error

Generalization error reflects the generalization ability of model learnt.

Generalization error bound

添加新批注
在作者公开此批注前,只有你和作者可见。
回复批注