Boosting Methods: AdaBoost
Machine Learning
AdaBoost
Input: Training set $T=\{(x_1,y_1),(x_2,y_2),\dots,(x_N,y_N)\}$, $x_i\in\mathcal{X}\subseteq\mathbf{R}^n$, $y_i\in\{-1,+1\}$; a weak learning algorithm.
Output: Classifier $G(x)$.
Initialize the weight distribution:
$$D_1=(w_{11},\dots,w_{1N}),\quad w_{1i}=\frac{1}{N},\quad i=1,2,\dots,N$$
For m=1,2,…,M:
- Train a classifier $G_m(x):\mathcal{X}\to\{-1,+1\}$ using weight distribution $D_m$;
- Compute the weighted error rate:
$$e_m=\sum_{i=1}^{N} w_{mi}\, I\big(G_m(x_i)\neq y_i\big)$$
- Compute the coefficient $\alpha_m$ of $G_m(x)$:
$$\alpha_m=\frac{1}{2}\log\frac{1-e_m}{e_m}$$
- Update the weight distribution of the training set, $D_{m+1}=(w_{m+1,1},w_{m+1,2},\dots,w_{m+1,N})$:
$$w_{m+1,i}=\frac{w_{mi}}{Z_m}\exp\big(-\alpha_m y_i G_m(x_i)\big)$$
where $Z_m=\sum_{i=1}^{N} w_{mi}\exp\big(-\alpha_m y_i G_m(x_i)\big)$ is a normalization factor.
- Construct the final classifier as a linear combination of the basic classifiers:
$$G(x)=\operatorname{sign}\big(f(x)\big)=\operatorname{sign}\left(\sum_{m=1}^{M}\alpha_m G_m(x)\right)$$
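The steps above can be sketched in NumPy. This is a minimal illustration, not a production implementation; the weak learner here is assumed to be a decision stump, and `fit_stump` is a hypothetical helper written for this example.

```python
import numpy as np

def fit_stump(X, y, w):
    """Hypothetical helper: best weighted decision stump (feature, threshold, sign)."""
    best = None
    for j in range(X.shape[1]):
        for thresh in np.unique(X[:, j]):
            for sign in (1, -1):
                pred = np.where(X[:, j] <= thresh, sign, -sign)
                err = np.sum(w * (pred != y))
                if best is None or err < best[0]:
                    best = (err, (j, thresh, sign), pred)
    return best[1], best[2]

def adaboost(X, y, M=10):
    """AdaBoost sketch: X is (N, n), y is (N,) with labels in {-1, +1}."""
    N = len(y)
    w = np.full(N, 1.0 / N)                # D_1: uniform initial weights
    ensemble = []
    for m in range(M):
        stump, pred = fit_stump(X, y, w)   # train G_m under D_m
        e = np.clip(np.sum(w * (pred != y)), 1e-10, 1 - 1e-10)  # error e_m
        alpha = 0.5 * np.log((1 - e) / e)  # alpha_m = 1/2 log((1-e_m)/e_m)
        w = w * np.exp(-alpha * y * pred)  # unnormalized D_{m+1}
        w /= w.sum()                       # divide by Z_m
        ensemble.append((alpha, stump))
    return ensemble

def predict(ensemble, X):
    """G(x) = sign(sum_m alpha_m G_m(x))."""
    f = np.zeros(len(X))
    for alpha, (j, thresh, sign) in ensemble:
        f += alpha * np.where(X[:, j] <= thresh, sign, -sign)
    return np.sign(f)
```

Note the clipping of $e_m$: a weak learner with zero error would otherwise make $\alpha_m$ infinite.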
Forward Stagewise Additive Modeling
AdaBoost can be derived as a special case of forward stagewise additive modeling, in which the model is an additive combination of basic classifiers and the loss function is the exponential loss.
Input: Training set $T=\{(x_1,y_1),(x_2,y_2),\dots,(x_N,y_N)\}$, $x_i\in\mathcal{X}\subseteq\mathbf{R}^n$; loss function $L(y,f(x))$; set of base functions $\{b(x;\gamma)\}$.
Output: Additive model f(x).
- Initialize $f_0(x)=0$;
- For m=1,2,…,M:
- Minimize the loss function:
$$(\beta_m,\gamma_m)=\arg\min_{\beta,\gamma}\sum_{i=1}^{N} L\big(y_i,\, f_{m-1}(x_i)+\beta b(x_i;\gamma)\big)$$
- Update $f_m(x)=f_{m-1}(x)+\beta_m b(x;\gamma_m)$;
- Obtain the additive model:
$$f(x)=f_M(x)=\sum_{m=1}^{M}\beta_m b(x;\gamma_m)$$
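For squared-error loss $L(y,f(x))=(y-f(x))^2$, the per-stage minimization takes an especially simple form, since

$$L\big(y_i,\, f_{m-1}(x_i)+\beta b(x_i;\gamma)\big)=\big(r_{mi}-\beta b(x_i;\gamma)\big)^2,\qquad r_{mi}=y_i-f_{m-1}(x_i)$$

so each stage just fits the base function to the current residuals. This observation underlies the boosting tree algorithm for regression below.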
Boosting Tree
A boosting tree is a boosting method with regression trees or classification trees as the basic classifiers. The boosting tree model can be represented as an additive model of decision trees:
$$f_M(x)=\sum_{m=1}^{M} T(x;\Theta_m)$$
Input: Training set $T=\{(x_1,y_1),(x_2,y_2),\dots,(x_N,y_N)\}$, $x_i\in\mathcal{X}\subseteq\mathbf{R}^n$, $y_i\in\mathcal{Y}\subseteq\mathbf{R}$.
Output: Boosting tree $f_M(x)$.
- Initialize $f_0(x)=0$;
- For m=1,2,…,M:
- Compute the residuals:
$$r_{mi}=y_i-f_{m-1}(x_i),\quad i=1,2,\dots,N$$
- Fit a regression tree to the residuals $r_m$, obtaining $T(x;\Theta_m)$;
- Update $f_m(x)=f_{m-1}(x)+T(x;\Theta_m)$;
- Obtain the regression boosting tree:
$$f_M(x)=\sum_{m=1}^{M} T(x;\Theta_m)$$
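The regression boosting tree above can be sketched as follows. For brevity this sketch uses depth-1 regression trees (stumps) on a single feature as $T(x;\Theta_m)$; `fit_reg_stump` and `stump_predict` are hypothetical helpers written for this example.

```python
import numpy as np

def fit_reg_stump(x, r):
    """Hypothetical helper: least-squares regression stump (split, left/right means)."""
    best = None
    for s in np.unique(x)[:-1]:                      # candidate split points
        left, right = r[x <= s], r[x > s]
        c1, c2 = left.mean(), right.mean()
        err = ((left - c1) ** 2).sum() + ((right - c2) ** 2).sum()
        if best is None or err < best[0]:
            best = (err, (s, c1, c2))
    return best[1]

def stump_predict(stump, x):
    s, c1, c2 = stump
    return np.where(x <= s, c1, c2)

def boosting_tree(x, y, M=5):
    """Boosting tree for regression: each stage fits a tree to the residuals."""
    f = np.zeros_like(y, dtype=float)    # f_0(x) = 0
    trees = []
    for m in range(M):
        r = y - f                        # residuals r_mi = y_i - f_{m-1}(x_i)
        tree = fit_reg_stump(x, r)       # fit T(x; Theta_m) to the residuals
        f += stump_predict(tree, x)      # f_m = f_{m-1} + T(x; Theta_m)
        trees.append(tree)
    return trees, f
```

Each iteration shrinks the residuals, so the training error of $f_M$ decreases monotonically with $M$.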