[关闭]
@nrailgun 2015-10-18T14:08:15.000000Z 字数 934 阅读 1428

CNNVR: Training Neural Network

机器学习


Step 1: Process Data

Zero-center data, then normalize data. In practice, you may also see PCA and whitening.

Step 2: Choose the Architecture

Select apropriate architecture. Set weights to small random numbers and biases to zero. Usually WN(0,0.01) works, if not, normalize by square root of fan in.

Tip:
1. Make sure you can overfit very small portion of the data.
2. Loss not going down: learning rate too low.
3. Loss exploding: learning rate too high.
4. "Coarse to Fine" cross-validation in stages.
5. Visualization.

Regularization knobs

Dropout: Drop a neuron at each iteration with probability p.

Learning Rate

SGD+Momentum>SGD. Momentum 0.9 usually works well.

ν=γν+αθJ(θ,x(i),y(i))

θ=θν

where ν is the current velocity vector, momentum α is learning rate, γ[0,1). Decreasing the learning rate over time.

添加新批注
在作者公开此批注前,只有你和作者可见。
回复批注