@nrailgun
2015-10-18T14:08:15.000000Z
字数 934
阅读 1428
机器学习
Zero-center data, then normalize data. In practice, you may also see PCA and whitening.
Select apropriate architecture. Set weights to small random numbers and biases to zero. Usually
Tip:
1. Make sure you can overfit very small portion of the data.
2. Loss not going down: learning rate too low.
3. Loss exploding: learning rate too high.
4. "Coarse to Fine" cross-validation in stages.
5. Visualization.
Dropout: Drop a neuron at each iteration with probability