CNNVR: CNN
机器学习
Input volume of size [W1×H1×D1], using K neurons with receptive fields F×F and applying them at strides of S gives output volumn [W2,H2,D2], where W2=W1−FS−1, H2=H1−FS−1, and D2=K. In practice is common to zero pad the border with F−12 0s.
The number weight will be H2×W2×K×F×F×D1, and is too big. Weight sharing reduce this number to K×F×F×D1. Note: Sometimes globally weight sharing is not a good idea.
Size tricks
- Start with image that has power-of-2 size.
- Use stride 1 filter size 3×3 pad input with a border of 0s.
- Use pool size 2×2.