Input: Image
Output: Class label
Evaluation metric: Accuracy
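As a minimal sketch of the accuracy metric named above, assuming predictions and ground-truth labels arrive as two equal-length sequences (the function name `accuracy` is illustrative, not taken from the notes):

```python
def accuracy(predicted, actual):
    """Fraction of examples whose predicted class label matches the true label."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("need at least one example")
    # Count positions where the two label sequences agree.
    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    return correct / len(actual)
```

For instance, three correct labels out of four gives an accuracy of 0.75.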
Input: Image
Output: Box in the image
Evaluation metric: Intersection over union
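Intersection over union can be sketched as follows. The box format `(x1, y1, x2, y2)` with `x1 < x2` and `y1 < y2`, and the function name `iou`, are assumptions made for illustration; the notes do not specify a box encoding.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region, clamped to zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter

    # Degenerate (zero-area) boxes have no meaningful overlap.
    return inter / union if union > 0 else 0.0
```

Identical boxes score 1, disjoint boxes score 0, and partial overlaps fall in between.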
Detection as regression? The number of objects differs from image to image, so the network would need variable-sized outputs.
Detection as classification? Need to test many positions and scales, which is expensive. Solution: only look at a tiny subset of possible positions (region proposals, e.g. Selective Search).