All detection/localization implementations that I've read so far start from some sort of sliding-window approach: you search the image with bounding boxes of varying sizes and scales to find the exact location of the object. It's a straightforward approach, but computationally intensive. The following papers all use it as their baseline and show how they improve speed and accuracy over it (some high-level notes about each, but I'd read them for more in-depth info!):
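To make the baseline concrete, here's a minimal sketch of brute-force sliding-window detection (my own illustration, not from any of the papers): scan the image with boxes of several sizes, score each crop with a classifier, and keep the best. `score_fn` is a hypothetical placeholder for a real classifier.

```python
import numpy as np

def sliding_window_detect(image, score_fn, box_sizes=((64, 64), (128, 128)), stride=32):
    """Return the highest-scoring (x, y, w, h, score) over all windows."""
    H, W = image.shape[:2]
    best = None
    for (bh, bw) in box_sizes:
        for y in range(0, H - bh + 1, stride):
            for x in range(0, W - bw + 1, stride):
                crop = image[y:y + bh, x:x + bw]
                s = score_fn(crop)
                if best is None or s > best[4]:
                    best = (x, y, bw, bh, s)
    return best

# Toy usage: the "classifier" is just mean brightness, so the window
# best covering the bright square wins.
img = np.zeros((256, 256), dtype=np.float32)
img[100:160, 140:200] = 1.0  # bright object
print(sliding_window_detect(img, lambda c: float(c.mean())))
```

Even this toy version evaluates the classifier at every (position, size) pair, which is exactly the cost the papers below try to cut.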
DenseNet:
Accelerates classification over a large set of aspect ratios and image-size regions.
Creates ~25 images at multiple resolutions, batches them together, and runs inference on the whole batch at multiple scales at once (similar to building a pyramid of image scales, as in the SIFT algorithm, for example).
Hacks Caffe to do this: flattens all the differently scaled images into the size of the original input to match the expected batch size, using padding to add space between images and to reach the correct size.
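Here's a rough sketch of that packing idea (an assumption on my part, not the actual Caffe patch): resize the image to each scale and place the copies side by side on one fixed-size canvas, separated by padding, so a single forward pass covers all scales. The function name and defaults are hypothetical.

```python
import numpy as np

def build_scale_pyramid(image, scales=(1.0, 0.75, 0.5), canvas_hw=(256, 256), pad=8):
    """Resize `image` to each scale (nearest-neighbor) and place the
    copies left-to-right on a zero canvas, `pad` pixels apart.
    Returns the canvas plus each copy's (x, y, w, h) so detections
    can be mapped back to the original image's coordinates."""
    H, W = canvas_hw
    canvas = np.zeros((H, W), dtype=image.dtype)
    boxes = []
    x = 0
    for s in scales:
        h = max(1, int(image.shape[0] * s))
        w = max(1, int(image.shape[1] * s))
        # nearest-neighbor resize via index sampling (no external deps)
        ys = np.arange(h) * image.shape[0] // h
        xs = np.arange(w) * image.shape[1] // w
        small = image[ys][:, xs]
        canvas[0:h, x:x + w] = small
        boxes.append((x, 0, w, h))
        x += w + pad
    return canvas, boxes

# Usage: pack a 100x100 image at three scales into one 256x256 input.
img = np.arange(100 * 100, dtype=np.float32).reshape(100, 100)
canvas, boxes = build_scale_pyramid(img)
print(boxes)  # → [(0, 0, 100, 100), (108, 0, 75, 75), (191, 0, 50, 50)]
```

The key point is bookkeeping: keeping each copy's offset and scale lets you translate any detection on the canvas back to original-image coordinates.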
Sliding-window based; does not use pretrained weights.
Replaces the last layer with a regression layer that generates a binary object mask, then applies bounding-box regression to find the best bounding box for the object. This is an iterative process.
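The mask-to-box step above can be sketched as follows (my own minimal illustration; the network and the actual regression details are omitted): given a predicted binary object mask, take the tightest box around the foreground pixels.

```python
import numpy as np

def mask_to_bbox(mask, threshold=0.5):
    """Return (x_min, y_min, x_max, y_max) of pixels above `threshold`,
    or None if the mask predicts no object."""
    ys, xs = np.nonzero(mask > threshold)
    if ys.size == 0:
        return None
    return (int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max()))

# Toy mask: the object occupies rows 20-39 and columns 10-29.
mask = np.zeros((64, 64))
mask[20:40, 10:30] = 0.9
print(mask_to_bbox(mask))  # → (10, 20, 29, 39)
```

In the iterative scheme described above, a box like this would seed the next round: crop around it, re-predict the mask, and refine until the box stabilizes.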