SFND 2D Feature Tracking

The idea of the camera course is to build a collision detection system - that's the overall goal for the Final Project. As a preparation for this, you will now build the feature tracking part and test various detector / descriptor combinations to see which ones perform best. This mid-term project consists of four parts:

First, you will focus on loading images, setting up data structures and putting everything into a ring buffer to optimize memory load.
Then, you will integrate several keypoint detectors such as HARRIS, FAST, BRISK and SIFT and compare them with regard to number of keypoints and speed.
In the next part, you will then focus on descriptor extraction and matching using brute force and also the FLANN approach we discussed in the previous lesson.
In the last part, once the code framework is complete, you will test the various algorithms in different combinations and compare them with regard to some performance measures.

See the classroom instruction and code comments for more details on each of these parts. Once you are finished with this project, the keypoint matching part will be set up and you can proceed to the next lesson, where the focus is on integrating Lidar points and on object detection using deep-learning.

Dependencies for Running Locally

cmake >= 2.8

All OSes: click here for installation instructions

make >= 4.1 (Linux, Mac), 3.81 (Windows)

Linux: make is installed by default on most Linux distros
Mac: install Xcode command line tools to get make
Windows: Click here for installation instructions

OpenCV >= 4.1

All OSes: refer to the official instructions
This must be compiled from source using the -D OPENCV_ENABLE_NONFREE=ON cmake flag for testing the SIFT and SURF detectors. If using homebrew: $> brew install --build-from-source opencv will install required dependencies and compile opencv with the opencv_contrib module by default (no need to set -DOPENCV_ENABLE_NONFREE=ON manually).
The OpenCV 4.1.0 source code can be found here

gcc/g++ >= 5.4

Linux: gcc / g++ is installed by default on most Linux distros
Mac: same deal as make - install Xcode command line tools
Windows: recommend using either MinGW-w64 or Microsoft's VCPKG, a C++ package manager. VCPKG maintains its own binary distributions of OpenCV and many other packages. To see what packages are available, type vcpkg search at the command prompt. For example, once you've VCPKG installed, you can install OpenCV 4.1 with the command:

c:\vcpkg> vcpkg install opencv4[nonfree,contrib]:x64-windows

Then, add C:\vcpkg\installed\x64-windows\bin and C:\vcpkg\installed\x64-windows\debug\bin to your user's PATH variable. Also, set the CMake Toolchain File to c:\vcpkg\scripts\buildsystems\vcpkg.cmake.

Basic Build Instructions

Clone this repo.
Build Docker container docker build . -t feature_tracking
Allow local connections to your X server xhost +local:
Run the docker container and forward display docker run -it -e DISPLAY=$DISPLAY -v /tmp/.X11-unix:/tmp/.X11-unix -v $(pwd):/app feature_tracking
(From the container) Make a build directory in the top level directory: mkdir build && cd build
(From the container) Compile: cmake .. && make
(From the container) Run it: ./2D_feature_tracking.

Task 7

Compiling using thw CMake file generates a executable ./task7 running it counts the number of keypoints on the preceding vehicle for all 10 images and all the detectors. It generates one .dat file for each detector in output/task7/. To visualize the results we provide a script plot.py. To run it you must install the dependencies first: from output/task7/ run python -m venv venv && source venv/bin/activate && pip install -r requirements.txt. Then run python plot.py FILE.dat. For example here is the result obtained with orb.dat:

Task 8

The number of matched keypoints for all 10 images using all possible combinations of detectors and descriptors. In the matching step, the BF approach is used with the descriptor distance ratio set to 0.8.

Detector / Descriptor	AKAZE	BRIEF	BRISK	FREAK	ORB	SIFT
sift	n/a	1250	1249	1240	Out of Memory Error	1250
brisk	n/a	2508	2508	2326	2508	2508
fast	n/a	3693	3693	3693	3693	3693
harris	n/a	214	214	214	214	214
shitomasi	n/a	1067	1067	1067	1067	1067
orb	n/a	1033	950	549	1033	1033
akaze	1491	1491	1491	1491	1491	1491

Task 9

Average Processing Time (ms) on all images for each detector/descriptor combination.

Detector / Descriptor	AKAZE	BRIEF	BRISK	FREAK	ORB	SIFT
sift		58.37	222.22	84.54		128.85
brisk		207.96	378.53	222.17	211.88	218.34
fast		4.08	170.74	28.48	4.64	16.03
harris		11.1	178.83	35.54	12.72	19.98
shitomasi		10.68	180.03	35.98	10.76	19.53
orb		7.71	174.05	31.28	11.29	27.68
akaze	86.72	44.83	206.57	71.98	44.76	58.08

Conclusion

To recommend the top 3 detector/descriptor combinations for the purpose of detecting keypoints on vehicles, we need to consider both the number of matched keypoints (as a measure of effectiveness) and the average processing time (as a measure of efficiency). The ideal combinations will have a high number of matched keypoints and a relatively low processing time.

Let's consider that both speed (average processing time) and the number of keypoints are equally important for determining the effectiveness of detector/descriptor combinations, we need to balance these two factors to find the top 3 choices. Our chosen approach is to calculate a combined score that takes both aspects into account.

One way to do this is by normalizing both the number of keypoints and processing times, and then calculating a composite score. However, since higher keypoints are better and lower times are better, we need to invert one of these measures. We'll invert the processing time so that for both metrics, a higher score is better.

Detector / Descriptor	AKAZE	BRIEF	BRISK	FREAK	ORB	SIFT
akaze	1.183	1.295	0.863	1.222	1.295	1.260
brisk	0.000	1.135	0.679	1.047	1.124	1.107
fast	0.000	2.000	1.555	1.935	1.999	1.968
harris	0.000	1.039	0.591	0.974	1.035	1.015
orb	0.000	1.270	0.803	1.076	1.260	1.217
shitomasi	0.000	1.271	0.819	1.204	1.271	1.248
sift	0.000	1.193	0.756	1.121	0.000	1.005

Based on the composite scores that consider both the number of matched keypoints and the average processing time, here are the top 3 detector/descriptor combinations for detecting keypoints on vehicles:

FAST + BRIEF: This combination is the top choice, with the highest composite score, indicating an excellent balance of a high number of matched keypoints and extremely low processing time.
FAST + ORB: This combination almost ties with FAST + BRIEF, showcasing a similar balance of effectiveness in keypoint detection and efficiency in processing speed.
FAST + FREAK: This combination still demonstrates a strong balance between the number of keypoints detected and processing efficiency.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.github/workflows		.github/workflows
images		images
output		output
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
CODEOWNERS		CODEOWNERS
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SFND 2D Feature Tracking

Dependencies for Running Locally

Basic Build Instructions

Task 7

Task 8

Task 9

Conclusion

About

Releases

Packages

Languages

License

yfe404/SFND_2D_Feature_Tracking

Folders and files

Latest commit

History

Repository files navigation

SFND 2D Feature Tracking

Dependencies for Running Locally

Basic Build Instructions

Task 7

Task 8

Task 9

Conclusion

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages