Skip to content

Real-Time Grasp/Grab Detection in Videos

License

Notifications You must be signed in to change notification settings

Ivan-Sotelo/darknet

 
 

Repository files navigation

Real-Time Human Grasp Detection in Videos

This project proposes to implement an object detection network using deep learning techniques to predict in real-time if an RGB video contains a hand grasping an object.

Pre-Publication Paper: https://drive.google.com/file/d/1YBY8jsC4y6fuyIjIW1ykdgBmoluEp3AG/view?usp=sharing

Presentation Video: https://www.youtube.com/watch?v=y7nI9wQG0e8

Grab detection is the detection of hands grasping objects

Imgur

This is a fork of the implementation of AlexeyAB's implementation of YoloV4 on darknet. linked here: https://github.com/AlexeyAB/darknet

How to use:

First install the required dependencies as described in the YOLOv4 github: https://github.com/AlexeyAB/darknet#requirements

Alternatively Augmented startups provides a great 2-part step-by-step guide for Windows on Youtube: https://www.youtube.com/watch?v=5pYh1rFnNZs

https://www.youtube.com/watch?v=sUxAVpzZ8hU In this video clone this github instead of the YOLO4 github

Download the required trained weights:

https://drive.google.com/file/d/1B9WDT8EKs0NLcTynmniGzeyvvGuh_Fcc/view?usp=sharing

Put the weights in the folder

~/darknet/build/darknet/x64/backup

To use on videos go to the directory:

~/darknet/build/darknet/x64

then run on the command line:

darknet.exe detector demo data/obj.data yolo-obj.cfg backup/yolo-obj_best.weights filename_of_your_video

Our Grab Dataset:

A human grasping dataset taken from different angles.

Imgur

https://drive.google.com/file/d/1xeTGrnWud8X1A9PuonK_mwIHl6UsHeUr/view?usp=sharing

DEMO VIDEOS

Results on a video from the Grab Dataset Evaluation Set

https://drive.google.com/open?id=1Hs_dKiOXMXJupfJTYxankmKEhLa_U2Q0

Results on a video from the UTGrasp Dataset

https://drive.google.com/open?id=1L9LAARDvmwcIoDtLduz9YnWYOrOeSDeK

Experimental Results:

Average Precision results at IoU of 0.5

Imgur

Average FPS on Videos

Imgur

Comparison with other Object Detection Architectures in the task of Grab detection:

Imgur

About

Real-Time Grasp/Grab Detection in Videos

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C 63.5%
  • Cuda 15.2%
  • C++ 13.1%
  • Python 4.8%
  • CMake 1.4%
  • Batchfile 0.6%
  • Other 1.4%