YOHO performs object detection on street-view imagery and provides audible scene descriptions. With YOHO, a visually impaired person can take out their smartphone, put in their earbuds, and take a walk down the street while their phone keeps them informed about their surroundings.
YOHO was created via transfer learning: a YOLOv2 architecture pre-trained on the COCO dataset was fine-tuned on the Berkeley DeepDrive dataset, and an inference engine, Tell a Vision, was placed on top of the model to turn its predictions into audible output.
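As a rough illustration of the second stage, the sketch below shows how per-frame detections from the fine-tuned model could be turned into a spoken scene description using an off-the-shelf text-to-speech engine (pyttsx3). The detection format, labels, and function names here are assumptions for illustration, not the actual YOHO or Tell a Vision code.

```python
# Illustrative sketch only: the detector output format and labels below are
# assumptions. It shows how detections from one street-view frame could be
# summarized and read aloud with pyttsx3 (pip install pyttsx3).

from collections import Counter

import pyttsx3


def describe_detections(detections):
    """Turn a list of (label, confidence, bbox) tuples into a short sentence.

    `detections` is assumed to come from the fine-tuned YOLOv2 model,
    e.g. [("car", 0.91, (x, y, w, h)), ("person", 0.84, (x, y, w, h))].
    """
    counts = Counter(label for label, conf, _ in detections if conf > 0.5)
    if not counts:
        return "Nothing detected ahead."
    parts = [f"{n} {label}{'s' if n > 1 else ''}" for label, n in counts.items()]
    return "Ahead of you: " + ", ".join(parts) + "."


def speak(text):
    """Read a description aloud through the device speakers."""
    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()


if __name__ == "__main__":
    # Hypothetical detector output for a single frame.
    frame_detections = [
        ("car", 0.91, (120, 200, 80, 60)),
        ("car", 0.76, (300, 210, 90, 65)),
        ("person", 0.84, (50, 180, 40, 110)),
    ]
    speak(describe_detections(frame_detections))
```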
This is the architecture of the system:
You may also download the notebook and the report from this repository.
If you have any questions or recommendations, you can reach out to me at [email protected].