Skip to content

A system to perform one among a set of some predefined actions based on a voice based user input.

Notifications You must be signed in to change notification settings

rohan843/Paradox

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Paradox

This is a system that can be activated via a trigger word. Based on what is said to it once activated, and a predefined set of tasks, it can execute the most relevant task.

At its core, the system checks the semantic similarity between a spoken command and a predefined directive (to each of which we have associated a task), via a trained siamese network. It then matches the command to the most relevant task and executes it.

Primarily, the system is intended for use in IoT applications and in low resource real time environments.

Usage

Use this google drive link to download the 3 files: my_model.zip, sentence_matching_model.zip and glove.6B.100d.txt.

Unzip the 2 folders, and place them in the root project directory. Place the txt file in the root project directory too.

Then, the various files may be used.

TODO: Create a virtual environment

Packages Used

TODO: Convert this list to a requirements.txt file

  1. pip install pyyaml h5py whisper playsound==1.2.2 pyaudio

Important: Use pip install playsound==1.2.2 to install playsound

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages