Vosk is an open-source speech recognition toolkit that supports 10 languages: English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, and Italian. Vosk works offline with a small (50 MB) but accurate model, and provides zero-latency response through a streaming API, a reconfigurable vocabulary, and speaker identification.
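
As a rough illustration of the streaming API, the following Python sketch feeds audio to a recognizer chunk by chunk and collects the text as each utterance is finalized. It assumes the `vosk` Python package is installed, that a model has been unpacked into a local directory, and that the input is a mono 16-bit PCM WAV file. The helper names (`transcribe_stream`, `read_chunks`, `main`), the paths `model` and `test.wav`, and the chunk size are placeholders chosen for this example, not part of Vosk itself.

```python
import json
import wave


def transcribe_stream(recognizer, chunks):
    """Feed audio chunks to a Vosk-style recognizer and return the recognized text.

    `recognizer` is expected to expose AcceptWaveform, Result and FinalResult,
    as vosk.KaldiRecognizer does.
    """
    pieces = []
    for chunk in chunks:
        # AcceptWaveform returns True once the recognizer decides an utterance has ended.
        if recognizer.AcceptWaveform(chunk):
            text = json.loads(recognizer.Result()).get("text", "")
            if text:
                pieces.append(text)
    # Flush whatever audio is still buffered after the last chunk.
    text = json.loads(recognizer.FinalResult()).get("text", "")
    if text:
        pieces.append(text)
    return " ".join(pieces)


def read_chunks(wav, frames_per_chunk=4000):
    """Yield raw PCM data from an open wave reader in fixed-size chunks."""
    while True:
        data = wav.readframes(frames_per_chunk)
        if not data:
            break
        yield data


def main(model_dir="model", wav_path="test.wav", phrases=None):
    # Imported here so the helpers above can be used without Vosk installed.
    from vosk import Model, KaldiRecognizer

    with wave.open(wav_path, "rb") as wav:
        model = Model(model_dir)
        if phrases:
            # Optional restricted vocabulary, passed as a JSON list of phrases.
            # Assumption: the chosen model supports vocabulary reconfiguration.
            recognizer = KaldiRecognizer(model, wav.getframerate(), json.dumps(phrases))
        else:
            recognizer = KaldiRecognizer(model, wav.getframerate())
        print(transcribe_stream(recognizer, read_chunks(wav)))
```

Calling `main("model", "test.wav")` would print the transcript of the file. Passing something like `phrases=["one two three", "[unk]"]` limits recognition to those phrases, which is one way the reconfigurable vocabulary can be used, provided the model supports it.
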
For installation instructions, examples, and documentation, visit the Vosk website.