Speech Note

Linux desktop and Sailfish OS app for note taking, reading and translating with Speech to Text, Text to Speech and Machine Translation

Description

Speech Note lets you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet.

Speech Note uses several different processing engines to do its job. The following engines are currently used:

Speech to Text (STT)
- DeepSpeech
- Whisper
- Vosk
Text to Speech (TTS)
- espeak-ng
- MBROLA
- Piper
- RHVoice
- Coqui TTS
Machine Translation (MT)
- Bergamot Translator

Languages and Models

The following languages are supported:

Lang ID	Name	DeepSpeech (STT)	Whisper (STT)	Vosk (STT)	Piper (TTS)	RHVoice (TTS)	espeak (TTS)	MBROLA (TTS)	Coqui (TTS)	Bergamot (MT)
am	Amharic	● (e)	● (e)				●
ar	Arabic		● (e)	●			●	●		●
bg	Bulgarian		● (e)				●
bn	Bengali		● (e)				●		●
bs	Bosnian		● (e)				●
ca	Catalan	●	●	●	●		●		●	●
cs	Czech	●	●	●		●	●	●	●	●
da	Danish		● (e)		●		●		●	●
de	German	●	●	●	●		●		●	●
el	Greek	● (e)	● (e)		●		●		●
en	English	●	●	●	●	●	●		●	●
eo	Esperanto			●		●	●
es	Spanish	●	●	●	●		●		●	●
et	Estonian	● (e)	● (e)				●	●	●	●
eu	Basque	● (e)	● (e)				●
fa	Persian	●	● (e)	●			●	●	●	●
fi	Finnish	●	●		●		●		●
fr	French	●	●	●	●		●		●	●
ga	Irish						●		●
hi	Hindi		● (e)	●			●
hr	Croatian		●				●	●	●
hu	Hungarian	● (e)	●				●	●	●
id	Indonesian	● (e)	●				●	●
is	Icelandic		● (e)		●		●			●
it	Italian	●	●	●	●		●		●	●
jp	Japanese		●	●			●
ka	Georgian		● (e)		●	●	●
kk	Kazakh		● (e)	●	●		●
ko	Korean		● (e)	●			●
ky	Kyrgyz					●	●
lb	Luxembourgish				●
lt	Lithuanian		● (e)				●	●	●
lv	Latvian	●	● (e)				●		●
mk	Macedonian		● (e)			●	●
mn	Mongolian	● (e)	● (e)
ms	Malay		●				●	●
mt	Maltese		● (e)				●		●
ne	Nepali		● (e)		●		●
nl	Dutch	● (e)	●	●	●		●		●	●
no	Norwegian		●		●		●			●
pl	Polish	●	●	●	●	●	●	●	●	●
pt	Portuguese	● (e)	●	●	●		●	●	●	●
ro	Romanian	● (e)	●		●		●	●	●
ru	Russian	●	●	●	●	●	●			●
sk	Slovak		●			●	●		●
sl	Slovenian	● (e)	●				●		●
sq	Albanian		● (e)			●	●
sr	Serbian		● (e)		●		●
sv	Swedish		●	●	●		●	●	●
sw	Swahili	●	● (e)		●		●
th	Thai	● (e)	● (e)				●
tl	Tagalog		● (e)	●
tr	Turkish	● (e)	●	●	●		●	●
tt	Tatar		● (e)			●	●
uk	Ukrainian	●	●	●	●	●	●		●	●
uz	Uzbek		● (e)	●			●
vi	Vietnamese		● (e)	●	●		●
yo	Yoruba	● (e)	● (e)
zh	Chinese	●	● (e)	●	●		●		●

(e) experimental, most likely doesn't work well
(*) Coqui TTS models are only available on x86-64

Language models can be downloaded directly from the app.

Details of models which are currently configured for download are described in models.json (GitHub) or models.json (GitLab).

Contributions

Any contribution is very welcome!

The project is hosted on both GitHub and GitLab. Feel free to make a PR/MR, report an issue or request a new feature on whichever platform you prefer.

Translation

Translation files in Qt format are in the translations dir (GitHub) or translations dir (GitLab).

The preferred way to contribute a translation is via the Transifex service, but if you would like to make a direct PR/MR, please do so.

Download

Linux Desktop: Flatpak
Sailfish OS: OpenRepos

Building from sources

Linux

The following tools/libraries are required for building (example packages for Ubuntu 22.04): build-essential cmake git autoconf qtbase5-dev qtdeclarative5-dev qtmultimedia5-dev qtquickcontrols2-5-dev python3-dev zlib1g-dev libtool libboost-all-dev.
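On Ubuntu 22.04 the packages listed above could be installed in one step. The sketch below assumes apt and sudo are available. The DRY_RUN switch and the run helper are illustrative and not part of the project; by default the script only prints the command.

```shell
#!/bin/sh
# Build-time packages exactly as listed above (Ubuntu 22.04 names).
BUILD_DEPS="build-essential cmake git autoconf qtbase5-dev qtdeclarative5-dev \
qtmultimedia5-dev qtquickcontrols2-5-dev python3-dev zlib1g-dev libtool libboost-all-dev"

# Hypothetical helper: prints the command unless DRY_RUN=0 is set.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then echo "$*"; else "$@"; fi
}

# BUILD_DEPS is left unquoted on purpose so it splits into package names.
run sudo apt install -y $BUILD_DEPS
```

Run it with DRY_RUN=0 to perform the installation instead of printing the command.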

git clone <git repository url>

cd dsnote
mkdir build
cd build

cmake ../ -DCMAKE_BUILD_TYPE=Release -DWITH_DESKTOP=ON
make

At runtime, the app requires the following Qt QML modules (example packages for Ubuntu 22.04): qml-module-qtquick-controls qml-module-qtquick-dialogs qml-module-qtquick-controls2 qml-module-qtquick-layouts.

To make the Python components work (i.e. Coqui TTS models and the 'Restore punctuation' feature), the following Python libraries have to be installed (pip package names): torch torchaudio transformers accelerate TTS.
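A minimal sketch of installing these pip packages, assuming pip is available for the same Python 3 interpreter the app uses. The DRY_RUN switch and the run helper are illustrative only; by default the command is printed rather than executed.

```shell
#!/bin/sh
# pip package names exactly as listed above.
PY_DEPS="torch torchaudio transformers accelerate TTS"

# Hypothetical helper: prints the command unless DRY_RUN=0 is set.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then echo "$*"; else "$@"; fi
}

# PY_DEPS is left unquoted on purpose so it splits into package names.
run python3 -m pip install $PY_DEPS
```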

To build without support for Python components, add -DWITH_PY=OFF to the cmake step.
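For illustration, the configure and build steps from above with Python support disabled might look like this. It is meant to be run from the build directory created earlier. The DRY_RUN switch and the run helper are illustrative and not part of the project.

```shell
#!/bin/sh
# Same flags as the Linux build above, plus -DWITH_PY=OFF.
CMAKE_FLAGS="-DCMAKE_BUILD_TYPE=Release -DWITH_DESKTOP=ON -DWITH_PY=OFF"

# Hypothetical helper: prints the command unless DRY_RUN=0 is set.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then echo "$*"; else "$@"; fi
}

# Run from dsnote/build; CMAKE_FLAGS is unquoted so each flag is a separate argument.
run cmake ../ $CMAKE_FLAGS
run make
```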

Flatpak

git clone <git repository url>

cd dsnote/flatpak

flatpak-builder --user --install-deps-from=flathub --repo="/path/to/local/flatpak/repo" "/path/to/output/dir" net.mkiol.SpeechNote.yaml
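After flatpak-builder has exported the build to the local repository, the app can typically be installed from that repository. The sketch below rests on assumptions: the remote name speechnote-local is made up for this example, and the application ID net.mkiol.SpeechNote is inferred from the manifest filename, not confirmed by this README. By default the commands are only printed.

```shell
#!/bin/sh
REPO_DIR="/path/to/local/flatpak/repo"  # same path passed to --repo above
REMOTE_NAME="speechnote-local"          # hypothetical remote name
APP_ID="net.mkiol.SpeechNote"           # assumed from the manifest filename

# Hypothetical helper: prints the command unless DRY_RUN=0 is set.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then echo "$*"; else "$@"; fi
}

# Register the local repo as an unsigned user remote, then install and start the app.
run flatpak --user remote-add --no-gpg-verify --if-not-exists "$REMOTE_NAME" "$REPO_DIR"
run flatpak --user install -y "$REMOTE_NAME" "$APP_ID"
run flatpak run "$APP_ID"
```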

Sailfish OS

git clone <git repository url>

cd dsnote
mkdir build
cd build

sfdk config --session specfile=../sfos/harbour-dsnote.spec
sfdk config --session target=SailfishOS-4.4.0.58-aarch64
sfdk cmake ../ -DCMAKE_BUILD_TYPE=Release -DWITH_SFOS=ON
sfdk package

Libraries

Speech Note relies on the following open source projects:

License

Speech Note is developed as an open source project under the Mozilla Public License Version 2.0.
