This package provides tools to make the robot DoRIS speak and listen. We use ROS packages and Python libraries to play audio files, recognize speech (speech-to-text), and synthesize speech (text-to-speech).
You will need:
- ROS Noetic
- pip
To install the dependencies, run install.sh with superuser permissions:
chmod +x install.sh
sudo ./install.sh
This installs all the required dependencies.
We developed some nodes that implement speech-to-text (STT) and text-to-speech (TTS). They can be found in the nodes directory.
The audio_player node is a ROS Service that receives an audio file path and tries to play it using the PyAudio and Wave libraries (see src/butia_speech/wav_to_mouth.py). This node is also used to send movements to the jaw of the mechatronic face used on the robot DoRIS.
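As a quick illustration, here is a minimal sketch of how another node could call the audio player service. The service name and the response field used below are assumptions; check `rosservice list` and AudioPlayer.srv for the actual ones.

```python
#!/usr/bin/env python3
# Calls the audio_player service with the path of a WAV file.
# NOTE: the service name and the 'success' response field are assumptions.
import rospy
from butia_speech.srv import AudioPlayer

rospy.init_node('audio_player_client', anonymous=True)
rospy.wait_for_service('/butia_speech/audio_player')  # assumed service name
play_audio = rospy.ServiceProxy('/butia_speech/audio_player', AudioPlayer)
response = play_audio('/path/to/sound.wav')  # request: the audio path as a string
rospy.loginfo('Playback succeeded: %s', response.success)  # assumed response field
```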
The detector_hotword_node publishes to a ROS Topic when it detects a keyword: the node keeps listening and waits for a hotword. A possible use of this topic is to make the robot listen for a wake word before starting some specific task. We use the pvporcupine Python library for the keyword recognition (see src/butia_speech/detect_hotword.py).
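A minimal sketch of a node reacting to the hotword could look like the following. The topic name and message type are assumptions; check the node source or `rostopic list` for the real ones.

```python
#!/usr/bin/env python3
# Reacts to the hotword topic published by detector_hotword_node.
# NOTE: the topic name and message type are assumptions; check the node
# source (src/butia_speech/detect_hotword.py) or `rostopic list`.
import rospy
from std_msgs.msg import Empty  # assumed message type

def on_hotword(_msg):
    rospy.loginfo('Hotword detected, starting the task...')

rospy.init_node('hotword_listener', anonymous=True)
rospy.Subscriber('/butia_speech/detected_hotword', Empty, on_hotword)  # assumed topic name
rospy.spin()
```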
The speech_recognizer node is a ROS Service that, when called, listens to the microphone and returns the text of what was spoken.
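A minimal sketch of calling this service, assuming the service name and the response field shown below (see SpeechToText.srv for the real definition):

```python
#!/usr/bin/env python3
# Calls the speech recognizer service and prints the recognized text.
# NOTE: the service name and the 'text' response field are assumptions.
import rospy
from butia_speech.srv import SpeechToText

rospy.init_node('stt_client', anonymous=True)
rospy.wait_for_service('/butia_speech/speech_to_text')  # assumed service name
recognize = rospy.ServiceProxy('/butia_speech/speech_to_text', SpeechToText)
result = recognize()  # the node listens to the microphone and transcribes
rospy.loginfo('Heard: %s', result.text)  # assumed response field
```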
The speech_synthesizer node is available both as a ROS Service and as a ROS Topic. It receives a text and a language (English by default; other languages are not implemented yet). We provide both interfaces because the ROS Topic is non-blocking, so the robot can perform other tasks while it speaks.
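The sketch below contrasts the two interfaces: the service call blocks until the speech finishes, while publishing to the topic returns immediately. The service/topic names and the message field names are assumptions; check SynthesizeSpeech.srv, SynthesizeSpeechMessage.msg, and the node source for the actual ones.

```python
#!/usr/bin/env python3
# Shows the two ways to synthesize speech: the (blocking) service and the
# (non-blocking) topic. NOTE: the service/topic names and the message field
# names are assumptions; check the .srv/.msg files and the node source.
import rospy
from butia_speech.srv import SynthesizeSpeech
from butia_speech.msg import SynthesizeSpeechMessage

rospy.init_node('tts_client', anonymous=True)

# Blocking call: returns only after the sentence has been spoken.
rospy.wait_for_service('/butia_speech/synthesize_speech')  # assumed service name
synthesize = rospy.ServiceProxy('/butia_speech/synthesize_speech', SynthesizeSpeech)
synthesize('Hello, I am DoRIS.', 'en')  # request: text and language

# Non-blocking publish: the robot keeps doing other work while it speaks.
pub = rospy.Publisher('/butia_speech/synthesize_speech_topic',
                      SynthesizeSpeechMessage, queue_size=1)  # assumed topic name
rospy.sleep(1.0)  # give the publisher time to connect to the node
msg = SynthesizeSpeechMessage()
msg.text = 'Moving to the kitchen.'  # assumed field name
msg.lang = 'en'                      # assumed field name
pub.publish(msg)
```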
We created some services and messages to enable communication between our packages.
- AudioPlayer.srv receives an audio path as a string and returns a success boolean;
- SpeechToText.srv returns a text as a string;
- SynthesizeSpeech.srv receives a text and a language as strings and returns a success boolean;
- SynthesizeSpeechMessage.msg carries the text and the language as strings.
The first use of the nodes may take a while as the models are being downloaded.
To use the new TTS, you need to run the Riva server on the embedded device. First, try:
docker start riva-speech
If an error occurs, run:
cd riva...<tab>
bash riva_start.sh
To start the nodes, run:
rosrun butia_speech <node_name>.py
- Available options for <node_name> are: audio_player, detector_hotword_node, speech_recognizer, speech_synthesizer.
If you find this package useful, consider citing it using:
@misc{butia_speech,
title={Butia Speech Package},
author={{ButiaBots}},
howpublished={\url{https://github.com/butia-bots/butia_speech/}},
year={2022}
}
If you liked this repository, please don't forget to star it!