Skip to content

Commit

Permalink
small readme updates
Browse files Browse the repository at this point in the history
  • Loading branch information
gkucsko committed Apr 17, 2023
1 parent 2345898 commit 5dc6a4d
Showing 1 changed file with 28 additions and 27 deletions.
55 changes: 28 additions & 27 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -59,35 +59,37 @@ audio_array = generate_audio(text_prompt)

[lion.webm](https://user-images.githubusercontent.com/5068315/230684766-97f5ea23-ad99-473c-924b-66b6fab24289.webm)

### πŸ‘₯ Speaker Prompts
### 🎀 Voice/Audio Cloning

You can provide certain speaker prompts such as NARRATOR, MAN, WOMAN, etc. Please note that these are not always respected, especially if a conflicting audio history prompt is given.
Bark has the capability to fully clone voices - including tone, pitch, emotion and prosody. The model also attempts to preserve music, ambient noise, etc. from input audio. However, to mitigate misuse of this technology, we limit the audio history prompts to a limited set of Suno-provided, fully synthetic options to choose from for each language. Specify following the pattern: `{lang_code}_speaker_{number}`.

```python
text_prompt = """
WOMAN: I would like an oatmilk latte please.
MAN: Wow, that's expensive!
I have a silky smooth voice, and today I will tell you about
the exercise regimen of the common sloth.
"""
audio_array = generate_audio(text_prompt)
audio_array = generate_audio(text_prompt, history_prompt="en_speaker_1")
```

[latte.webm](https://user-images.githubusercontent.com/5068315/230684864-12d101a1-a726-471d-9d56-d18b108efcb8.webm)

### 🎀 Voice/Audio Cloning
[sloth.webm](https://user-images.githubusercontent.com/5068315/230684883-a344c619-a560-4ff5-8b99-b4463a34487b.webm)

Bark has the capability to fully clone voices - including tone, pitch, emotion and prosody. The model also attempts to preserve music, ambient noise, etc. from input audio. However, to mitigate misuse of this technology, we limit the audio history prompts to a limited set of Suno-provided, fully synthetic options to choose from.
*Note: since Bark recognizes languages automatically from input text, it is possible to use for example a german history prompt with english text. This usually leads to english audio with a german accent.*


### πŸ‘₯ Speaker Prompts

You can provide certain speaker prompts such as NARRATOR, MAN, WOMAN, etc. Please note that these are not always respected, especially if a conflicting audio history prompt is given.

```python
text_prompt = """
I have a silky smooth voice, and today I will tell you about
the exercise regimen of the common sloth.
WOMAN: I would like an oatmilk latte please.
MAN: Wow, that's expensive!
"""
audio_array = generate_audio(text_prompt, history_prompt="en_speaker_1")
audio_array = generate_audio(text_prompt)
```

[sloth.webm](https://user-images.githubusercontent.com/5068315/230684883-a344c619-a560-4ff5-8b99-b4463a34487b.webm)
[latte.webm](https://user-images.githubusercontent.com/5068315/230684864-12d101a1-a726-471d-9d56-d18b108efcb8.webm)


## πŸ’» Installation

Expand Down Expand Up @@ -129,24 +131,23 @@ Below is a list of some known non-speech sounds, but we are finding more every d
- capitalization for emphasis of a word
- `MAN/WOMAN:` for bias towards speaker


**Supported Languages**

| Language | Status |
| --- | --- |
| Chinese (Mandarin) | βœ… |
| English | βœ… |
| French | βœ… |
| German | βœ… |
| Hindi | βœ… |
| Italian | βœ… |
| Japanese | βœ… |
| Korean | βœ… |
| Polish | βœ… |
| Portuguese | βœ… |
| Russian | βœ… |
| Spanish | βœ… |
| Turkish | βœ… |
| English (en) | βœ… |
| German (de) | βœ… |
| Spanish (es) | βœ… |
| French (fr) | βœ… |
| Hindi (hi) | βœ… |
| Italian (it) | βœ… |
| Japanese (ja) | βœ… |
| Korean (ko) | βœ… |
| Polish (pl) | βœ… |
| Portuguese (pt) | βœ… |
| Russian (ru) | βœ… |
| Turkish (tr) | βœ… |
| Chinese, simplified (zh) | βœ… |
| Arabic | Coming soon! |
| Bengali | Coming soon! |
| Telugu | Coming soon! |
Expand Down

0 comments on commit 5dc6a4d

Please sign in to comment.