audio2shapes API

从语音预测每一帧的 52 个 blendshape 值，输入语音，返回以 json 格式存储的 blendshape。我们为了先看模型训练的效果也为了简化训练的过程，目前仅预测了jawopen的值。

调用方式

POST

运行

实验环境

conda create -n flask python=3.9
conda activate flask
pip install -r requirements.txt

预训练模型

下载transformer_model ，并放在 utils/ 目录下。

服务端

启动服务端

python app.py

客户端

先在 config.json 修改服务端返回的公网地址，如下图中的 http://172.31.70.115:8888。务必在 config.json 中配置正确的语音文件路径，目前支持格式：wav。

然后启动客户端请求

python launch.py -c ./config.json

结果

结果会以json格式写在 res.json 文件中。

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
audios		audios
static/images		static/images
utils		utils
README.md		README.md
app.py		app.py
config.json		config.json
launch.py		launch.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

audio2shapes API

调用方式

运行

实验环境

预训练模型

服务端

客户端

结果

About

Releases

Packages

Languages

Symbolzzz/audio2shapes

Folders and files

Latest commit

History

Repository files navigation

audio2shapes API

调用方式

运行

实验环境

预训练模型

服务端

客户端

结果

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages