This solution demonstrates the application of Microsoft AI and GPT to help solve challenges common to contact center scenarios
Author: Jixin Jia (Gin)
Created: 2023/03/01
Version: 2.0
Transcribe
conversations in real-time- Simultaneously
translate
into multiple languages - Identify speaker by
diarizing
the conversation - Analyze speech
sentiment
in real-time Summarize
conversation and get a list of key topics discussed usingAzure OpenAI
-
This WSGI demo app runs on single instance of Gunicorn for production workload. It is recommended to build it on a docker orchestration service if you target a more scalable workload.
-
All transaction runs within browser (client end). There are no data transmitted to or stored at the backend server in this demo.
-
To run the demo following Azure services must be provisioned and provided at run time:
Azure resource | Purpose | Required information |
---|---|---|
Azure AI Service | For performing live speech-to-text, speaker diarization, translation and sentiment analysis | Azure AI key , Azure AI endpoint , Azure AI region , Speech language , Translate langage |
Azure OpenAI Service | For generating a conversation summary, identify key entities mentioned and categorize conversation into a main topic | Azure OpenAI key , Azure OpenAI endpoint , Deployed chatgpt model name |
Install dependencies. This solution has been tested on Python 3.7~3.10
pip install -r requirements.txt
For Unix:
./run.sh
For Windows:
waitress-serve --listen=0.0.0.0:80 wsgi:app
Type following in address bar:
localhost:80
Enjoy the demo !
Special thanks to Nobu Tanahashi for his ingenious Conversation Transcription source code and idea used in this demo.