Translate to: 简体中文
Batch convert video to text using openai's whisper or the local coreML whisper.cpp
The "tiktok-whisper" tool can batch convert video to text using OpenAI's Whisper or the local coreML Whisper.cpp. It has features like exporting copy as Excel, saving conversion results to SQLite or PostgreSQL, video duration statistics, and keyword search to locate videos. The tool also provides options to use whisper_cpp + coreML for local transcription and pgvector for vectorized search(yet to be implemented).
- Batch convert videos to text
- Save conversion results to SQLite or PostgreSQL
- Video duration statistics
- Export copy as Excel
- Use whisper_cpp + coreML for local transcription
- batch download xiaoyuzhou podcasts with a simple url
cd ./internal/app
go install github.com/google/wire
# modify binaryPath and modelPath manually
wire
cd tiktok-whisper
go build -o v2t ./cmd/v2t/main.go
./v2t help
windows
cd tiktok-whisper
go build -o v2t.exe .\cmd\v2t\main.go
.\v2t.exe help
# Convert only one file
./v2t convert -audio --input ./test/data/test.mp3
# Convert all files in directory with specified file extension
./v2t convert -audio --directory ./test/data --type m4a
# Convert all mp4 files in the specified directory to text
./v2t convert --video --directory "./test/data/mp4" --userNickname "testUser"
# download xiaoyuzhou with single episode url
./v2t download xiaoyuzhou -e "https://www.xiaoyuzhoufm.com/episode/6398c6ae3a2b7eba5ceb462f"
# or an episode list
./v2t download xiaoyuzhou -e "https://www.xiaoyuzhoufm.com/episode/6398c6ae3a2b7eba5ceb462f,https://www.xiaoyuzhoufm.com/episode/6445559d420fc63f0b9e5747"
# download all episodes from xiaoyuzhou with podcast url
./v2t download xiaoyuzhou -p "https://www.xiaoyuzhoufm.com/podcast/61e389402454b42a2b06177c"
- Keyword search to locate videos
- Original video jump link
- Like, share, and comment statistics
- Use pgvector for vectorized search