Skip to content
View shine-xia's full-sized avatar
  • Juphoon System Software Co., Ltd.
  • China

Block or report shine-xia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Python 1,806 195 Updated Mar 8, 2025

A curated list of awesome voice conversion, projects and communities.

223 13 Updated Jan 13, 2025

Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端

Python 8,093 1,409 Updated Feb 1, 2025

Remote heart rate detection through Eulerian magnification of face videos

Python 327 55 Updated Dec 6, 2022

Desktop implementation of Remote Photoplethysmography – Measuring heart rate using facial video.

C++ 563 144 Updated Mar 4, 2025

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 753 58 Updated Dec 23, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,355 177 Updated Feb 14, 2025
Python 40 8 Updated Aug 4, 2024

Like cURL, but for gRPC: Command-line tool for interacting with gRPC servers

Go 11,330 522 Updated Feb 19, 2025

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++ 5,147 581 Updated Mar 7, 2025

Port of Funasr's Sense-voice model in C/C++

C 274 28 Updated Mar 4, 2025

利用语言模型,纠正OCR识别错误

Python 458 101 Updated May 22, 2023

papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face D…

4,593 968 Updated Feb 9, 2023

Paper collection of about the face anti-spoofing

365 57 Updated Aug 12, 2022

Clone of the mercurial repository http://zbar.hg.sourceforge.net:8000/hgroot/zbar/zbar

C 2,516 1,067 Updated Mar 18, 2024
Jupyter Notebook 94 3 Updated Mar 14, 2024

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 2,295 190 Updated Mar 3, 2025
Python 244 21 Updated Nov 28, 2024

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Jupyter Notebook 253 14 Updated Aug 12, 2024

DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer (CVPR 2024)

Python 45 4 Updated Sep 2, 2024

A comprehensive collection of IQA papers

TeX 1,124 73 Updated Jan 5, 2025

寧波話吳語拼音輸入方案 · 宁波话吴语拼音输入方案 · A Rime input schema for Ningbo Dialect

42 4 Updated Feb 27, 2025

甬江話字詞表・A dictionary for Ningbo Dialect

Python 30 Updated Feb 27, 2025

Multilingual Voice Understanding Model

Python 4,793 432 Updated Jan 8, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,609 1,154 Updated Mar 7, 2025

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,317 871 Updated Dec 10, 2024

UT-Sarulab MOS prediction system using SSL models

Python 216 14 Updated Apr 11, 2024

Application of MB-iSTFT-VITS components to vits2_pytorch

Python 123 28 Updated Nov 19, 2024

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Python 437 66 Updated Nov 17, 2022
Next