- San Diego
-
00:29
(UTC -08:00) - https://www.yi-zeng.com/
- @EasonZeng623
Highlights
- Pro
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
Making large AI models cheaper, faster and more accessible
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
A generative speech model for daily dialogue.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Adding guardrails to large language models.
Release for Improved Denoising Diffusion Probabilistic Models
Code for visualizing the loss landscape of neural nets
Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHX…
Leaked GPTs Prompts Bypass the 25 message limit or to try out GPTs without a Plus subscription.
python library for invisible image watermark (blind image watermark)
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. …
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.
AutoPrompt: Automatic Prompt Construction for Masked Language Models.