-
omniparse Public
Forked from adithya-s-k/omniparseIngest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Python GNU General Public License v3.0 UpdatedJun 30, 2024 -
fms-fsdp Public
Forked from foundation-model-stack/fms-fsdp🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
Python Apache License 2.0 UpdatedJun 6, 2024 -
silero-vad Public
Forked from snakers4/silero-vadSilero VAD: pre-trained enterprise-grade Voice Activity Detector
Python MIT License UpdatedApr 22, 2024 -
StyleTTS2 Public
Forked from yl4579/StyleTTS2StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Python MIT License UpdatedApr 14, 2024 -
SALMONN Public
Forked from bytedance/SALMONNSALMONN: Speech Audio Language Music Open Neural Network
Python Apache License 2.0 UpdatedApr 11, 2024 -
Macaw-LLM Public
Forked from lyuchenyang/Macaw-LLMMacaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Python Apache License 2.0 UpdatedApr 3, 2024 -
speech-dataset-generator Public
Forked from davidmartinrius/speech-dataset-generator🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
Python MIT License UpdatedMar 25, 2024 -
mergekit-qwen2 Public
Forked from Aratako/mergekit-qwen2Tools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 UpdatedMar 24, 2024 -
llama2.c Public
Forked from karpathy/llama2.cInference Llama 2 in one file of pure C
C MIT License UpdatedMar 20, 2024 -
AutoAWQ Public
Forked from casper-hansen/AutoAWQAutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Python MIT License UpdatedMar 18, 2024 -
TencentPretrain Public
Forked from Tencent/TencentPretrainTencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Python Other UpdatedMar 13, 2024 -
lightllm Public
Forked from ModelTC/lightllmLightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Python Apache License 2.0 UpdatedMar 12, 2024 -
kotoba-recipes Public
Forked from kotoba-tech/kotoba-recipesSupport Continual pre-training & Instruction Tuning forked from llama-recipes
Python UpdatedFeb 17, 2024 -
Chinese-LLaMA-Alpaca Public
Forked from ymcui/Chinese-LLaMA-Alpaca中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Python Apache License 2.0 UpdatedJan 29, 2024 -
S-LoRA Public
Forked from S-LoRA/S-LoRAS-LoRA: Serving Thousands of Concurrent LoRA Adapters
Python Apache License 2.0 UpdatedJan 21, 2024 -
yarn Public
Forked from jquesnelle/yarnYaRN: Efficient Context Window Extension of Large Language Models
Python MIT License UpdatedJan 18, 2024 -
awesome-japanese-llm Public
Forked from llm-jp/awesome-japanese-llm日本語LLMまとめ - Overview of Japanese LLMs
Apache License 2.0 UpdatedJan 13, 2024 -
VITS-fast-fine-tuning Public
Forked from Plachtaa/VITS-fast-fine-tuningThis repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Python Apache License 2.0 UpdatedJan 5, 2024 -
dsir Public
Forked from p-lambda/dsirDSIR large-scale data selection framework for language model training
Python MIT License UpdatedJan 2, 2024 -
-
GPU-Benchmarks-on-LLM-Inference Public
Forked from XiongjieDai/GPU-Benchmarks-on-LLM-InferenceMultiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
Jupyter Notebook UpdatedDec 18, 2023 -
NLP-Tutorials-with-HuggingFace Public
Forked from laxmimerit/NLP-Tutorials-with-HuggingFaceLearn NLP Tutorials with HuggingFace Transformers
Jupyter Notebook UpdatedDec 10, 2023 -
vits Public
Forked from jaywalnut310/vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Python MIT License UpdatedDec 6, 2023 -
CodeGen Public
Forked from salesforce/CodeGenCodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
Python Apache License 2.0 UpdatedNov 21, 2023 -
so-vits-svc Public
Forked from svc-develop-team/so-vits-svcSoftVC VITS Singing Voice Conversion
Python GNU Affero General Public License v3.0 UpdatedNov 11, 2023 -
amadeus Public
Forked from e-p-armstrong/amadeusCreate RP training data from a VN, using GPT-4
Jupyter Notebook MIT License UpdatedNov 2, 2023 -
UltraChat Public
Forked from thunlp/UltraChatLarge-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Python MIT License UpdatedOct 27, 2023 -
baize-chatbot Public
Forked from project-baize/baize-chatbotLet ChatGPT teach your own chatbot in hours with a single GPU!
Python GNU General Public License v3.0 UpdatedJul 15, 2023 -
Text-to-CQL Public
Forked from Guoaibo/Text-to-CQLWe propose the Text-to-CQL task and provide the dataset.
UpdatedJun 26, 2023 -
alpaca_ja Public
Forked from shi3z/alpaca_jaalpacaデータセットを日本語化したものです
Apache License 2.0 UpdatedJun 3, 2023