Starred repositories
Python - 100天从新手到大师
🦜🔗 Build context-aware reasoning applications
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🔊 Text-Prompted Generative Audio Model
12 Weeks, 24 Lessons, AI for All!
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A guidance language for controlling large language models.
Instruct-tune LLaMA on consumer hardware
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
StableLM: Stability AI Language Models
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
llama3 implementation one matrix multiplication at a time
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Examples and guides for using the Gemini API
Official inference library for Mistral models
This repository contains demos I made with the Transformers library by HuggingFace.
YSDA course in Natural Language Processing
Best Practices, code samples, and documentation for Computer Vision.
Anthropic's educational courses
A series of large language models trained from scratch by developers @01-ai
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
YOLOv6: a single-stage object detection framework dedicated to industrial applications.