Skip to content
View cwt000297's full-sized avatar

Block or report cwt000297

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 126,194 10,229 Updated Feb 15, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Ultralytics / veRL …

Python 784 60 Updated Feb 13, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,507 4,972 Updated Feb 14, 2025

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python 10,583 674 Updated Feb 15, 2025

Fully open reproduction of DeepSeek-R1

Python 19,896 1,706 Updated Feb 14, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,296 1,733 Updated Feb 14, 2025

Train transformer language models with reinforcement learning.

Python 11,621 1,566 Updated Feb 14, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 139,370 27,938 Updated Feb 14, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 9,637 915 Updated Feb 15, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,446 472 Updated Feb 15, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 37,883 5,692 Updated Feb 15, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 7,275 514 Updated Feb 10, 2025

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Python 37,210 3,839 Updated Jan 2, 2025

Collection of AWESOME vision-language models for vision tasks

2,490 196 Updated Dec 3, 2024

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Python 1,057 142 Updated Aug 4, 2024

Interactive Video Generation via Masked-Diffusion

Python 74 7 Updated Apr 15, 2024

[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis

Python 153 3 Updated Apr 22, 2023

[CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis

Python 36 2 Updated Mar 5, 2024

[SIGGRAPH Asia 2024] TrailBlazer: Trajectory Control for Diffusion-Based Video Generation

Python 96 11 Updated May 31, 2024

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Python 467 19 Updated Nov 16, 2024

An innovative method designed to augment the capabilities of existing video diffusion models

Python 22 Updated May 10, 2024

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Python 4,291 388 Updated Oct 25, 2023

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Python 870 55 Updated Aug 21, 2024

A paper list of some recent Transformer-based CV works.

1,185 140 Updated Feb 14, 2025

VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)

Python 187 8 Updated Mar 29, 2024

Mamba SSM architecture

Python 13,979 1,209 Updated Jan 18, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 46,196 5,504 Updated Feb 14, 2025

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,322 130 Updated Aug 1, 2024
Next