Skip to content
View ChenJian7578's full-sized avatar

Block or report ChenJian7578

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,115 241 Updated Mar 10, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,534 605 Updated Mar 7, 2025

Mixture-of-Experts for Large Vision-Language Models

Python 2,109 133 Updated Dec 3, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,465 1,658 Updated Feb 26, 2025

这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。

Python 274 40 Updated Feb 18, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 41,711 5,662 Updated Mar 9, 2025

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Python 1 Updated Jun 6, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,429 4,864 Updated Feb 23, 2025

cgan(条件对抗生成网络)

Python 2 Updated Aug 30, 2022

这是一个yolo3-pytorch的源码,可以用于训练自己的模型。

Python 2,039 583 Updated Jan 26, 2024

功能: 使用阿里云智能语音服务中的录音文件识别 API,实现将视频、音频文件转写出 srt 字幕

Python 122 30 Updated Feb 2, 2022