Skip to content
View GlassyWing's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • China

Block or report GlassyWing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of the proposed minGRU in Pytorch

Python 265 21 Updated Dec 18, 2024

「大模型」3小时从0训练27M参数的视觉多模态VLM,个人显卡即可推理训练!

Python 502 55 Updated Dec 13, 2024

「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!

Python 3,276 411 Updated Dec 13, 2024

Official Implementation of LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Python 542 29 Updated Dec 26, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,899 429 Updated Jan 5, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,503 696 Updated Dec 24, 2024

real time face swap and one-click video deepfake with only a single image

Python 42,084 6,178 Updated Jan 5, 2025

The only guide you need to learn everything about GMM

Jupyter Notebook 105 15 Updated Dec 4, 2024

Unofficial implementation of PatchCore anomaly detection

Python 328 92 Updated Sep 28, 2022

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

C 142 17 Updated Dec 27, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,670 474 Updated Dec 31, 2024

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,481 458 Updated Sep 9, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,187 147 Updated Sep 3, 2024

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Python 4,172 370 Updated Aug 1, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,644 520 Updated Dec 25, 2024

Code for paper "New Benchmarks for Barcode Detection using both Synthetic and Real Data" https://link.springer.com/chapter/10.1007%2F978-3-030-57058-3_34

Python 79 20 Updated Aug 20, 2022

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,005 116 Updated Dec 24, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,345 153 Updated Oct 21, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,916 182 Updated Oct 31, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,271 403 Updated Aug 7, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,098 1,451 Updated Aug 9, 2024

Annotated version of the Mamba paper

Jupyter Notebook 465 18 Updated Feb 27, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,752 183 Updated Sep 28, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 37,241 4,777 Updated Jan 5, 2025

tiny vision language model

Jupyter Notebook 6,247 510 Updated Jan 5, 2025

Structured state space sequence models

Jupyter Notebook 2,516 301 Updated Jul 17, 2024

Penpot: The open-source design tool for design and code collaboration

Clojure 34,515 1,759 Updated Jan 3, 2025

The OS for your personal finances

Ruby 34,759 2,488 Updated Jan 4, 2025

Instant voice cloning by MIT and MyShell.

Python 30,318 2,997 Updated Dec 24, 2024

Official repository of Agent Attention (ECCV2024)

Python 574 39 Updated Nov 17, 2024
Next