Skip to content
View airainday's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report airainday

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,612 259 Updated Dec 21, 2024

Generative Models by Stability AI

Python 24,909 2,764 Updated Sep 4, 2024

图像配准算法。包括 SIFT、ORB、SURF、AKAZE、BRIEF、matchTemplate

Python 98 17 Updated Jul 31, 2022

一键自动化 下载、安装、激活 Office 的利器。

C# 8,921 829 Updated Feb 22, 2024

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,578 316 Updated May 21, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,707 232 Updated Dec 4, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,796 2,293 Updated Aug 12, 2024

This is the pytorch implement of our paper "RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model"

Python 112 2 Updated Nov 19, 2024

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,225 1,270 Updated Sep 5, 2024

An open source implementation of CLIP.

Python 10,627 1,003 Updated Dec 4, 2024

Research Code for Multimodal-Cognition Team in Ant Group

Python 127 5 Updated Jul 11, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,469 497 Updated Dec 21, 2024

Dedicated theme for Hexo

CSS 168 25 Updated Nov 9, 2022

支持大麦网,淘票票、缤玩岛等多个平台,演唱会演出抢票脚本

HTML 1,222 181 Updated Dec 14, 2024

SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.

Python 4,745 582 Updated Dec 17, 2024

百亿参数的中英文双语基座大模型

Python 2,701 215 Updated Jul 28, 2023

PyTorch package for the discrete VAE used for DALL·E.

Python 10,809 1,940 Updated Jan 31, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 65,968 8,021 Updated Dec 20, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,449 2,572 Updated Dec 15, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,606 3,377 Updated Jul 23, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,695 479 Updated Aug 6, 2024

整体的介绍 FastAPI,快速上手开发,结合 API 交互文档逐个讲解核心模块的使用。视频学习地址:

JavaScript 1,155 334 Updated Aug 9, 2023

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,556 237 Updated Aug 1, 2024

Official implementation of AnimateDiff.

Python 10,754 877 Updated Jul 31, 2024

A Vue 3 Component Library. Fairly Complete. Theme Customizable. Uses TypeScript. Fast.

TypeScript 16,383 1,689 Updated Dec 20, 2024

⭐️ 基于 FastAPI+Vue3+Naive UI 的现代化轻量管理平台 A modern and lightweight management platform based on FastAPI, Vue3, and Naive UI.

Vue 842 164 Updated Sep 25, 2024

程序员相关电子书资料免费分享,欢迎关注个人微信公众号:编程与实战

4,736 1,200 Updated Apr 4, 2024

C++那些事

C++ 39,793 8,587 Updated Jun 14, 2024

End-to-End Object Detection with Transformers

Python 13,772 2,484 Updated Mar 12, 2024

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 2,059 213 Updated Aug 15, 2024
Next