junedgar

Follow

junedgar

Follow

9 followers · 38 following

ustc
hangzhou

Achievements

Achievements

Starred repositories

Ucas-HaoranWei / Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,831 159 Updated Dec 2, 2024

thu-ml / unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,385 87 Updated May 31, 2023

ytongbai / LVM

Python 1,774 55 Updated Jun 28, 2024

01-ai / Yi

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,744 484 Updated Nov 27, 2024

YifanXu74 / MQ-Det

Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Python 273 13 Updated Feb 23, 2024

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

1,861 93 Updated Dec 9, 2024

yzhang2016 / video-generation-survey

A reading list of video generation

445 29 Updated Dec 16, 2024

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 7,954 586 Updated Dec 6, 2024

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 66,362 8,136 Updated Dec 9, 2024

LAION-AI / CLAP

Contrastive Language-Audio Pretraining

Python 1,461 143 Updated Nov 21, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 10,580 1,001 Updated Dec 4, 2024

rom1504 / clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,440 215 Updated Apr 15, 2024

LAION-AI / aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

Jupyter Notebook 491 20 Updated Aug 15, 2022

LAION-AI / LAION-5B-WatermarkDetection

Python 104 14 Updated Jan 10, 2023

positive666 / Prompt-Can-Anything

You can do anything by sota AI with prompt ,auto AI tools , VL larger model fine and project

Jupyter Notebook 184 12 Updated Sep 25, 2023

OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,108 253 Updated Nov 26, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,670 5,489 Updated Dec 16, 2024

PeterL1n / RobustVideoMatting

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 8,656 1,138 Updated Apr 2, 2024

OpenLMLab / OpenChineseLLaMA

Forked from meta-llama/llama

Chinese large language model base generated through incremental pre-training on Chinese datasets

Python 234 17 Updated May 30, 2023

facebookresearch / ImageBind

ImageBind One Embedding Space to Bind Them All

Python 8,424 778 Updated Jul 31, 2024

OpenGVLab / InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…

Python 3,210 232 Updated Aug 20, 2024

XingangPan / DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,788 3,450 Updated May 18, 2024

HeliosZhao / Make-A-Protagonist

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

Python 319 36 Updated Aug 1, 2023

THUDM / VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,111 423 Updated Aug 23, 2024

ymcui / Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,513 1,874 Updated Apr 30, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 96,269 15,654 Updated Dec 16, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,490 2,923 Updated Sep 2, 2024

nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 71,175 7,749 Updated Dec 16, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,715 2,279 Updated Aug 12, 2024

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 7,984 761 Updated Oct 16, 2024

Starred topics

fine-grained-classification

Awesome Lists

Algorithm

3D

portrait-matting

hand-detection