Skip to content
View AllenMao's full-sized avatar
  • Chongqing

Block or report AllenMao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

这是一个stable-diffusion的库。

Python 119 17 Updated Aug 12, 2023

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,949 530 Updated Dec 25, 2024

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Python 340 57 Updated Jan 22, 2025

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 19,599 5,881 Updated Jan 20, 2025

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…

Python 2,830 435 Updated Nov 11, 2024

Unified Controllable Visual Generation Model

Python 632 35 Updated Jan 27, 2025

Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning

Python 35 2 Updated Sep 24, 2023
Python 1 Updated Feb 21, 2024

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,360 187 Updated Dec 26, 2023

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 6,783 488 Updated Feb 7, 2025

This is the official reproduction of FancyVideo.

Python 687 72 Updated Oct 30, 2024

🔍 a LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT

Python 1 Updated Jul 31, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,655 4,865 Updated Feb 6, 2025

一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Python 6,601 791 Updated Dec 9, 2024

MedViLL official code. (Published IEEE JBHI 2021)

Python 92 12 Updated Dec 26, 2024

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Python 187 17 Updated Sep 20, 2022

Learning to Cluster Faces (CVPR 2019, CVPR 2020)

Python 711 143 Updated Dec 27, 2021

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

Python 288 38 Updated May 4, 2022

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 260 7 Updated Sep 11, 2024
Python 356 17 Updated Jul 16, 2024

Interactive Map Correction for 3D Graph SLAM

C++ 874 258 Updated Aug 4, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,759 889 Updated Jan 28, 2025

Universal LLM Deployment Engine with ML Compilation

Python 19,889 1,650 Updated Feb 6, 2025

Face detection and recognition framework

C 1,760 472 Updated Jan 14, 2025

Espressif deep-learning library for AIoT applications

Assembly 620 131 Updated Jan 26, 2025

A code for paper Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking

Jupyter Notebook 4 Updated Jan 9, 2024

Production-grade 3D gaussian splatting with CPU/GPU support for Windows, Mac and Linux 🚀

C++ 1,071 100 Updated Feb 7, 2025

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Python 6,022 859 Updated May 13, 2024

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,129 74 Updated Apr 15, 2024

A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).

Python 562 55 Updated Jan 23, 2024
Next