AllenMao

Follow

MLCV AllenMao

Follow

Medical image analysis; Deep learning

12 followers · 21 following

Chongqing

Achievements

Achievements

Stars

bubbliiiing / stable-diffusion

这是一个stable-diffusion的库。

Python 119 17 Updated Aug 12, 2023

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,949 530 Updated Dec 25, 2024

InternLM / InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Python 340 57 Updated Jan 22, 2025

NanmiCoder / MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫、百度贴吧帖子｜百度贴吧评论回复爬虫 | 知乎问答文章｜评论爬虫

Python 19,599 5,881 Updated Jan 20, 2025

PeterH0323 / Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁，一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…

Python 2,830 435 Updated Nov 11, 2024

salesforce / UniControl

Unified Controllable Visual Generation Model

Python 632 35 Updated Jan 27, 2025

tripplyons / oft

Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning

Python 35 2 Updated Sep 24, 2023

liming-ai / unicontrolnet

Python 1 Updated Feb 21, 2024

luban-agi / Awesome-Domain-LLM

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,360 187 Updated Dec 26, 2023

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 6,783 488 Updated Feb 7, 2025

360CVGroup / FancyVideo

This is the official reproduction of FancyVideo.

Python 687 72 Updated Oct 30, 2024

AllenMao / MindSearch

Forked from InternLM/MindSearch

🔍 a LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT

Python 1 Updated Jul 31, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,655 4,865 Updated Feb 6, 2025

jianchang512 / ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Python 6,601 791 Updated Dec 9, 2024

SuperSupermoon / MedViLL

MedViLL official code. (Published IEEE JBHI 2021)

Python 92 12 Updated Dec 26, 2024

salesforce / ALPRO

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Python 187 17 Updated Sep 20, 2022

yl-1993 / learn-to-cluster

Learning to Cluster Faces (CVPR 2019, CVPR 2020)

Python 711 143 Updated Dec 27, 2021

mx-mark / VideoTransformer-pytorch

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

Python 288 38 Updated May 4, 2022

RLHF-V / RLHF-V

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 260 7 Updated Sep 11, 2024

lzhnb / GS-IR

Python 356 17 Updated Jul 16, 2024

koide3 / interactive_slam

Interactive Map Correction for 3D Graph SLAM

C++ 874 258 Updated Aug 4, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

13,759 889 Updated Jan 28, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 19,889 1,650 Updated Feb 6, 2025

espressif / esp-who

Face detection and recognition framework

C 1,760 472 Updated Jan 14, 2025

espressif / esp-dl

Espressif deep-learning library for AIoT applications

Assembly 620 131 Updated Jan 26, 2025

ivana-13 / guided_masking

A code for paper Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking

Jupyter Notebook 4 Updated Jan 9, 2024

pierotofy / OpenSplat

Production-grade 3D gaussian splatting with CPU/GPU support for Windows, Mac and Linux 🚀

C++ 1,071 100 Updated Feb 7, 2025

levihsu / OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Python 6,022 859 Updated May 13, 2024

Meituan-AutoML / MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,129 74 Updated Apr 15, 2024

vacancy / SceneGraphParser

A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).

Python 562 55 Updated Jan 23, 2024