Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.

Shell 15,928 1,108 Updated Feb 28, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,776 169 Updated Jan 22, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 293 11 Updated Feb 28, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 7,320 720 Updated Mar 6, 2025
Python 3,815 302 Updated Mar 6, 2025

HumanOmni

Python 75 4 Updated Mar 3, 2025

Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

Python 11,556 1,148 Updated Mar 7, 2025

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,723 222 Updated Dec 5, 2024

Large Language Model Text Generation Inference

Python 9,857 1,159 Updated Mar 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,516 6,097 Updated Mar 7, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 7,181 553 Updated Feb 26, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,778 503 Updated Mar 6, 2025

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 440 18 Updated Apr 8, 2024

Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,300 103 Updated Aug 19, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,508 127 Updated Jul 19, 2024

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,157 177 Updated Nov 26, 2024

Segment Anything in High Quality [NeurIPS 2023]

Jupyter Notebook 3,821 231 Updated Dec 7, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,492 767 Updated Mar 5, 2025

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,340 491 Updated Feb 12, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,560 124 Updated Aug 13, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,621 116 Updated Jul 5, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,159 2,306 Updated Mar 6, 2025

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,302 122 Updated Apr 24, 2024

A generative speech model for daily dialogue.

Python 34,915 3,770 Updated Feb 18, 2025

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,307 145 Updated Oct 5, 2023

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,792 481 Updated Feb 7, 2025

An open source implementation of CLIP.

Python 11,154 1,054 Updated Mar 1, 2025