Skip to content
View aww1q's full-sized avatar

Block or report aww1q

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
303 results for source starred repositories
Clear filter

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Python 3,644 380 Updated Jun 20, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,002 1,114 Updated Feb 28, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,777 170 Updated Jan 22, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 294 12 Updated Feb 28, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 7,582 765 Updated Mar 7, 2025
Python 3,855 307 Updated Mar 6, 2025

HumanOmni

Python 75 4 Updated Mar 3, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,650 1,157 Updated Mar 7, 2025

GLM-4-Voice | 端到端中英语音对话模型

Python 2,732 222 Updated Dec 5, 2024

Large Language Model Text Generation Inference

Python 9,859 1,160 Updated Mar 7, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,791 6,137 Updated Mar 9, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 7,194 553 Updated Feb 26, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,788 503 Updated Mar 7, 2025

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 440 18 Updated Apr 8, 2024

Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,301 103 Updated Aug 19, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,511 127 Updated Jul 19, 2024

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,158 177 Updated Nov 26, 2024

Segment Anything in High Quality [NeurIPS 2023]

Jupyter Notebook 3,821 231 Updated Dec 7, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,504 770 Updated Mar 7, 2025

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,342 491 Updated Feb 12, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,565 124 Updated Aug 13, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,622 116 Updated Jul 5, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,208 2,315 Updated Mar 7, 2025

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,305 122 Updated Apr 24, 2024

A generative speech model for daily dialogue.

Python 34,957 3,773 Updated Feb 18, 2025

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,307 146 Updated Oct 5, 2023

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,794 481 Updated Feb 7, 2025
Next