aww1q

4 followers · 22 following

Stars

303 results for source starred repositories

cvg / LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Python 3,644 380 Updated Jun 20, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,002 1,114 Updated Feb 28, 2025

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,777 170 Updated Jan 22, 2025

Ola-Omni / Ola

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 294 12 Updated Feb 28, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 7,582 765 Updated Mar 7, 2025

stepfun-ai / Step-Audio

Python 3,855 307 Updated Mar 6, 2025

stepfun-ai / Step-Video-T2V

Python 2,597 221 Updated Feb 27, 2025

HumanMLLM / HumanOmni

HumanOmni

Python 75 4 Updated Mar 3, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,650 1,157 Updated Mar 7, 2025

THUDM / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English speech dialogue model

Python 2,732 222 Updated Dec 5, 2024

MoonshotAI / Kimi-k1.5

3,182 189 Updated Mar 7, 2025

deepseek-ai / DeepSeek-V3

Python 91,437 14,777 Updated Feb 24, 2025

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 9,859 1,160 Updated Mar 7, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,791 6,137 Updated Mar 9, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.

Python 7,194 553 Updated Feb 26, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,788 503 Updated Mar 7, 2025

UX-Decoder / DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 440 18 Updated Apr 8, 2024

microsoft / SoM

Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,301 103 Updated Aug 19, 2024

UX-Decoder / Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,511 127 Updated Jul 19, 2024

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,158 177 Updated Nov 26, 2024

SysCV / sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Jupyter Notebook 3,821 231 Updated Dec 7, 2024

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,504 770 Updated Mar 7, 2025

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,342 491 Updated Feb 12, 2025

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,565 124 Updated Aug 13, 2024

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,622 116 Updated Jul 5, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,208 2,315 Updated Mar 7, 2025

microsoft / SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,305 122 Updated Apr 24, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 34,957 3,773 Updated Feb 18, 2025

microsoft / X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,307 146 Updated Oct 5, 2023

InternLM / InternLM

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,794 481 Updated Feb 7, 2025
