Stars
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Ola: Pushing the Frontiers of Omni-Modal Language Model
Wan: Open and Advanced Large-Scale Video Generative Models
A multilingual large voice generation model providing full-stack capabilities for inference, training, and deployment.
Large Language Model Text Generation Inference
A high-throughput and memory-efficient inference and serving engine for LLMs
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Go from images to inference with no labeling (use foundation models to train supervised models).
Segment Anything in High Quality [NeurIPS 2023]
ModelScope: bring the notion of Model-as-a-Service to life.
Self-Supervised Speech Pre-training and Representation Learning Toolkit
The official repo of the Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
The official repo of the Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
A generative speech model for daily dialogue.
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
An open-source implementation of CLIP.