Skip to content
View shigel's full-sized avatar

Block or report shigel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
110 results for source starred repositories
Clear filter

PDF to Markdown with vision models

Python 9,187 585 Updated Dec 18, 2024
Python 7,277 572 Updated Jan 24, 2025

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.

Python 11,487 1,457 Updated Jan 27, 2025

Sky-T1: Train your own O1 preview model within $450

Python 2,188 231 Updated Jan 26, 2025

The official HelloMeme GitHub site

Python 549 38 Updated Jan 15, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,846 619 Updated Jan 24, 2025

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Python 9,004 1,335 Updated Jan 20, 2025

The Construction Site Snag Detector is a powerful tool built using Python, Gradio, and the Groq platform. It leverages AI to automatically detect defects, unfinished work, and quality issues in con…

Python 5 Updated Oct 21, 2024

このツールは、Googleスプレッドシートのデータを基にGoogleスライドを自動生成するPythonスクリプトです。スプレッドシートの各行からスライドを作成し、タイトル、サブタイトル、本文を適切なフォーマットで配置します。

Python 1 Updated Oct 27, 2024

Official inference framework for 1-bit LLMs

C++ 12,653 882 Updated Dec 20, 2024

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,449 501 Updated Jan 24, 2025

Diffusion model derived evolutionary algorithm

Python 187 15 Updated Jan 24, 2025

Python library that provides a unified interface for interacting with multiple Large Language Models (LLMs) from different providers.

Python 14 1 Updated Oct 13, 2024

A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.

Python 342 217 Updated Jan 22, 2025

React app for inspecting, building and debugging with the Realtime API

JavaScript 2,812 1,016 Updated Jan 2, 2025

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Jupyter Notebook 348 19 Updated Oct 6, 2024

Optimizing inference proxy for LLMs

Python 1,953 155 Updated Jan 24, 2025

SOTA Open Source TTS

Python 18,661 1,413 Updated Jan 26, 2025

書籍「AIエディタCursor完全ガイド」 のサポートを行うリポジトリです。

155 8 Updated Dec 6, 2024

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 7,512 1,231 Updated Dec 20, 2024

A large-scale RWKV v6, v7 inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy on docker. Supports true multi-batch generation and dynamic State switching. CUDA …

Python 24 1 Updated Jan 21, 2025

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,691 395 Updated Dec 4, 2024

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python 1,584 154 Updated Oct 29, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 17,821 1,281 Updated Jan 26, 2025
Python 1,845 129 Updated Nov 8, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,795 1,377 Updated Dec 25, 2024
Next