Skip to content
View hawktang's full-sized avatar

Block or report hawktang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,000 116 Updated Dec 24, 2024

Native Jellyfin Client for iOS and tvOS

Swift 2,673 291 Updated Dec 31, 2024

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 2,076 242 Updated Dec 26, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,966 639 Updated Jan 1, 2025

Efficiently find the best-suited language model (LM) for your NLP task

Python 108 10 Updated Dec 31, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 88,498 8,444 Updated Jan 1, 2025

DSPy: The framework for programming—not prompting—language models

Python 20,648 1,557 Updated Jan 1, 2025

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 235 62 Updated Dec 20, 2024

Inference Microsoft Florence2 VLM

Python 855 59 Updated Nov 27, 2024

👾🍎 Apple MLX engine for LM Studio

Python 295 27 Updated Dec 26, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 17,318 1,773 Updated Oct 15, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 46,318 5,509 Updated Dec 18, 2024

SOTA Open Source TTS

Python 17,967 1,347 Updated Dec 29, 2024

🔥 🔥 🔥 Open Source Airtable Alternative

TypeScript 50,616 3,475 Updated Jan 1, 2025

PlayStation 4 emulator for Windows, Linux and macOS written in C++

C++ 12,608 830 Updated Jan 1, 2025

[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo

Python 682 37 Updated Oct 4, 2024

Generate accurate transcripts using Apple's MLX framework

Python 346 32 Updated Dec 10, 2024

A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and…

Python 183 14 Updated Nov 3, 2024

Convert PDF to markdown + JSON quickly with high accuracy

Python 18,932 1,108 Updated Jan 2, 2025

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

Python 1,558 106 Updated Nov 28, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 7,981 605 Updated Dec 27, 2024

Get your documents ready for gen AI

Python 17,093 893 Updated Dec 19, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,922 273 Updated Oct 5, 2024

1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3,…

Shell 6,020 1,131 Updated Jan 2, 2025

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 6,682 490 Updated Dec 31, 2024

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 13,984 1,467 Updated Nov 20, 2024

Build document-native LLM applications

Python 51 2 Updated Sep 11, 2024

Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Python 243 30 Updated Dec 28, 2024

Open-source AI cookbook

Jupyter Notebook 1,745 259 Updated Dec 31, 2024
Next