Skip to content
View yr-zh's full-sized avatar
  • SiChuan University
  • Chengdu

Block or report yr-zh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cross-modal few-shot adaptation with CLIP

Python 321 37 Updated Mar 13, 2024

Fast Neural Machine Translation in C++

C++ 1,275 235 Updated Aug 25, 2023

Stable Diffusion and Flux in pure C/C++

C++ 9 1 Updated Jan 22, 2025

Tesseract Open Source OCR Engine (main repository)

C++ 63,978 9,649 Updated Jan 17, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 25,266 3,222 Updated Sep 24, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,897 148 Updated Jan 20, 2025
Python 82 5 Updated Jan 4, 2024

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,947 1,903 Updated Sep 26, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,530 128 Updated Jan 17, 2025

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 447 37 Updated Jan 3, 2025

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,224 70 Updated Nov 27, 2024
Jupyter Notebook 943 166 Updated Mar 7, 2022

DocBank: A Benchmark Dataset for Document Layout Analysis

Python 592 72 Updated Aug 12, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,459 429 Updated Jan 3, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 781 58 Updated Jan 16, 2025

Stable Diffusion and Flux in pure C/C++

C++ 3,706 329 Updated Jan 18, 2025

Command Line Interface for Managing ComfyUI

Python 337 52 Updated Jan 19, 2025

GGUF Quantization support for native ComfyUI models

Python 1,346 84 Updated Jan 8, 2025

A Python frontend and library for ComfyUI

Python 458 29 Updated Jan 15, 2025

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 1,426 143 Updated Jan 14, 2025

Fast stable diffusion on CPU

Python 1,554 128 Updated Nov 23, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,937 183 Updated Oct 31, 2024

A pytorch quantization backend for optimum

Python 868 68 Updated Jan 10, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,895 234 Updated Jan 20, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,566 4,741 Updated Jan 21, 2025

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,277 263 Updated Jan 21, 2025

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,803 2,242 Updated Jan 8, 2025

[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Python 1,168 55 Updated Apr 7, 2023

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,846 524 Updated Dec 25, 2024
Next