yr-zh

Follow

yr-zh

Follow

3 followers · 11 following

SiChuan University
Chengdu

Achievements

Achievements

Stars

linzhiqiu / cross_modal_adaptation

Cross-modal few-shot adaptation with CLIP

Python 321 37 Updated Mar 13, 2024

marian-nmt / marian

Fast Neural Machine Translation in C++

C++ 1,275 235 Updated Aug 25, 2023

stduhpf / stable-diffusion.cpp

Forked from leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

C++ 9 1 Updated Jan 22, 2025

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

C++ 63,978 9,649 Updated Jan 17, 2025

JaidedAI / EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 25,266 3,222 Updated Sep 24, 2024

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,897 148 Updated Jan 20, 2025

huggingface / amused

Python 82 5 Updated Jan 4, 2024

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,947 1,903 Updated Sep 26, 2024

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,530 128 Updated Jan 17, 2025

Topdu / OpenOCR

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 447 37 Updated Jan 3, 2025

facebookresearch / MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,224 70 Updated Nov 27, 2024

ibm-aur-nlp / PubLayNet

Jupyter Notebook 943 166 Updated Mar 7, 2022

doc-analysis / DocBank

DocBank: A Benchmark Dataset for Document Layout Analysis

Python 592 72 Updated Aug 12, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,459 429 Updated Jan 3, 2025

opendatalab / DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 781 58 Updated Jan 16, 2025

leejet / stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

C++ 3,706 329 Updated Jan 18, 2025

comfyanonymous / ComfyUI_bitsandbytes_NF4

Python 363 30 Updated Aug 16, 2024

Comfy-Org / comfy-cli

Command Line Interface for Managing ComfyUI

Python 337 52 Updated Jan 19, 2025

city96 / ComfyUI-GGUF

GGUF Quantization support for native ComfyUI models

Python 1,346 84 Updated Jan 8, 2025

Chaoses-Ib / ComfyScript

A Python frontend and library for ComfyUI

Python 458 29 Updated Jan 15, 2025

pydn / ComfyUI-to-Python-Extension

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 1,426 143 Updated Jan 14, 2025

rupeshs / fastsdcpu

Fast stable diffusion on CPU

Python 1,554 128 Updated Nov 23, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,937 183 Updated Oct 31, 2024

huggingface / optimum-quanto

A pytorch quantization backend for optimum

Python 868 68 Updated Jan 10, 2025

casper-hansen / AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,895 234 Updated Jan 20, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,566 4,741 Updated Jan 21, 2025

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,277 263 Updated Jan 21, 2025

OpenNMT / OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,803 2,242 Updated Jan 8, 2025

autonomousvision / stylegan-t

[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Python 1,168 55 Updated Apr 7, 2023

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,846 524 Updated Dec 25, 2024