This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR 2023)

Python 1,054 89 Updated Nov 26, 2024

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 377 31 Updated Dec 17, 2024

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 4,467 508 Updated Dec 17, 2024

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 1,976 115 Updated Sep 28, 2024

Fast parallel CTC.

Cuda 4,070 1,041 Updated Mar 4, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,347 207 Updated Dec 16, 2024

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with DreamBooth, achieving stunning videos. PIA, your personalized image animator, which uses text prompts to turn images into wonderful animations.

Python 930 77 Updated Aug 5, 2024

PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/Docker

Python 10,147 708 Updated Dec 21, 2024

DIGImend graphics tablet drivers for the Linux kernel

C 1,179 175 Updated Jul 26, 2024

The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”.

Python 175 20 Updated Jul 21, 2024

Explainability for Vision Transformers

Python 875 101 Updated Mar 12, 2022

A paper list of some recent Transformer-based CV works.

1,158 137 Updated Dec 21, 2024

Xplorer, a customizable, modern file manager

TypeScript 4,930 341 Updated Apr 14, 2024

🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️

Shell 22,217 3,383 Updated Nov 30, 2024

The book every data scientist needs on their desk.

Jupyter Notebook 821 70 Updated Dec 13, 2024

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for…

Python 466 20 Updated Oct 29, 2024

(TPAMI 2024) A Survey on Open Vocabulary Learning

870 51 Updated Dec 10, 2024

"Hung-yi Lee Deep Learning Tutorial" (recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 14,084 2,930 Updated Dec 19, 2024

With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)

Go 608 30 Updated Oct 16, 2024

A Multimodal Language Agent Framework for Smart Devices and More

Python 1,382 117 Updated Dec 20, 2024

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

Python 21,688 1,553 Updated Dec 20, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 27,261 3,032 Updated Dec 22, 2024

Collection of AWESOME vision-language models for vision tasks

2,638 227 Updated Dec 3, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,368 558 Updated Dec 18, 2024

Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)

TypeScript 42,658 3,301 Updated Dec 15, 2024

A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, also including new ideas, new problems, new resources, and new projects from work and research

1,826 180 Updated Nov 27, 2024

GUI for marking bounding boxes of objects in images for training YOLO neural networks

C++ 517 116 Updated May 12, 2024

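A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.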

16,901 1,596 Updated Sep 19, 2024

OpenMMLab Detection Toolbox and Benchmark

Python 29,858 9,499 Updated Aug 21, 2024

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

Python 3,105 314 Updated Dec 19, 2024