Skip to content
View ZYM-PKU's full-sized avatar
🏫
Studying in school
🏫
Studying in school

Block or report ZYM-PKU

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 361 28 Updated Feb 3, 2025

【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending

9 1 Updated Feb 26, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 42,099 5,153 Updated Feb 25, 2025

(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.

Python 59 1 Updated Jun 11, 2024

Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Jupyter Notebook 38 2 Updated Feb 26, 2025

Ultralytics YOLO11 🚀

Python 37,111 7,194 Updated Feb 26, 2025

Code for BLT research paper

Python 1,413 107 Updated Feb 25, 2025

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 495 39 Updated Feb 23, 2025

clash-for-linux

Shell 2,068 704 Updated Dec 12, 2023

CLIPScore EMNLP code

Python 211 27 Updated Dec 16, 2022
Python 1,007 74 Updated Jan 8, 2025

Peak signal-to-noise ratio and The structural similarity calculation tool

Python 20 3 Updated Jun 26, 2020
Python 1,903 135 Updated Nov 8, 2024

Official inference repo for FLUX.1 models

Python 20,445 1,435 Updated Feb 6, 2025

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Python 1,231 59 Updated Oct 18, 2022

A general fine-tuning kit geared toward diffusion models.

Python 2,107 199 Updated Feb 22, 2025

Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)

Python 102 12 Updated Jun 28, 2024

Your image is almost there!

Python 7,504 427 Updated Jul 26, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,159 91 Updated Feb 16, 2025

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 411 20 Updated Jul 5, 2024
TeX 29 15 Updated Mar 11, 2022

This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context".

Python 85 8 Updated Feb 21, 2023

A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.

Python 369 26 Updated Aug 13, 2023

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,441 898 Updated Feb 5, 2025

Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).

Python 129 6 Updated Jul 11, 2024

UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models

Python 217 17 Updated Feb 14, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,712 440 Updated Jan 12, 2025

A simple ANN implemented in Numpy

Python 1 Updated Feb 11, 2021

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

Python 404 11 Updated Nov 10, 2024

The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer

Python 49 4 Updated Jun 14, 2024
Next