-
Intern at Xiaohongshu Inc.
- Beijing, China
- https://scholar.google.com/citations?user=4N1hFycAAAAJ&hl=zh-CN
Stars
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis
[KDD2025] Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
Diffusion Model-Based Image Editing: A Survey (arXiv)
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
A generative speech model for daily dialogue.
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing
[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
This is the pytorch implement of our paper "RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model"
[ICLR 2025] IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing
(AAAI-24) Federated Learning via Input-Output Collaborative Distillation
Official PyTorch implementation for the paper "Neural Video Fields Editing"
Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"
AAAI 2024-Controllable Mind Visual Diffusion Model
A batched offline inference oriented version of segment-anything
MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.
This repo contains a PyTorch implementation of the paper: "Evidential Deep Learning to Quantify Classification Uncertainty"
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。