Stars
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].
Progressive Feature Learning for Facade Parsing with Occlusions
Source code for paper DeepFacade: A Deep Learning Approach to Facade Parsing
An efficient and robust building height estimation model using street-view panoramas
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
[CVPR23] Official Implementation of MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation
High-Resolution Image Synthesis with Latent Diffusion Models
(IEEE TITS 2024) WHU-Railway3D: A Diverse Dataset and Benchmark for Railway Point Cloud Semantic Segmentation
Improving Facade Parsing with Vision Transformers and Line Integration
PyTorch code and models for the DINOv2 self-supervised learning method.
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
Self-attention、Non-local、SE、SK、CBAM、DANet
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…