Lists (2)
Sort Name ascending (A-Z)
Stars
Python implementation of convertion between equirectangular, cubemap and perspective. (equirect2cube, cube2equirect, equirect2perspec)
Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.
A lightweight library for instance-level visual road marking extraction, parameterization, mapping, etc.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Based on ResNet50, detect keypoint, and use keypoint perspective transform image.
[ECCV 2022] An End-to-End Transformer Model for Crowd Localization
VMamba: Visual State Space Models,code is based on mamba
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
🛣️ automatic extraction of road markings from MLS or ALS point cloud [ISPRS-A' 19]
a state-of-the-art-level open visual language model | 多模态预训练模型
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Training library for local feature detection and matching
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)
[ICCV 2023] ADNet: Lane Shape Prediction via Anchor Decomposition
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
Road detections from Microsoft Maps aerial imagery
Worldwide building footprints derived from satellite imagery
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
Official MegEngine implementation of CREStereo(CVPR 2022 Oral).
[TPAMI 2024 & CVPR 2022] Attention Concatenation Volume for Accurate and Efficient Stereo Matching