Stars
Official repo for our ECCV'24 paper: Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.
An official repo for the ECCV 2024 paper: Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
https://dl.acm.org/doi/10.1145/3689095.3689102
[Pattern Recognition'24] Pytorch implementation of Multiple-environment Self-adaptive Network for Aerial-view Geo-localization 🚁 https://arxiv.org/abs/2204.08381
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Dreamer312 / LPN
Forked from wtyhub/LPN
Pytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646
layumi / PiPa
Forked from chen742/PiPa
Official Implementation of PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Zhedong Zheng Homepage http://www.zdzheng.xyz
layumi / LPN
Forked from wtyhub/LPN
Pytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646
UAVM @ ACM MM2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective
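Provides a practical interaction interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons & function plugins, project analysis & self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation & summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…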
Pytorch implementation of Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization https://arxiv.org/abs/2211.05296
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
Official Implementation of PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation
ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization