meaquanana

7 followers · 6 following

Stars

linhuixiao / Awesome-Visual-Grounding

[TPAMI reviewing] Towards Visual Grounding: A Survey

Shell 111 12 Updated Feb 13, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,905 444 Updated Jan 12, 2025

LeapLabTHU / ImprovedNAT

A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"

Python 42 2 Updated Jun 13, 2024

google-research / maskgit

Official Jax Implementation of MaskGIT

Jupyter Notebook 490 50 Updated Nov 18, 2022

CompVis / taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,055 1,180 Updated Jul 30, 2024

LeapLabTHU / AdaNAT

[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Python 33 1 Updated Sep 12, 2024

LeapLabTHU / OVM3D-Det

Python 30 1 Updated Jan 2, 2025

robot-learning-freiburg / BEVCar

[IROS2024] Camera-Radar Fusion for BEV Map and Object Segmentation

Python 73 8 Updated Mar 6, 2025

showlab / Tune-An-Ellipse

[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want

Python 10 1 Updated Jan 5, 2025

lichengunc / refer

Referring Expression Datasets API

Jupyter Notebook 496 79 Updated Aug 27, 2024

BryanPlummer / flickr30k_entities

Flickr30K Entities Dataset

MATLAB 168 26 Updated Dec 23, 2018

hila-chefer / Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Jupyter Notebook 1,849 247 Updated Jan 24, 2024

hila-chefer / Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…

Jupyter Notebook 835 109 Updated Aug 24, 2023

kerrj / lerf

Code for LERF: Language Embedded Radiance Fields

Python 679 69 Updated Jul 9, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 9,974 898 Updated Aug 7, 2024

wanghao9610 / OV-DINO

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 296 19 Updated Mar 12, 2025

uvavision / SelfEQ

[CVPR 2024] Code for "Improved Visual Grounding through Self-Consistent Explanations".

Python 24 1 Updated Mar 1, 2024

uvavision / AMC-grounding

[CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"

Jupyter Notebook 19 2 Updated Oct 10, 2023

VainF / Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning

Python 2,903 344 Updated Mar 5, 2025

salesforce / ALBEF

Code for ALBEF: a new vision-language pre-training method

Python 1,616 203 Updated Sep 20, 2022

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,465 1,501 Updated Dec 25, 2024

xuwangyin / dinov2-finetune

Finetuning DINOv2 (https://github.com/facebookresearch/dinov2) on your own dataset

Python 55 3 Updated Jun 8, 2023

lucidrains / CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Python 1,109 89 Updated Dec 12, 2023

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,334 1,005 Updated Nov 18, 2024

acfr / cam_lidar_calibration

(ITSC 2021) Optimising the selection of samples for robust lidar camera calibration. This package estimates the calibration parameters from camera to lidar frame.

C++ 479 109 Updated Oct 4, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,867 128 Updated Oct 30, 2024

apple / ml-4m

4M: Massively Multimodal Masked Modeling

Python 1,692 102 Updated Mar 7, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,454 4,865 Updated Feb 23, 2025

PaddlePaddle / PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,168 2,921 Updated Mar 12, 2025

open-mmlab / mmfewshot

OpenMMLab FewShot Learning Toolbox and Benchmark

Python 719 120 Updated Sep 5, 2023
