

[CVPR2022] Geometric Transformer for Fast and Robust Point Cloud Registration

Python 686 71 Updated Nov 22, 2023

[arXiv 2024] MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms

Python 108 4 Updated Dec 1, 2024

Official PyTorch repo for JoJoGAN: One Shot Face Stylization

Jupyter Notebook 1,422 206 Updated Sep 29, 2022

Official PyTorch implementation of the ICCV 2021 paper "Learning Motion Priors for 4D Human Body Capture in 3D Scenes", with trained models and data

Python 196 21 Updated May 1, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,812 462 Updated Nov 5, 2024

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 818 26 Updated Aug 9, 2024

[CVPR 2024] Official PyTorch implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

Python 140 9 Updated May 30, 2024

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Python 1,350 228 Updated Nov 8, 2024

Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)

Python 98 9 Updated Nov 27, 2024

Implementation of Linformer for PyTorch

Python 257 25 Updated Jan 5, 2024

Implements the Dense NN Retrieval Evaluation used to evaluate the in-context learning capabilities of vision encoders.

Python 15 1 Updated Nov 15, 2024

Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"

Python 31 Updated Oct 31, 2024

Official PyTorch Implementation of "The Hidden Attention of Mamba Models"

Python 207 12 Updated May 27, 2024

Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"

Python 323 34 Updated Nov 26, 2024

An open source implementation of CLIP.

Python 10,569 1,001 Updated Dec 4, 2024

Grounding Image Matching in 3D with MASt3R

Python 1,411 111 Updated Oct 12, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,696 2,281 Updated Aug 12, 2024

GPU Accelerated t-SNE for CUDA with Python bindings

Cuda 1,819 130 Updated Oct 2, 2024

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,071 68 Updated Apr 15, 2024

Tiny vision language model

Jupyter Notebook 6,074 502 Updated Dec 10, 2024

Official implementation of the CVPR 2024 paper "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training"

Python 748 49 Updated Nov 22, 2024

[ECCV 2024] Official PyTorch implementation of the paper "Scene-aware Human Motion Forecasting via Mutual Distance Prediction"

Python 12 Updated Nov 14, 2024

Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 658 32 Updated Dec 11, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,466 92 Updated Dec 11, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Python 6,381 492 Updated Dec 10, 2024

[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training

Python 1,055 120 Updated Nov 5, 2024

Real-time and accurate open-vocabulary end-to-end object detection

Python 1,542 143 Updated Sep 6, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,327 119 Updated Dec 11, 2024

High-resolution models for human tasks.

Python 4,630 265 Updated Nov 18, 2024

Code for "Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views" (CVPR 2019, T-PAMI 2021)

Jupyter Notebook 518 79 Updated Jul 30, 2021