Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Python 1,350 228 Updated Nov 8, 2024

thuml / TimeXer

Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)

Python 98 9 Updated Nov 27, 2024

lucidrains / linformer

Implementation of Linformer for Pytorch

Python 257 25 Updated Jan 5, 2024

vpariza / open-hummingbird-eval

This is a repository that implements the Dense NN Retrieval Evaluation used for evaluating the In-Context Learning Capabilities of Vision Encoders.

Python 15 1 Updated Nov 15, 2024

WalterSimoncini / fungivision

Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"

Python 31 Updated Oct 31, 2024

AmeenAli / HiddenMambaAttn

Official PyTorch Implementation of "The Hidden Attention of Mamba Models"

Python 207 12 Updated May 27, 2024

facebookresearch / mae_st

Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"

Python 323 34 Updated Nov 26, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 10,569 1,001 Updated Dec 4, 2024

naver / mast3r

Grounding Image Matching in 3D with MASt3R

Python 1,411 111 Updated Oct 12, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,696 2,281 Updated Aug 12, 2024

CannyLab / tsne-cuda

GPU Accelerated t-SNE for CUDA with Python bindings

Cuda 1,819 130 Updated Oct 2, 2024

Meituan-AutoML / MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,071 68 Updated Apr 15, 2024

vikhyat / moondream

tiny vision language model

Jupyter Notebook 6,074 502 Updated Dec 10, 2024

apple / ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 748 49 Updated Nov 22, 2024

xccyue / MutualDistance

[ECCV 2024] Official PyTorch implementation of the paper "Scene-aware Human Motion Forecasting via Mutual Distance Prediction"

Python 12 Updated Nov 14, 2024

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 658 32 Updated Dec 11, 2024

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,466 92 Updated Dec 11, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,381 492 Updated Dec 10, 2024

Sense-X / Co-DETR

[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training

Python 1,055 120 Updated Nov 5, 2024

om-ai-lab / OmDet

Real-time and accurate open-vocabulary end-to-end object detection

Python 1,542 143 Updated Sep 6, 2024

IDEA-Research / Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,327 119 Updated Dec 11, 2024

facebookresearch / sapiens

High-resolution models for human tasks.

Python 4,630 265 Updated Nov 18, 2024

zju3dv / mvpose

Code for "Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views" (CVPR 2019, T-PAMI 2021)

Jupyter Notebook 518 79 Updated Jul 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GenkiK

Highlights

Block or report GenkiK

Stars

qinzheng93 / GeoTransformer

IDEA-Research / MotionCLR

mchong6 / JoJoGAN

sanweiliti / LEMO

AILab-CVC / YOLO-World

IDEA-Research / Grounding-DINO-1.5-API

xb534 / SED

thuml / iTransformer