tomchen-ctj

Follow

💨

Learning

Tongjia tomchen-ctj

💨

Learning

Follow

[email protected]

23 followers · 98 following

06:03 (UTC +08:00)
https://tomchen-ctj.github.io/

Achievements

Achievements

Stars

showlab / DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Python 467 16 Updated Jul 2, 2024

tomasjakab / animodel

Python 14 Updated Jul 1, 2024

mit-han-lab / hart

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 407 19 Updated Oct 16, 2024

3DAnimals / 3DAnimals

A machine learning framework for reconstructing articulated 3D animals from images

Python 87 3 Updated Dec 18, 2024

YunzeMan / Lexicon3D

[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Python 69 4 Updated Dec 3, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,410 94 Updated Aug 13, 2024

PixArt-alpha / PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,740 85 Updated Oct 31, 2024

gsgen3d / gsgen

[CVPR 2024] Text-to-3D using Gaussian Splatting

Python 813 49 Updated Jan 7, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,696 1,354 Updated Dec 25, 2024

graphdeco-inria / gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 15,480 2,061 Updated Oct 30, 2024

sczhou / ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 5,839 676 Updated Sep 18, 2024

LostXine / LLaRA

LLaRA: Large Language and Robotics Assistant

Python 164 3 Updated Oct 2, 2024

EvolvingLMMs-Lab / LongVA

Long Context Transfer from Language to Vision

Python 356 19 Updated Nov 20, 2024

showlab / videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Python 283 32 Updated Aug 15, 2024

mbzuai-oryx / VideoGPT-plus

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 245 15 Updated Aug 11, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,823 524 Updated Dec 25, 2024

BradyFU / Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

445 18 Updated Dec 14, 2024

LeapLabTHU / EfficientTrain

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

Python 217 9 Updated Aug 23, 2024

HJYao00 / DenseConnector

【NeurIPS 2024】Dense Connector for MLLMs

Python 154 7 Updated Oct 14, 2024

zhaohengyuan1 / Genixer

(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator

Python 103 Updated Oct 17, 2024

FreedomIntelligence / Apollo

Multilingual Medicine: Model, Dataset, Benchmark, Code

Python 179 9 Updated Oct 15, 2024

jiyanggao / TALL

TALL: Temporal Activity Localization via Language Query

Python 195 48 Updated Mar 15, 2018

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,069 910 Updated Oct 3, 2024

LLaVA-VL / LLaVA-NeXT

Python 3,281 294 Updated Oct 16, 2024

hbb1 / 2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Python 2,269 168 Updated Dec 30, 2024

whwu95 / FreeVA

FreeVA: Offline MLLM as Training-Free Video Assistant

Python 54 Updated Jun 9, 2024

DavidHuji / CapDec

CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

Python 188 20 Updated Jan 28, 2024

showlab / MotionDirector

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Python 866 54 Updated Aug 21, 2024

hr98w / awesome-sora-prompts

This repository contains curated prompts aimed at maximizing the effectiveness of Sora for generating videos.

16 1 Updated Jan 17, 2025

Blealtan / efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Python 4,196 376 Updated Aug 1, 2024