[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 229 14 Updated May 5, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,484 2,326 Updated Mar 5, 2025

dvlab-research / LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 774 45 Updated Jul 29, 2024

ZihaoZhao / Arxiv_daily

A little spider which can help you to get your own paper list from https://arxiv.org/ every day.

Python 45 11 Updated Oct 15, 2019

bytedance / SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,174 93 Updated Mar 4, 2025

eric-ai-lab / MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 862 52 Updated Dec 12, 2024

PKU-YuanGroup / Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 916 45 Updated Oct 16, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

14,115 906 Updated Mar 5, 2025

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,939 486 Updated Aug 6, 2024

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,308 1,003 Updated Nov 18, 2024

mli / paper-reading

深度学习经典、新论文逐段精读

29,325 2,589 Updated Nov 17, 2024

HumanSignal / labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 23,336 6,394 Updated Jun 7, 2024

bnsreenu / python_for_microscopists

https://www.youtube.com/channel/UC34rW-HtPJulxr5wp2Xa04w?sub_confirmation=1

Jupyter Notebook 4,128 2,455 Updated Feb 13, 2025

rajatjain3571 / Lucas-Kanade-Optical-Flow-

Lucas Kanade Implementation with and without pyramid

Python 14 8 Updated Dec 14, 2017

cheind / image-align

Variants of the classic Lucas-Kanade image alignment algorithm

C++ 154 53 Updated Jan 19, 2017

matiasdm / 3DReconstructionViaStructuredLight

C 3 1 Updated Jul 23, 2020

lichangke / DesignPattern

设计模式C++实现以及说明

13 7 Updated Jan 12, 2021

NVlabs / SegFormer

Official PyTorch implementation of SegFormer

Python 2,746 375 Updated Aug 2, 2024

fiji / Stitching

Fiji's Stitching plugins reconstruct big images from tiled input images.

Java 100 64 Updated Jan 22, 2025

axelbrando / Mixture-Density-Networks-for-distribution-and-uncertainty-estimation

A generic Mixture Density Networks (MDN) implementation for distribution and uncertainty estimation by using Keras (TensorFlow)

Jupyter Notebook 346 91 Updated Jun 30, 2017

apachecn / hands-on-ml-zh

📖 [译] Sklearn 与 TensorFlow 机器学习实用指南【版权问题，网站已下线！！】

CSS 3,739 1,537 Updated Aug 9, 2021

ageron / handson-ml2

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 28,492 12,927 Updated Jun 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MarStarck

Achievements

Achievements

Block or report MarStarck

Stars

om-ai-lab / VLM-R1

HarderThenHarder / RLLoggingBoard

Pengfei8324 / chinese_license_plate_generator

detectRecog / CCPD

njvisionpower / Safety-Helmet-Wearing-Dataset

ruotianluo / self-critical.pytorch

naginoa / LLMs_interview_notes

deezer / spleeter

RERV / VDT