Solve Visual Understanding with Reinforced VLMs

Python 3,830 234 Updated Mar 5, 2025

A visualization tool for deeper understanding and easier debugging of RLHF training.

Python 157 6 Updated Feb 20, 2025

Chinese license plate generator

Python 148 39 Updated Oct 28, 2020

[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition

Python 2,324 573 Updated Feb 15, 2024

Safety helmet wearing detection dataset, with pretrained model

Python 1,497 411 Updated Dec 17, 2019

Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.

Python 998 277 Updated Oct 5, 2023

LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers

465 116 Updated Oct 16, 2023

Deezer source separation library including pretrained models.

Python 26,469 2,891 Updated Jan 24, 2025

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 229 14 Updated May 5, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 23,484 2,326 Updated Mar 5, 2025

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 774 45 Updated Jul 29, 2024

A small spider that helps you fetch your own paper list from https://arxiv.org/ every day.

Python 45 11 Updated Oct 15, 2019

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,174 93 Updated Mar 4, 2025

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 862 52 Updated Dec 12, 2024

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 916 45 Updated Oct 16, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,115 906 Updated Mar 5, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,939 486 Updated Aug 6, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,308 1,003 Updated Nov 18, 2024

Paragraph-by-paragraph close reading of classic and new deep learning papers

29,325 2,589 Updated Nov 17, 2024

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 23,336 6,394 Updated Jun 7, 2024

https://www.youtube.com/channel/UC34rW-HtPJulxr5wp2Xa04w?sub_confirmation=1

Jupyter Notebook 4,128 2,455 Updated Feb 13, 2025

Lucas Kanade Implementation with and without pyramid

Python 14 8 Updated Dec 14, 2017

Variants of the classic Lucas-Kanade image alignment algorithm

C++ 154 53 Updated Jan 19, 2017

C++ implementations and explanations of design patterns

13 7 Updated Jan 12, 2021

Official PyTorch implementation of SegFormer

Python 2,746 375 Updated Aug 2, 2024

Fiji's Stitching plugins reconstruct big images from tiled input images.

Java 100 64 Updated Jan 22, 2025

A generic Mixture Density Networks (MDN) implementation for distribution and uncertainty estimation by using Keras (TensorFlow)

Jupyter Notebook 346 91 Updated Jun 30, 2017

📖 [Translation] Hands-On Machine Learning with Sklearn and TensorFlow [Website taken offline due to copyright issues!!]

CSS 3,739 1,537 Updated Aug 9, 2021

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 28,492 12,927 Updated Jun 13, 2024