Stars
A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
A fork to add multimodal model training to open-r1
Fully open reproduction of DeepSeek-R1
RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.
Benchmarking LLMs' Gaming Ability in Multi-Agent Environments
Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Official inference repo for FLUX.1 models
GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
(NeurIPS 2024 Spotlight) TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment
A minimal and universal controller for FLUX.1.
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Video-LLaVA fine-tuning for CinePile evaluation
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
Fast and memory-efficient exact attention
A personal investigative project to track the latest progress in the field of multi-modal object tracking.
(arXiv:2405.18406) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
[NeurIPS 2024] Accepted as a Spotlight presentation paper
SEED-Story: Multimodal Long Story Generation with Large Language Model