View qianxinchun's full-sized avatar

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,084 1,411 Updated Mar 10, 2025

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,780 286 Updated Mar 4, 2025

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

HTML 121,368 16,312 Updated Mar 3, 2025

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 6,173 617 Updated Jan 8, 2025

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

172 4 Updated Mar 1, 2025

MR project investigating the mediating role of mammographic density in the childhood body size and breast cancer relationship

R 11 1 Updated May 14, 2024

LLM101n: Let's build a Storyteller

32,318 1,748 Updated Aug 1, 2024

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

Python 704 58 Updated Jun 24, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 813 46 Updated Mar 8, 2025

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 6,102 547 Updated Mar 7, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,310 45 Updated Mar 11, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,746 191 Updated Aug 17, 2024

NanoGPT (124M) in 3 minutes

Python 2,366 261 Updated Mar 11, 2025

Sequence-level 1F1B schedule for LLMs.

Python 17 3 Updated Jun 4, 2024

An ML Systems Onboarding list

726 25 Updated Jan 24, 2025

[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 267 9 Updated Dec 4, 2024

More relighting!

Python 7,668 469 Updated Feb 20, 2025

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,037 42 Updated May 31, 2024

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 168 11 Updated Jul 25, 2024

We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.

Python 834 70 Updated Jul 6, 2024

Schedule free optimiser implemented in JAX using Optimistix

Python 14 Updated May 29, 2024

Schedule-Free Optimization in PyTorch

Python 2,109 71 Updated Feb 28, 2025

A generative speech model for daily dialogue.

Python 35,010 3,782 Updated Feb 18, 2025
Jupyter Notebook 200 18 Updated May 27, 2024

[CVPR'25] RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 305 12 Updated Mar 4, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 696 39 Updated Mar 5, 2025

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 499 35 Updated Sep 23, 2024

Large Action Model framework to develop AI Web Agents

Python 5,937 540 Updated Jan 21, 2025

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,630 167 Updated Feb 24, 2024

Structured Text Generation

Python 10,966 570 Updated Mar 10, 2025