Skip to content
View qianxinchun's full-sized avatar

Block or report qianxinchun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,282 1,326 Updated Feb 1, 2025

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,339 249 Updated Feb 7, 2025

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

HTML 120,581 16,218 Updated Feb 13, 2025

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 6,013 608 Updated Jan 8, 2025

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

148 3 Updated Feb 14, 2025

MR project investigating the mediating role of mammographic density in the childhood body size and breast cancer relationship

R 10 1 Updated May 14, 2024

LLM101n: Let's build a Storyteller

31,805 1,727 Updated Aug 1, 2024

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

Python 697 58 Updated Jun 24, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 758 39 Updated Feb 17, 2025

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 5,819 513 Updated Jan 24, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,236 44 Updated Feb 18, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,707 186 Updated Aug 17, 2024

NanoGPT (124M) in 3 minutes

Python 2,288 242 Updated Feb 17, 2025

Sequence-level 1F1B schedule for LLMs.

Python 17 3 Updated Jun 4, 2024

An ML Systems Onboarding list

692 23 Updated Jan 24, 2025

[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 262 9 Updated Dec 4, 2024

More relighting!

Python 7,544 462 Updated Nov 28, 2024

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,032 42 Updated May 31, 2024

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 164 11 Updated Jul 25, 2024

We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.

Python 832 70 Updated Jul 6, 2024

Schedule free optimiser implemented in JAX using Optimistix

Python 14 Updated May 29, 2024

Schedule-Free Optimization in PyTorch

Python 2,099 71 Updated Dec 2, 2024

A generative speech model for daily dialogue.

Python 34,475 3,721 Updated Feb 18, 2025
Jupyter Notebook 198 17 Updated May 27, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 291 12 Updated Dec 7, 2024

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 654 34 Updated Jan 21, 2025

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 498 35 Updated Sep 23, 2024

Large Action Model framework to develop AI Web Agents

Python 5,889 535 Updated Jan 21, 2025

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,580 164 Updated Feb 24, 2024

Structured Text Generation

Python 10,719 561 Updated Feb 17, 2025
Next
Showing results