Lists (1)
Sort Name ascending (A-Z)
Stars
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.
MR project investigating the mediating role of mammographic density in the childhood body size and breast cancer relationship
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
Doing simple retrieval from LLM models at various context lengths to measure accuracy
MayDomine / Seq1F1B
Forked from NVIDIA/Megatron-LMSequence-level 1F1B schedule for LLMs.
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
Schedule free optimiser implemented in JAX using Optimistix
Schedule-Free Optimization in PyTorch
A generative speech model for daily dialogue.
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Pandora: Towards General World Model with Natural Language Actions and Video States
Large Action Model framework to develop AI Web Agents
A Bulletproof Way to Generate Structured JSON from Language Models