Skip to content
View steven0912wq's full-sized avatar

Block or report steven0912wq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,306 120 Updated Mar 13, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,070 1,299 Updated Feb 1, 2025

Fully open reproduction of DeepSeek-R1

Python 19,948 1,712 Updated Feb 15, 2025

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Jupyter Notebook 1,705 155 Updated Mar 13, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

5,096 764 Updated Sep 24, 2024

DeepSeek Coder: Let the Code Write Itself

Python 19,487 2,172 Updated May 21, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 8,972 943 Updated Feb 15, 2025

SoTA LLM for converting natural language questions to SQL queries

Jupyter Notebook 3,562 230 Updated May 23, 2024

DataComp for Language Models

HTML 1,225 112 Updated Dec 11, 2024

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 2,732 328 Updated Feb 14, 2025

Efficient Image Captioning code in Torch, runs on GPU

Jupyter Notebook 5,521 1,260 Updated Nov 7, 2017

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.

Python 5,413 1,322 Updated Dec 22, 2020

Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

Lua 11,705 2,599 Updated Oct 24, 2023

Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world application of a neural net trained with backpropagation.

Jupyter Notebook 622 68 Updated Feb 3, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,400 893 Updated Jul 1, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 13,206 1,814 Updated Aug 18, 2024

LLM101n: Let's build a Storyteller

31,724 1,722 Updated Aug 1, 2024
Jupyter Notebook 3,406 1,011 Updated Jul 9, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 36,200 3,235 Updated Feb 15, 2025

Code for the manim-generated scenes used in 3blue1brown videos

Python 8,907 1,834 Updated Jan 9, 2025

Animation engine for explanatory math videos

Python 75,025 6,540 Updated Jan 8, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 11,171 1,634 Updated Aug 8, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,866 560 Updated Aug 13, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,435 1,320 Updated Feb 11, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 23,323 2,302 Updated Feb 13, 2025

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 21,127 5,437 Updated Feb 14, 2025

基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

Python 34,562 8,819 Updated Feb 5, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,217 167 Updated Feb 12, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,687 1,895 Updated Nov 19, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,123 841 Updated Feb 13, 2025
Next