Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 7,765 1,010 Updated Feb 1, 2025

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.

TypeScript 3,306 180 Updated Feb 2, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,518 4,224 Updated Feb 5, 2025

An LLM playground you can run on your laptop

TypeScript 6,318 488 Updated Jan 17, 2025

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

HTML 119,717 16,131 Updated Feb 5, 2025

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,448 704 Updated Jan 28, 2025

Expanding natural instructions

Python 971 191 Updated Dec 11, 2023

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,606 213 Updated Dec 5, 2024

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,643 356 Updated Jan 28, 2025

Deep learning based content moderation from text, audio, video & image input modalities.

330 18 Updated Dec 16, 2024

Letta (formerly MemGPT) is a framework for creating LLM services with memory.

Python 14,317 1,537 Updated Feb 5, 2025

Building modular LMs with parameter-efficient fine-tuning.

Python 96 13 Updated Feb 5, 2025

[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"

Python 104 5 Updated Jun 20, 2024

JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

Java 355 17 Updated Apr 8, 2024

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio…

Python 1,979 173 Updated Nov 7, 2024

[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Python 583 55 Updated Jan 3, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 20,568 2,523 Updated Feb 4, 2025

A curated list of awesome data labeling tools

3,898 442 Updated Jun 17, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 119,167 9,536 Updated Feb 5, 2025

A proof-of-concept project that showcases the potential for using small, locally trainable LLMs to create next-generation documentation tools.

Python 515 38 Updated Apr 9, 2023

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,089 1,299 Updated Jan 27, 2025

Code for the paper Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration

Python 93 15 Updated Jul 15, 2022

📋 A list of open LLMs available for commercial use.

11,607 795 Updated Feb 3, 2025

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,111 2,315 Updated Feb 4, 2025

Experiments for "Automatic Calibration and Error Correction for Large Language Models via Pareto Optimal Self-Supervision"

Jupyter Notebook 13 2 Updated Aug 4, 2023

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python 1,360 123 Updated Jan 16, 2025

Inference Llama 2 in one file of pure C

C 17,970 2,185 Updated Aug 6, 2024

Atmospheric adventure chat for AI language models (KoboldAI, NovelAI, Pygmalion, OpenAI chatgpt, gpt-4)

JavaScript 2,283 300 Updated Dec 5, 2024

Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.

TypeScript 797 75 Updated Feb 5, 2025