huyiwen
  • Renmin University of China
  • Beijing
  • 01:55 (UTC +08:00)

Highlights

  • Pro

Starred repositories

Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"

Python 58 9 Updated Aug 27, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,100 154 Updated Dec 11, 2024

Everything about the SmolLM & SmolLM2 family of models

Python 1,423 67 Updated Dec 2, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 2,139 202 Updated Apr 24, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,776 874 Updated Dec 13, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,634 308 Updated Dec 15, 2024

A series of technical reports on Slow Thinking with LLM

29 Updated Dec 13, 2024

Theorem Proving in Lean 4

JavaScript 166 94 Updated Oct 14, 2024

HTML to Markdown converter

Go 231 19 Updated Nov 11, 2024

Transformer related optimization, including BERT, GPT

C++ 5,932 895 Updated Mar 27, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,065 40 Updated Dec 13, 2024

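Code repository of the "Off-field Camera Group" for the Tsinghua University 2023 course "Intelligent Electromechanical Systems Practice"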

C 8 Updated Jul 1, 2024

Python 281 38 Updated Aug 20, 2024

NCCL Tests

Cuda 931 248 Updated Nov 1, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

2,319 123 Updated Sep 24, 2024

A GPU-compatible PyTorch implementation of Incremental PCA for memory-efficient dimensionality reduction on large datasets.

Python 2 Updated Dec 5, 2024

🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring dazzling arithmetic magic)

TypeScript 21,626 4,024 Updated Dec 15, 2024

Collection of Reverse Engineering in Large Model

31 Updated Nov 6, 2024

A Survey on Data Selection for Language Models

193 10 Updated Oct 13, 2024

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Scala 226 26 Updated Oct 16, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 861 53 Updated Apr 15, 2024

A simple REPL for Lean 4, returning information about errors and sorries.

Lean 86 25 Updated Dec 2, 2024

Lean4 Logic Formalization

Lean 86 5 Updated Dec 15, 2024

Lean theorem proving interface which feels like pen-and-paper proofs.

TypeScript 371 10 Updated Nov 13, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,307 223 Updated Dec 12, 2024

Simple converter of Mathematica notebooks to markdown.

Mathematica 46 13 Updated Nov 13, 2023

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,922 344 Updated Dec 5, 2024

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 1,192 50 Updated Dec 15, 2024