胡译文 huyiwen

Undergraduate majoring in AI and Fintech. WeChat: yiwen_hu

33 followers · 37 following

Renmin University of China
Beijing
01:55 (UTC +08:00)

Lists (1)

🚀 My stack

3 repositories

Starred repositories

RUC-GSAI / Yulan-GARDEN

Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"

Python 58 9 Updated Aug 27, 2024

huggingface / datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,100 154 Updated Dec 11, 2024

huggingface / smollm

Everything about the SmolLM & SmolLM2 family of models

Python 1,423 67 Updated Dec 2, 2024

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 2,139 202 Updated Apr 24, 2024

BlinkDL / RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,776 874 Updated Dec 13, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,634 308 Updated Dec 15, 2024

RUCAIBox / Slow_Thinking_with_LLMs

A series of technical reports on Slow Thinking with LLMs

29 Updated Dec 13, 2024

leanprover / theorem_proving_in_lean4

Theorem Proving in Lean 4

JavaScript 166 94 Updated Oct 14, 2024

leanprover-community / lean4-metaprogramming-book

Lean 230 56 Updated Nov 5, 2024

suntong / html2md

HTML to Markdown converter

Go 231 19 Updated Nov 11, 2024

NVIDIA / FasterTransformer

Transformer-related optimization, including BERT, GPT

C++ 5,932 895 Updated Mar 27, 2024

Xnhyacinth / Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,065 40 Updated Dec 13, 2024

GuanYixuan / 2023_little_car

Tsinghua University 2023 "Intelligent Electromechanical Systems Practice": code repository of the "Off-Field Camera Group"

C 8 Updated Jul 1, 2024

imbue-ai / cluster-health

Python 281 38 Updated Aug 20, 2024

NVIDIA / nccl-tests

NCCL Tests

Cuda 931 248 Updated Nov 1, 2024

deepseek-ai / DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

2,319 123 Updated Sep 24, 2024

sirluk / pytorch_incremental_pca

A GPU-compatible PyTorch implementation of Incremental PCA for memory-efficient dimensionality reduction on large datasets.

Python 2 Updated Dec 5, 2024

OI-wiki / OI-wiki

🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring cool arithmetic magic)

TypeScript 21,626 4,024 Updated Dec 15, 2024

saprmarks / dictionary_learning

Python 159 40 Updated Oct 22, 2024

shuyhere / Awesome-Sparse-Autoencoder

Collection of Reverse Engineering in Large Model

31 Updated Nov 6, 2024

alon-albalak / data-selection-survey

A Survey on Data Selection for Language Models

193 10 Updated Oct 13, 2024

allenai / ScienceWorld

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Scala 226 26 Updated Oct 16, 2024

deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 861 53 Updated Apr 15, 2024

leanprover-community / repl

A simple REPL for Lean 4, returning information about errors and sorries.

Lean 86 25 Updated Dec 2, 2024

FormalizedFormalLogic / Foundation

Lean4 Logic Formalization

Lean 86 5 Updated Dec 15, 2024

Paper-Proof / paperproof

Lean theorem proving interface which feels like pen-and-paper proofs.

TypeScript 371 10 Updated Nov 13, 2024

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,307 223 Updated Dec 12, 2024

kubaPod / M2MD

Simple converter of Mathematica notebooks to markdown.

Mathematica 46 13 Updated Nov 13, 2023

microsoft / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,922 344 Updated Dec 5, 2024

tensorzero / tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 1,192 50 Updated Dec 15, 2024