-
-
-
-
-
RWKV-LM-LoRA-PP Public
Forked from Blealtan/RWKV-LM-LoRARWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, …
Python Apache License 2.0 UpdatedAug 30, 2023 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedAug 24, 2023 -
-
-
-
-
cs224n-learning-camp Public
Forked from learning511/cs224n-learning-campPython UpdatedFeb 15, 2019