Stars
Awesome-LLM: a curated list of Large Language Model resources
[NeurIPS'24 Spotlight] Observational Scaling Laws
Awesome Lists for Tenure-Track Assistant Professors and PhD students (a survival guide for assistant professors and PhD students)
A quick guide to trending instruction fine-tuning datasets
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales
Transformer training code for sequential tasks
Standalone TFRecord reader/writer with PyTorch data loaders
An implementation of GPT-2 training with TPU support
Open Academic Research on Improving LLaMA to SOTA LLM
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
Example models using DeepSpeed
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
PaddlePaddle large-model development kit, providing end-to-end development toolchains for large language models, cross-modal large models, biocomputing large models, and other domains.
PaddleSlim is an open-source library for deep model compression and architecture search.
🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation
🎁[ChatGPT4MTevaluation] Error Analysis Prompt for MT Evaluation in ChatGPT
🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable), so it combines the best of the RNN and the transformer: great performance, fast inference,…
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A procedural Blender pipeline for photorealistic training image generation