Skip to content
View sublimationAC's full-sized avatar
🎯
Focusing
🎯
Focusing
  • XDU & USYD
  • Xi'an

Block or report sublimationAC

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome-LLM: a curated list of Large Language Model

19,527 1,612 Updated Dec 19, 2024

[NeurIPS'24 Spotlight] Observational Scaling Laws

Jupyter Notebook 46 3 Updated Oct 2, 2024

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)

Python 1,476 85 Updated Feb 1, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,716 175 Updated Nov 28, 2023

Supercharge Your Model Training

Python 5,197 427 Updated Dec 16, 2024
Python 162 14 Updated Nov 13, 2023

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 566 48 Updated Mar 4, 2024

Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales

Python 32 1 Updated Jul 17, 2023

Transformer training code for sequential tasks

Python 609 60 Updated Sep 14, 2021

Standalone TFRecord reader/writer with PyTorch data loaders

Python 872 107 Updated Aug 20, 2024

An implementation of training for GPT2, supports TPUs

Python 1,423 335 Updated Dec 12, 2022

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,613 103 Updated Aug 30, 2023

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,546 1,872 Updated Apr 30, 2024
Python 300 22 Updated Apr 6, 2023

Example models using DeepSpeed

Python 6,166 1,054 Updated Dec 14, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,601 351 Updated Dec 7, 2024

飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

Python 448 162 Updated May 24, 2024

PaddleSlim is an open-source library for deep model compression and architecture search.

Python 1,566 347 Updated Dec 4, 2024

🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation

Python 72 2 Updated Mar 25, 2024

🎁[ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT

Python 88 3 Updated Jan 15, 2024

🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT

Python 192 9 Updated Apr 17, 2023

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,478 1,496 Updated Dec 21, 2024

Repo for external large-scale work

Python 6,520 729 Updated Apr 27, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,825 876 Updated Dec 20, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,980 694 Updated Dec 17, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,193 5,699 Updated Sep 18, 2024
Python 611 64 Updated Aug 20, 2023

A procedural Blender pipeline for photorealistic training image generation

Python 2,877 454 Updated Dec 16, 2024

详细的C/C++编程规范指南,由360质量工程部编著,适用于桌面、服务端及嵌入式软件系统。

2,495 293 Updated Oct 19, 2024
Next