  • Beijing,China


This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

690 40 Updated Oct 22, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,171 4,451 Updated Dec 14, 2024

Chinese named entity recognition using a PyTorch-based GlobalPointer.

Python 36 3 Updated Jul 7, 2023

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 3,161 453 Updated Aug 6, 2024

TensorFlow solution for the NER task using a BiLSTM-CRF model with Google BERT fine-tuning, plus private server services

Python 4,741 1,257 Updated Feb 24, 2021

PyTorch implementation of the PaddleNLP UIE model

Python 599 101 Updated Aug 13, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,887 4,168 Updated Dec 16, 2024

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,330 1,280 Updated Aug 31, 2024

RoBERTa Chinese pre-trained models: RoBERTa for Chinese

Python 2,638 410 Updated Jul 22, 2024

An example, built on SSD, of generating the txt files used by mAP-plotting code. (The goal is to generate the txt files.)

Python 129 40 Updated Jan 31, 2021

ERNIE Pytorch Version

Python 917 120 Updated Jul 26, 2023

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm series models)

Python 9,745 1,389 Updated Jul 31, 2023

The uncompromising Python code formatter

Python 39,330 2,480 Updated Dec 11, 2024

A pytorch implementation of Attention is all you need

Python 90 18 Updated Dec 16, 2018

Baichuan-13B instruction fine-tuning

Python 89 9 Updated Jul 14, 2023

A yolov7 library that can be used to train on your own dataset.

Python 881 154 Updated Aug 27, 2023

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 96,260 15,650 Updated Dec 16, 2024

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,228 110 Updated Apr 3, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,271 4,573 Updated Dec 10, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,312 827 Updated Dec 16, 2024

Example models using DeepSpeed

Python 6,157 1,051 Updated Dec 14, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,954 695 Updated Aug 14, 2024

🩹Editing large language models within 10 seconds⚡

Python 1,294 93 Updated Aug 13, 2023

Fast and memory-efficient exact attention

Python 14,649 1,375 Updated Dec 15, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,252 187 Updated Aug 11, 2024

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 2,460 354 Updated Feb 5, 2024

Aligning pretrained language models with instruction data generated by themselves.

Python 4,201 489 Updated Mar 27, 2023

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,680 505 Updated Jul 18, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,371 119 Updated Jun 13, 2024

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,243 164 Updated Jul 25, 2023