Stars
Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project) with 64K long-context models
Secrets of RLHF in Large Language Models Part I: PPO
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Chinese LLaMA & Alpaca LLMs with local CPU/GPU training and deployment
DRLib: a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.
Author's PyTorch implementation of TD3 for OpenAI gym tasks
A modular RL library to fine-tune language models to human preferences
An offline deep reinforcement learning library
Source code for Twitter's Recommendation Algorithm
Chinese version of GPT2 training code, using BERT tokenizer.
Large-scale Pre-training Corpus for Chinese (100G)
PyTorch implementation of CoCon: A Self-Supervised Approach for Controlled Text Generation
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Must-read papers on prompt-based tuning for pre-trained language models.
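Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English text, various Chinese word vectors, comprehensive company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …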
☁️ Build multimodal AI applications with cloud-native stack
PyTorch implementations of algorithms for density estimation
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)
The implementation of “Gradient Harmonized Single-stage Detector” published at AAAI 2019.
Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`
Joint Extraction of Entities and Relations Based on CNN+RNN
Implementation of active learning based on BiLSTM-CRF
GPT2 for Multiple Languages, including pretrained models and a 1.5-billion-parameter Chinese pretrained model
Gustafson-Kessel & Fuzzy C-Means Implementation