Stars
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
PyTorch Tutorial for Deep Learning Researchers
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
OpenChat: Advancing Open-source Language Models with Imperfect Data
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social …
General technology for enabling AI capabilities w/ LLMs and MLLMs
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
Minimal, clean example of lstm neural network training in python, for learning purposes.
novel deep learning research works with PaddlePaddle
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Unofficial PyTorch implementation of "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence"
CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017
Optimus: the first large-scale pre-trained VAE language model
Offsite-Tuning: Transfer Learning without Full Model
Implementation of Meta-Learning with Latent Embedding Optimization
PyContinual (An Easy and Extendible Framework for Continual Learning)