Stars
A blazing fast inference solution for text embeddings models
A high-throughput and memory-efficient inference and serving engine for LLMs
A Label Attention Model for ICD Coding from Clinical Text
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
QLoRA: Efficient Finetuning of Quantized LLMs
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
🦜🔗 Build context-aware reasoning applications
A library and microservice implementing the health and care terminology SNOMED CT with support for cross-maps, inference, fast full-text search, autocompletion, compositional grammar and the expres…
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
Joint Multilingual Knowledge Graph Completion and Alignment (Findings of EMNLP 2022) (Pytorch)
Pretrained language model with 100B parameters
[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
A curated list of awesome actions to use on GitHub
Collection of useful data science topics along with articles, videos, and code
Named Entity Recognition as Dependency Parsing
Repository containing code for "How to Train BERT with an Academic Budget" paper
Label Studio is a multi-type data labeling and annotation tool with standardized output format
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or…
TensorFlow tutorials and best practices.
Code for Tensorflow Machine Learning Cookbook
GeDi: Generative Discriminator Guided Sequence Generation
A Unified Library for Parameter-Efficient and Modular Transfer Learning