A library and microservice implementing the health and care terminology SNOMED CT with support for cross-maps, inference, fast full-text search, autocompletion, compositional grammar and the expres…

Clojure 177 23 Updated Jan 9, 2025

microsoft / BioGPT

Python 4,350 454 Updated Jul 25, 2024

lk-geimfari / mimesis

Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.

Python 4,475 338 Updated Jan 7, 2025

vinhsuhi / JMAC

Joint Multilingual Knowledge Graph Completion and Alignment (Findings of EMNLP 2022) (Pytorch)

Python 34 Updated Oct 23, 2022

thelinhbkhn2014 / VnCoreNLP_Wrapper

Python 25 5 Updated Aug 28, 2024

yandex / YaLM-100B

Pretrained language model with 100B parameters

Python 3,748 299 Updated Jul 10, 2023

princeton-nlp / CoFiPruning

[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408

Python 192 31 Updated May 9, 2023

juand-r / entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Python 1,512 247 Updated Nov 29, 2024

TencentARC / GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 36,173 5,989 Updated Jul 26, 2024

sdras / awesome-actions

A curated list of awesome actions to use on GitHub

25,520 1,504 Updated Sep 1, 2024

khuyentran1401 / Data-science

Collection of useful data science topics along with articles, videos, and code

Jupyter Notebook 4,070 1,026 Updated Oct 12, 2024

juntaoy / biaffine-ner

Named Entity Recognition as Dependency Parsing

Python 348 39 Updated Aug 16, 2023

IntelLabs / academic-budget-bert

Repository containing code for "How to Train BERT with an Academic Budget" paper

Python 310 47 Updated Sep 18, 2023

thunlp / PLMpapers

Must-read Papers on pre-trained language models.

3,336 436 Updated Nov 6, 2022

HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 20,283 2,496 Updated Jan 17, 2025

kocmitom / LanideNN

Python 19 11 Updated Jan 21, 2021

facebookresearch / SentAugment

SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or…

Python 361 34 Updated Feb 22, 2022

hoangnguyence / hpconcrete

Python 4 1 Updated Oct 17, 2020

vahidk / EffectiveTensorflow

TensorFlow tutorials and best practices.

8,620 905 Updated Oct 22, 2020

nfmcclure / tensorflow_cookbook

Code for Tensorflow Machine Learning Cookbook

Jupyter Notebook 6,244 2,412 Updated May 23, 2024

salesforce / GeDi

GeDi: Generative Discriminator Guided Sequence Generation

Python 208 46 Updated Sep 28, 2022

adapter-hub / adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,634 355 Updated Jan 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thanh Vu tienthanhdhcn

Achievements

Achievements

Block or report tienthanhdhcn

Stars

huggingface / text-embeddings-inference

vllm-project / vllm

aehrc / LAAT

rustformers / llm

artidoro / qlora

lm-sys / FastChat

microsoft / DeepSpeedExamples

langchain-ai / langchain

wardle / hermes