Stars
Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect
All kinds of neural text classifiers implemented by Keras
This repository contains the code and data download links to reproduce the experiments of the PVLDB paper "Dual-Objective Fine-Tuning of BERT for Entity Matching" by Ralph Peeters and Christian Bizer.
Predict Race and Ethnicity Based on the Sequence of Characters in a Name
Code and data used in named entity transliteration experiments
Fast, flexible name matching for large datasets
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Rapid fuzzy string matching in Python using various string metrics
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Re…
A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
A curated list of resources on document similarity measures (papers, tutorials, code, ...)
A curated list of resources dedicated to Natural Language Processing (NLP)
nannerhammix / awesome-nlp
Forked from keon/awesome-nlp. A curated list of resources dedicated to Natural Language Processing (NLP)
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
State of the Art Natural Language Processing
An implementation of DBSCAN running on top of Apache Spark
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
PyTorch original implementation of Cross-lingual Language Model Pretraining.
Streamlit: A faster way to build and share data apps.
Data Apps & Dashboards for Python. No JavaScript Required.
Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
Neural network visualization toolkit for Keras