Skip to content
View nannerhammix's full-sized avatar

Block or report nannerhammix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

gRPC clients and servers in R

C++ 75 25 Updated Feb 23, 2023

Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect

R 14 3 Updated Oct 29, 2024

All kinds of neural text classifiers implemented by Keras

Python 64 21 Updated Sep 20, 2019

This repository contains the code and data download links to reproduce the experiments of the PVLDB paper "Dual-Objective Fine-Tuning of BERT for Entity Matching" by Ralph Peeters and Christian Bizer.

Python 14 6 Updated Jun 7, 2021

Tidy interface to 'data.table'

R 452 32 Updated Dec 11, 2024

Tidy interface to polars

Python 348 11 Updated Nov 2, 2024

Predict Race and Ethnicity Based on the Sequence of Characters in a Name

Jupyter Notebook 235 66 Updated Jun 13, 2024

Code and data used in named entity transliteration experiments

Python 57 8 Updated Jun 4, 2018

Transliteration data and models

54 30 Updated Nov 19, 2016

Fast, flexible name matching for large datasets

Python 70 9 Updated Dec 15, 2023

Fuzzy document finding in Ruby

Ruby 23 8 Updated Oct 18, 2017

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Python 809 121 Updated Dec 21, 2024

Rapid fuzzy string matching in Python using various string metrics

C++ 2,796 120 Updated Dec 23, 2024

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

C# 3,175 298 Updated Oct 27, 2024

πŸ“ Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

Python 3,422 251 Updated Sep 9, 2024

🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Re…

Scala 486 80 Updated Jul 28, 2017

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

Python 307 54 Updated Aug 24, 2022

A curated list of resources on document similarity measures (papers, tutorials, code, ...)

238 24 Updated Jul 13, 2022

πŸ“– A curated list of resources dedicated to Natural Language Processing (NLP)

16,851 2,588 Updated Nov 13, 2023

πŸ“– A curated list of resources dedicated to Natural Language Processing (NLP)

1 Updated Dec 7, 2021

πŸ€— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,745 27,383 Updated Dec 25, 2024

State of the Art Natural Language Processing

Scala 3,895 715 Updated Dec 25, 2024

An implementation of DBSCAN runing on top of Apache Spark

Scala 184 58 Updated Jan 10, 2018

Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.

Python 2,337 333 Updated Jul 18, 2024

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,894 497 Updated Feb 14, 2023

Streamlit β€” A faster way to build and share data apps.

Python 36,355 3,132 Updated Dec 25, 2024

Data Apps & Dashboards for Python. No JavaScript Required.

Python 21,716 2,092 Updated Dec 20, 2024

πŸ”… Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models

Jupyter Notebook 2,754 335 Updated Dec 9, 2024

Neural network visualization toolkit for keras

Python 2,987 660 Updated Feb 7, 2022

DeepVis Toolbox

Python 4,026 927 Updated Jan 13, 2020
Next