GitHub - tuthmose/Clustering: Clustering and combinatorial optimization algorithms and internal validation criteria.

Clustering algorithms and external/internal validation criteria

src/ includes source files for clustering and clustering metrics metods devel/ includes work in progress modules test/ include jupyter notebooks with various tests validation/ includes notes about clustering validation

Algorithms included were selected because:

they were not present at the time in scikit-learn or extras (but perhaps they are now) e. g. Density Peaks or SNN
self teaching

all clustering algorithms are imported by myclusters.py all validation ones are imported by mymetrics.py mdutils includes various helper functions for Molecular Dynamics trajectories including, e. g. USR distance

Implementation is pure Python/numpy and slow. I did not even bother with unraveling triangular matrices (not often at least).

The following conventions in source and notebook holds:

X when given or defined, is ALWAYS the feature or coordinate matrix, [npoints x nfeatures]
D, the distance matrix is always expected to be a nxn square symmetric matrix with pair distances between the data ALL data set elements: 0 1 2 3 0 - d00 d01 ... 1 - d10 d11 ... 2 - ... 3 - ...
W are the weights of data points ([npoints x 1] or None)
clusters is always expected to be a list of all elements the data set where the elements are identified with the label of the corresponding CLUSTER (0 to n) or cluster centroids IF AVAILABLE; If centroids are available set(clusters) ALWAYS gives the centroids labels cluster labels are ALWAYS positive integers a label of -1 ALWAYS indentifies noise.

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
src		src
test		test
validation		validation
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
commit.txt		commit.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clustering algorithms and external/internal validation criteria

About

Releases

Packages

Languages

License

tuthmose/Clustering

Folders and files

Latest commit

History

Repository files navigation

Clustering algorithms and external/internal validation criteria

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages