Skip to content

Latest commit

 

History

History
253 lines (182 loc) · 8 KB

NEWS.md

File metadata and controls

253 lines (182 loc) · 8 KB

dbscan 1.2.0-1 (2024-xx-xx)

Changes

  • Various refactoring by m-muecke

New Features

  • HDBSCAN gained parameter cluster_selection_epsilon to implement clusters selected from Malzer and Baum (2020).
  • Functions ncluster() and nnoise() were added.
  • hullplot now() marks noise as x.
  • Added clplot().
  • pointdensity now also accepts a dist object as input and has the new type "gaussian" to calculate a Gaussian kernel estimate.
  • Added the DBCV index.

Bugfixes

  • extractFOCS: Fixed total_score.

dbscan 1.2-0 (2024-06-28)

New Features

  • dbscan has now tidymodels tidiers (glance, tidy, augment).
  • kNNdistplot can now plot a range of k/minPts values.
  • added stats::nobs methods for the clusterings.
  • kNN and frNN now contains the used distance metric.

Changes

  • dbscan component dist was renamed to metric.
  • Removed redundant sort in kNNdistplot (reported by Natasza Szczypien).
  • Refactoring use anyNA(x) instead of any(is.na(x)) and many more (by m-muecke).
  • Reorganized the C++ source code.
  • README now uses bibtex.
  • Tests use now testthat edition 3 (m-muecke).

dbscan 1.1-12 (2023-11-28)

Bugfixes

  • point_density checks now for missing values (reported by soelderer).
  • Removed C++11 specification.
  • ANN.cpp: fixed Rprintf warning.

dbscan 1.1-11 (2022-10-26)

New Features

  • kNNdistplot gained parameter minPts.
  • dbscan now retains information on distance method and border points.
  • HDBSCAN now supports long vectors to work with larger distance matrices.
  • conversion from dist to kNN and frNN is now more memory efficient. It does no longer coerce the dist object into a matrix of double the size, but extract the distances directly from the dist object.
  • Better description of how predict uses only Euclidean distances and more error checking.
  • The package now exports a new generic for as.dendrogram().

Bugfixes

  • is.corepoint() now uses the correct epsilon value (reported by Eng Aun).
  • functions now check for cluster::dissimilariy objects which have class dist but missing attributes.

dbscan 1.1-10 (2022-01-14)

New Features

  • is.corepoint() for DBSCAN.
  • coredist() and mrdist() for HDBSCAN.
  • find connected components with comps().

Changes

  • reachability plot now shows all undefined distances as a dashed line.

Bugfixes

  • memory leak in mrd calculation fixed.

dbscan 1.1-9 (2022-01-10)

Changes

  • We use now roxygen2.

New Features

  • Added predict for hdbscan (as suggested by moredatapls)

dbscan 1.1-8 (2021-04-26)

Bugfixes

  • LOF: fixed numerical issues with k-nearest neighbor distance on Solaris.

dbscan 1.1-7 (2021-04-21)

Bugfixes

  • Fixed description of k in knndistplot and added minPts argument.
  • Fixed bug for tied distances in lof (reported by sverchkov).

Changes

  • lof: the density parameter was changes to minPts to be consistent with the original paper and dbscan. Note that minPts = k + 1.

dbscan 1.1-6 (2021-02-24)

Improvements

  • Improved speed of LOF for large ks (following suggestions by eduardokapp).
  • kNN: results is now not sorted again for kd-tree queries which is much faster (by a factor of 10).
  • ANN library: annclose() is now only called once when the package is unloaded. This is in preparation to support persistent kd-trees using external pointers.
  • hdbscan lost parameter xdist.

Bugfixes

  • removed dependence on methods.
  • fixed problem in hullplot for singleton clusters (reported by Fernando Archuby).
  • GLOSH now also accepts data.frames.
  • GLOSH returns now 0 instead of NaN if we have k duplicate points in the data.

dbscan 1.1-5 (2019-10-22)

New Features

  • kNN and frNN gained parameter query to query neighbors for points not in the data.
  • sNN gained parameter jp to decide if the shared NN should be counted using the definition by Jarvis and Patrick.

dbscan 1.1-4 (2019-08-05)

New Features

  • kNNdist gained parameter all to indicate if a matrix with the distance to all nearest neighbors up to k should be returned.

Bugfixes

  • kNNdist now correctly returns the distances to the kth neighbor (reported by zschuster).
  • dbscan: check eps and minPts parameters to avoid undefined results (reported by ArthurPERE).

dbscan 1.1-3 (2018-11-12)

Bugfixes

  • pointdensity was double counting the query point (reported by Marius Hofert).

dbscan 1.1-2 (2018-05-18)

New Features

  • OPTICS now calculates eps if it is omitted.

Bugfixes

  • Example now only uses igraph conditionally since it is unavailable on Solaris (reported by B. Ripley).

dbscan 1.1-1 (2017-03-19)

Bugfixes

  • Fixed problem with constant name on Solaris in ANN code (reported by B. Ripley).

dbscan 1.1-0 (2017-03-18)

New Features

  • HDBSCAN was added.
  • extractFOSC (optimal selection of clusters for HDBSCAN) was added.
  • GLOSH outlier score was added.
  • hullplot uses now filled polygons as the default.
  • hullplot now used PCA if the data has more than 2 dimensions.
  • Added NN superclass for kNN and frNN with plot and with adjacencylist().
  • Added shared nearest neighbor clustering as sNNclust() and sNN to calculate the number of shared nearest neighbors.
  • Added pointdensity function.
  • Unsorted kNN and frNN can now be sorted using sort().
  • kNN and frNN now also accept kNN and frNN objects, respectively. This can be used to create a new kNN (frNN) with a reduced k or eps.
  • Datasets added: DS3 and moon.

Interface Changes

  • Improved interface for dbscan() and optics(): ... it now passed on to frNN.
  • OPTICS clustering extraction methods are now called extractDBSCAN and extractXi.
  • kNN and frNN are now objects with a print function.
  • dbscan now also accepts a frNN object as input.
  • jpclust and sNNclust now return a list instead of just the cluster assignments.

dbscan 1.0-0 (2017-02-02)

New Features

  • The package has now a vignette.
  • Jarvis-Patrick clustering is now available as jpclust().
  • Improved interface for dbscan() and optics(): ... is now passed on to frNN.
  • OPTICS clustering extraction methods are now called extractDBSCAN and extractXi.
  • hullplot uses now filled polygons as the default.
  • hullplot now used PCA if the data has more than 2 dimensions.
  • kNN and frNN are now objects with a print function.
  • dbscan now also accepts a frNN object as input.

dbscan 0.9-8 (2016-08-05)

New Features

  • Added hullplot to plot a scatter plot with added convex cluster hulls.
  • OPTICS: added a predecessor correction step that is used by the ELKI implementation (Matt Piekenbrock).

Bugfixes

  • Fixed a memory problem in frNN (reported by Yilei He).

dbscan 0.9-7 (2016-04-14)

  • OPTICSXi is now implemented (thanks to Matt Piekenbrock).
  • DBSCAN now also accepts MinPts (with a capital M) to be compatible with the fpc version.
  • DBSCAN objects are now also of class db scan_fast to avoid clashes with fpc.
  • DBSCAN and OPTICS have now predict functions.
  • Added test for unhandled NAs.
  • Fixed LOF for more than k duplicate points (reported by Samneet Singh).

dbscan 0.9-6 (2015-12-14)

  • OPTICS: fixed second bug reported by Di Pang
  • all methods now also accept dist objects and have a search method "dist" which precomputes distances.

dbscan 0.9-5 (2015-10-04)

  • OPTICS: fixed bug with first observation reported by Di Pang
  • OPTICS: clusterings can now be extracted using optics_cut

dbscan 0.9-4 (2015-09-17)

  • added tests (testthat).
  • input data is now checked if it can safely be coerced into a numeric matrix (storage.mode double).
  • fixed self matches in kNN and frNN (now returns the first NN correctly).

dbscan 0.9-3 (2015-9-2)

  • Added weights to DBSCAN.

dbscan 0.9-2 (2015-08-11)

  • Added kNN interface.
  • Added frNN (fixed radius NN) interface.
  • Added LOF.
  • Added OPTICS.
  • All algorithms check now for interrupt (CTRL-C/Esc).
  • DBSCAN now returns a list instead of a numeric vector.

dbscan 0.9-1 (2015-07-21)

  • DBSCAN: Improved speed by avoiding repeated sorting of point ids.
  • Added linear NN search option.
  • Added fast calculation for kNN distance.
  • fpc and microbenchmark are now used conditionally in the examples.

dbscan 0.9-0 (2015-07-15)

  • initial release