---
layout: global
title: Clustering - ML
displayTitle: <a href="ml-guide.html">ML</a> - Clustering
---

This section introduces the Pipeline API for clustering in MLlib.

Latent Dirichlet allocation (LDA)

LDA is implemented as an Estimator that supports both EMLDAOptimizer and OnlineLDAOptimizer, and produces an LDAModel as the base model. Expert users may cast an LDAModel generated by EMLDAOptimizer to a DistributedLDAModel if needed.
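The cast mentioned above can be sketched as follows. This is a minimal illustration, not the project's official example: it assumes a Spark 2.x or later build where `SparkSession` is available, and the input path, the object name `LDACastSketch`, and the parameter values (`k = 10`, `maxIter = 10`) are placeholders chosen for illustration. A pattern match is used instead of a direct `asInstanceOf` so the code still behaves sensibly if the optimizer is switched to `"online"`, which yields a local model.

```scala
import org.apache.spark.ml.clustering.{DistributedLDAModel, LDA, LDAModel}
import org.apache.spark.sql.SparkSession

object LDACastSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("LDACastSketch")
      .master("local[*]") // assumption: local run for illustration
      .getOrCreate()

    // Assumption: a libsvm-formatted file with a "features" vector column.
    // Replace the path with your own data.
    val dataset = spark.read.format("libsvm")
      .load("data/mllib/sample_lda_libsvm_data.txt")

    // Select the EM optimizer; the alternative value is "online".
    val lda = new LDA()
      .setK(10)
      .setMaxIter(10)
      .setOptimizer("em")

    val model: LDAModel = lda.fit(dataset)

    model match {
      // Models trained with the EM optimizer are distributed.
      case distributed: DistributedLDAModel =>
        println(s"Training log-likelihood: ${distributed.trainingLogLikelihood}")
        // Convert to a local model if the distributed state is not needed.
        val local = distributed.toLocal
        println(s"Local model vocabulary size: ${local.vocabSize}")
      // Models trained with the online optimizer are local.
      case _ =>
        println("Model is local; no distributed-only members are available.")
    }

    spark.stop()
  }
}
```

`model.isDistributed` offers the same check as a Boolean when a pattern match is not convenient.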

Refer to the Scala API docs for more details.

{% include_example scala/org/apache/spark/examples/ml/LDAExample.scala %}

Refer to the Java API docs for more details.

{% include_example java/org/apache/spark/examples/ml/JavaLDAExample.java %}