Diffstat (limited to 'docs/ml-clustering.md')
-rw-r--r-- | docs/ml-clustering.md | 30
1 files changed, 30 insertions, 0 deletions
diff --git a/docs/ml-clustering.md b/docs/ml-clustering.md
new file mode 100644
index 0000000000..1743ef43a6
--- /dev/null
+++ b/docs/ml-clustering.md
@@ -0,0 +1,30 @@
+---
+layout: global
+title: Clustering - ML
+displayTitle: <a href="ml-guide.html">ML</a> - Clustering
+---
+
+In this section, we introduce the pipeline API for [clustering in mllib](mllib-clustering.html).
+
+## Latent Dirichlet allocation (LDA)
+
+`LDA` is implemented as an `Estimator` that supports both `EMLDAOptimizer` and `OnlineLDAOptimizer`,
+and generates a `LDAModel` as the base models. Expert users may cast a `LDAModel` generated by
+`EMLDAOptimizer` to a `DistributedLDAModel` if needed.
+
+<div class="codetabs">
+
+Refer to the [Scala API docs](api/scala/index.html#org.apache.spark.ml.clustering.LDA) for more details.
+
+<div data-lang="scala" markdown="1">
+{% include_example scala/org/apache/spark/examples/ml/LDAExample.scala %}
+</div>
+
+<div data-lang="java" markdown="1">
+
+Refer to the [Java API docs](api/java/org/apache/spark/ml/clustering/LDA.html) for more details.
+
+{% include_example java/org/apache/spark/examples/ml/JavaLDAExample.java %}
+</div>
+
+</div>
\ No newline at end of file