diff options
author | Sean Owen <sowen@cloudera.com> | 2015-10-27 23:07:37 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2015-10-27 23:07:37 -0700 |
commit | 826e1e304b57abbc56b8b7ffd663d53942ab3c7c (patch) | |
tree | 379cecd7931154b2ce835302106139f06af613be /python/pyspark/mllib/clustering.py | |
parent | d9c6039897236c3f1e4503aa95c5c9b07b32eadd (diff) | |
download | spark-826e1e304b57abbc56b8b7ffd663d53942ab3c7c.tar.gz spark-826e1e304b57abbc56b8b7ffd663d53942ab3c7c.tar.bz2 spark-826e1e304b57abbc56b8b7ffd663d53942ab3c7c.zip |
[SPARK-11302][MLLIB] 2) Multivariate Gaussian Model with Covariance matrix returns incorrect answer in some cases
Fix computation of root-sigma-inverse in multivariate Gaussian; add a test and fix related Python mixture model test.
Supersedes https://github.com/apache/spark/pull/9293
Author: Sean Owen <sowen@cloudera.com>
Closes #9309 from srowen/SPARK-11302.2.
Diffstat (limited to 'python/pyspark/mllib/clustering.py')
-rw-r--r-- | python/pyspark/mllib/clustering.py | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/python/pyspark/mllib/clustering.py b/python/pyspark/mllib/clustering.py index c451df17cf..d1c3755a78 100644 --- a/python/pyspark/mllib/clustering.py +++ b/python/pyspark/mllib/clustering.py @@ -236,9 +236,9 @@ class GaussianMixtureModel(JavaModelWrapper, JavaSaveable, JavaLoader): >>> model = GaussianMixture.train(clusterdata_2, 2, convergenceTol=0.0001, ... maxIterations=150, seed=10) >>> labels = model.predict(clusterdata_2).collect() - >>> labels[0]==labels[1]==labels[2] + >>> labels[0]==labels[1] True - >>> labels[3]==labels[4] + >>> labels[2]==labels[3]==labels[4] True .. versionadded:: 1.3.0 |