aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/mllib
diff options
context:
space:
mode:
authorSean Owen <sowen@cloudera.com>2015-10-27 23:07:37 -0700
committerXiangrui Meng <meng@databricks.com>2015-10-27 23:07:37 -0700
commit826e1e304b57abbc56b8b7ffd663d53942ab3c7c (patch)
tree379cecd7931154b2ce835302106139f06af613be /python/pyspark/mllib
parentd9c6039897236c3f1e4503aa95c5c9b07b32eadd (diff)
downloadspark-826e1e304b57abbc56b8b7ffd663d53942ab3c7c.tar.gz
spark-826e1e304b57abbc56b8b7ffd663d53942ab3c7c.tar.bz2
spark-826e1e304b57abbc56b8b7ffd663d53942ab3c7c.zip
[SPARK-11302][MLLIB] 2) Multivariate Gaussian Model with Covariance matrix returns incorrect answer in some cases
Fix computation of root-sigma-inverse in multivariate Gaussian; add a test and fix related Python mixture model test. Supersedes https://github.com/apache/spark/pull/9293 Author: Sean Owen <sowen@cloudera.com> Closes #9309 from srowen/SPARK-11302.2.
Diffstat (limited to 'python/pyspark/mllib')
-rw-r--r--python/pyspark/mllib/clustering.py4
1 files changed, 2 insertions, 2 deletions
diff --git a/python/pyspark/mllib/clustering.py b/python/pyspark/mllib/clustering.py
index c451df17cf..d1c3755a78 100644
--- a/python/pyspark/mllib/clustering.py
+++ b/python/pyspark/mllib/clustering.py
@@ -236,9 +236,9 @@ class GaussianMixtureModel(JavaModelWrapper, JavaSaveable, JavaLoader):
>>> model = GaussianMixture.train(clusterdata_2, 2, convergenceTol=0.0001,
... maxIterations=150, seed=10)
>>> labels = model.predict(clusterdata_2).collect()
- >>> labels[0]==labels[1]==labels[2]
+ >>> labels[0]==labels[1]
True
- >>> labels[3]==labels[4]
+ >>> labels[2]==labels[3]==labels[4]
True
.. versionadded:: 1.3.0