aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/mllib/clustering.py
diff options
context:
space:
mode:
authorzero323 <matthew.szymkiewicz@gmail.com>2016-01-06 11:58:33 -0800
committerJoseph K. Bradley <joseph@databricks.com>2016-01-06 11:58:33 -0800
commitfcd013cf70e7890aa25a8fe3cb6c8b36bf0e1f04 (patch)
tree48c21e17e12296e7724deadf1b129555c26a4348 /python/pyspark/mllib/clustering.py
parentea489f14f11b2fdfb44c86634d2e2c2167b6ea18 (diff)
downloadspark-fcd013cf70e7890aa25a8fe3cb6c8b36bf0e1f04.tar.gz
spark-fcd013cf70e7890aa25a8fe3cb6c8b36bf0e1f04.tar.bz2
spark-fcd013cf70e7890aa25a8fe3cb6c8b36bf0e1f04.zip
[SPARK-12006][ML][PYTHON] Fix GMM failure if initialModel is not None
If initial model passed to GMM is not empty it causes `net.razorvine.pickle.PickleException`. It can be fixed by converting `initialModel.weights` to `list`. Author: zero323 <matthew.szymkiewicz@gmail.com> Closes #9986 from zero323/SPARK-12006.
Diffstat (limited to 'python/pyspark/mllib/clustering.py')
-rw-r--r--python/pyspark/mllib/clustering.py2
1 files changed, 1 insertions, 1 deletions
diff --git a/python/pyspark/mllib/clustering.py b/python/pyspark/mllib/clustering.py
index c9e6f1dec6..48daa87e82 100644
--- a/python/pyspark/mllib/clustering.py
+++ b/python/pyspark/mllib/clustering.py
@@ -346,7 +346,7 @@ class GaussianMixture(object):
if initialModel.k != k:
raise Exception("Mismatched cluster count, initialModel.k = %s, however k = %s"
% (initialModel.k, k))
- initialModelWeights = initialModel.weights
+ initialModelWeights = list(initialModel.weights)
initialModelMu = [initialModel.gaussians[i].mu for i in range(initialModel.k)]
initialModelSigma = [initialModel.gaussians[i].sigma for i in range(initialModel.k)]
java_model = callMLlibFunc("trainGaussianMixtureModel", rdd.map(_convert_to_vector),