aboutsummaryrefslogtreecommitdiff
path: root/data/mllib
diff options
context:
space:
mode:
authorJoseph K. Bradley <joseph.kurata.bradley@gmail.com>2014-07-17 15:05:02 -0700
committerXiangrui Meng <meng@databricks.com>2014-07-17 15:05:02 -0700
commit935fe65ff6559a0e3b481e7508fa14337b23020b (patch)
treeee298094fca9a7aead7c7e3f01abdd952ddc4845 /data/mllib
parent1fcd5dcdd8edb0e6989278c95e7f2c7d86c4efb2 (diff)
downloadspark-935fe65ff6559a0e3b481e7508fa14337b23020b.tar.gz
spark-935fe65ff6559a0e3b481e7508fa14337b23020b.tar.bz2
spark-935fe65ff6559a0e3b481e7508fa14337b23020b.zip
SPARK-1215 [MLLIB]: Clustering: Index out of bounds error (2)
Added check to LocalKMeans.scala: kMeansPlusPlus initialization to handle case with fewer distinct data points than clusters k. Added two related unit tests to KMeansSuite. (Re-submitting PR after tangling commits in PR 1407 https://github.com/apache/spark/pull/1407 ) Author: Joseph K. Bradley <joseph.kurata.bradley@gmail.com> Closes #1468 from jkbradley/kmeans-fix and squashes the following commits: 4e9bd1e [Joseph K. Bradley] Updated PR per comments from mengxr 6c7a2ec [Joseph K. Bradley] Added check to LocalKMeans.scala: kMeansPlusPlus initialization to handle case with fewer distinct data points than clusters k. Added two related unit tests to KMeansSuite.
Diffstat (limited to 'data/mllib')
0 files changed, 0 insertions, 0 deletions