aboutsummaryrefslogtreecommitdiff
path: root/build
diff options
context:
space:
mode:
authorOliver Pierson <ocp@gatech.edu>2016-04-11 12:02:48 -0700
committerXiangrui Meng <meng@databricks.com>2016-04-11 12:02:48 -0700
commit89a41c5b7a3f727b44a7f615a1352ca006d12f73 (patch)
tree1c59e13c4fe03bbb0c5717f6c08311a2d2648da2 /build
parent2dacc81ec31233e558855a26340ad4662d470387 (diff)
downloadspark-89a41c5b7a3f727b44a7f615a1352ca006d12f73.tar.gz
spark-89a41c5b7a3f727b44a7f615a1352ca006d12f73.tar.bz2
spark-89a41c5b7a3f727b44a7f615a1352ca006d12f73.zip
[SPARK-13600][MLLIB] Use approxQuantile from DataFrame stats in QuantileDiscretizer
## What changes were proposed in this pull request? QuantileDiscretizer can return an unexpected number of buckets in certain cases. This PR proposes to fix this issue and also refactor QuantileDiscretizer to use approxQuantiles from DataFrame stats functions. ## How was this patch tested? QuantileDiscretizerSuite unit tests (some existing tests will change or even be removed in this PR) Author: Oliver Pierson <ocp@gatech.edu> Closes #11553 from oliverpierson/SPARK-13600.
Diffstat (limited to 'build')
0 files changed, 0 insertions, 0 deletions