diff options
author | Peng, Meng <peng.meng@intel.com> | 2016-09-21 10:17:38 +0100 |
---|---|---|
committer | Sean Owen <sowen@cloudera.com> | 2016-09-21 10:17:38 +0100 |
commit | b366f18496e1ce8bd20fe58a0245ef7d91819a03 (patch) | |
tree | 3a2e189a94ad1fc49040d721eb586724bc493097 /project/MimaExcludes.scala | |
parent | 28fafa3ee8f3478fa441e7bd6c8fd4ab482ca98e (diff) | |
download | spark-b366f18496e1ce8bd20fe58a0245ef7d91819a03.tar.gz spark-b366f18496e1ce8bd20fe58a0245ef7d91819a03.tar.bz2 spark-b366f18496e1ce8bd20fe58a0245ef7d91819a03.zip |
[SPARK-17017][MLLIB][ML] add a chiSquare Selector based on False Positive Rate (FPR) test
## What changes were proposed in this pull request?
Univariate feature selection works by selecting the best features based on univariate statistical tests. False Positive Rate (FPR) is a popular univariate statistical test for feature selection. We add a chiSquare Selector based on False Positive Rate (FPR) test in this PR, like it is implemented in scikit-learn.
http://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection
## How was this patch tested?
Add Scala ut
Author: Peng, Meng <peng.meng@intel.com>
Closes #14597 from mpjlu/fprChiSquare.
Diffstat (limited to 'project/MimaExcludes.scala')
-rw-r--r-- | project/MimaExcludes.scala | 3 |
1 files changed, 3 insertions, 0 deletions
diff --git a/project/MimaExcludes.scala b/project/MimaExcludes.scala index d4cbf510b9..f13f3ff789 100644 --- a/project/MimaExcludes.scala +++ b/project/MimaExcludes.scala @@ -815,6 +815,9 @@ object MimaExcludes { ) ++ Seq( // [SPARK-17163] Unify logistic regression interface. Private constructor has new signature. ProblemFilters.exclude[DirectMissingMethodProblem]("org.apache.spark.ml.classification.LogisticRegressionModel.this") + ) ++ Seq( + // [SPARK-17017] Add chiSquare selector based on False Positive Rate (FPR) test + ProblemFilters.exclude[DirectMissingMethodProblem]("org.apache.spark.mllib.feature.ChiSqSelectorModel.isSorted") ) } |