aboutsummaryrefslogtreecommitdiff
path: root/docs/configuration.md
diff options
context:
space:
mode:
authorAlexander Ulanov <nashb@yandex.ru>2015-02-02 12:13:05 -0800
committerXiangrui Meng <meng@databricks.com>2015-02-02 12:13:05 -0800
commitc081b21b1fe4fbad845088c4144da0bd2a8d89dc (patch)
treec509dfa59591bf5ec56cf26a48ab8a62e6df4a51 /docs/configuration.md
parent6f341310bf1fa59a28c96d123fa59e12b9366b68 (diff)
downloadspark-c081b21b1fe4fbad845088c4144da0bd2a8d89dc.tar.gz
spark-c081b21b1fe4fbad845088c4144da0bd2a8d89dc.tar.bz2
spark-c081b21b1fe4fbad845088c4144da0bd2a8d89dc.zip
[MLLIB] SPARK-5491 (ex SPARK-1473): Chi-square feature selection
The following is implemented: 1) generic traits for feature selection and filtering 2) trait for feature selection of LabeledPoint with discrete data 3) traits for calculation of contingency table and chi squared 4) class for chi-squared feature selection 5) tests for the above Needs some optimization in matrix operations. This request is a try to implement feature selection for MLLIB, the previous work by the issue author izendejas was not finished (https://issues.apache.org/jira/browse/SPARK-1473). This request is also related to data discretization issues: https://issues.apache.org/jira/browse/SPARK-1303 and https://issues.apache.org/jira/browse/SPARK-1216 that weren't merged. Author: Alexander Ulanov <nashb@yandex.ru> Closes #1484 from avulanov/featureselection and squashes the following commits: 755d358 [Alexander Ulanov] Addressing reviewers comments @mengxr a6ad82a [Alexander Ulanov] Addressing reviewers comments @mengxr 714b878 [Alexander Ulanov] Addressing reviewers comments @mengxr 010acff [Alexander Ulanov] Rebase 427ca4e [Alexander Ulanov] Addressing reviewers comments: implement VectorTransformer interface, use Statistics.chiSqTest f9b070a [Alexander Ulanov] Adding Apache header in tests... 80363ca [Alexander Ulanov] Tests, comments, apache headers and scala style 150a3e0 [Alexander Ulanov] Scala style fix f356365 [Alexander Ulanov] Chi Squared by contingency table. Refactoring 2bacdc7 [Alexander Ulanov] Combinations and chi-squared values test 66e0333 [Alexander Ulanov] Feature selector, fix of lazyness aab9b73 [Alexander Ulanov] Feature selection redesign with vigdorchik e24eee4 [Alexander Ulanov] Traits for FeatureSelection, CombinationsCalculator and FeatureFilter ca49e80 [Alexander Ulanov] Feature selection filter 2ade254 [Alexander Ulanov] Code style 0bd8434 [Alexander Ulanov] Chi Squared feature selection: initial version
Diffstat (limited to 'docs/configuration.md')
0 files changed, 0 insertions, 0 deletions