author | Yanbo Liang <ybliang8@gmail.com> | 2015-03-31 11:32:14 -0700 |
---|---|---|
committer | Joseph K. Bradley <joseph@databricks.com> | 2015-03-31 11:32:14 -0700 |
commit | b5bd75d90a761199c3f9cb583c1fe48c8fda7780 (patch) | |
tree | 8defa75fba18d3fbb223bc2d780d21d33d00424b /sql/hive | |
parent | 46de6c05e0619250346f0988e296849f8f93d2b1 (diff) | |
[SPARK-6255] [MLLIB] Support multiclass classification in Python API
Python API parity check for classification, plus multiclass classification support. The major disparities that need to be added to the Python API are:
```scala
LogisticRegressionWithLBFGS
    setNumClasses
    setValidateData
LogisticRegressionModel
    getThreshold
    numClasses
    numFeatures
SVMWithSGD
    setValidateData
SVMModel
    getThreshold
```
For users, the greatest benefit of this PR is that the Python API now supports multiclass classification.
Users can train a multiclass classification model and use it for prediction in PySpark.
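To make the prediction step concrete, here is a minimal pure-Python sketch of how a multinomial logistic regression model might pick a class. It is not the code from this PR. It assumes a pivot-class formulation in which class 0 has an implicit margin of 0 and each remaining class has its own weight row and intercept. The function name `predict_multiclass` and the example weights are hypothetical, chosen only for illustration.

```python
# Illustrative sketch only: a pure-Python stand-in, not the code from this PR.
from typing import Sequence


def predict_multiclass(weights: Sequence[Sequence[float]],
                       intercepts: Sequence[float],
                       features: Sequence[float]) -> int:
    """Return the predicted class label in 0..K-1.

    Assumption: class 0 is the pivot class with an implicit margin of 0,
    and row i of `weights` holds the coefficients for class i + 1.
    """
    best_class = 0
    max_margin = 0.0  # margin of the pivot class (class 0)
    for i, (row, b) in enumerate(zip(weights, intercepts)):
        if len(row) != len(features):
            raise ValueError("feature vector length does not match the model")
        # Linear margin for class i + 1: dot(row, features) + intercept.
        margin = sum(w * x for w, x in zip(row, features)) + b
        if margin > max_margin:
            max_margin = margin
            best_class = i + 1
    return best_class


# Hypothetical 3-class model over 2 features (values chosen for illustration).
weights = [[1.0, -1.0],   # class 1
           [-1.0, 1.0]]   # class 2
intercepts = [0.0, 0.0]

labels = [predict_multiclass(weights, intercepts, x)
          for x in ([2.0, 0.0], [0.0, 2.0], [0.0, 0.0])]
print(labels)
```

With PySpark available, the corresponding workflow would presumably be to train with `LogisticRegressionWithLBFGS.train(data, numClasses=3)` and call `predict` on the returned model. The exact keyword arguments should be checked against the PySpark version in use.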
Author: Yanbo Liang <ybliang8@gmail.com>
Closes #5137 from yanboliang/spark-6255 and squashes the following commits:
0bd531e [Yanbo Liang] address comments
444d5e2 [Yanbo Liang] LogisticRegressionModel.predict() optimization
fc7990b [Yanbo Liang] address comments
b0d9c63 [Yanbo Liang] Support Multinomial LR model predict in Python API
ded847c [Yanbo Liang] Python API parity check for classification (support multiclass classification)
Diffstat (limited to 'sql/hive')
0 files changed, 0 insertions, 0 deletions