diff options
author | wm624@hotmail.com <wm624@hotmail.com> | 2016-05-27 20:59:24 -0500 |
---|---|---|
committer | Sean Owen <sowen@cloudera.com> | 2016-05-27 20:59:24 -0500 |
commit | 5d4dafe8fdea49dcbd6b0e4c23e3791fa30c8911 (patch) | |
tree | 57f130594c229600e6f392c8f1b76012a5bd5ddd /examples/src/main/java/org/apache | |
parent | 4a2fb8b87ca4517e0f4a1d7a1a1b3c08c1c1294d (diff) | |
download | spark-5d4dafe8fdea49dcbd6b0e4c23e3791fa30c8911.tar.gz spark-5d4dafe8fdea49dcbd6b0e4c23e3791fa30c8911.tar.bz2 spark-5d4dafe8fdea49dcbd6b0e4c23e3791fa30c8911.zip |
[SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue
## What changes were proposed in this pull request?
(Please fill in changes proposed in this fix)
In the MLLib naivebayes example, scala and python example doesn't use libsvm data, but Java does.
I make changes in scala and python example to use the libsvm data as the same as Java example.
## How was this patch tested?
Manual tests
Author: wm624@hotmail.com <wm624@hotmail.com>
Closes #13301 from wangmiao1981/example.
Diffstat (limited to 'examples/src/main/java/org/apache')
-rw-r--r-- | examples/src/main/java/org/apache/spark/examples/mllib/JavaNaiveBayesExample.java | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/examples/src/main/java/org/apache/spark/examples/mllib/JavaNaiveBayesExample.java b/examples/src/main/java/org/apache/spark/examples/mllib/JavaNaiveBayesExample.java index 2b17dbb963..f4ec04b0c6 100644 --- a/examples/src/main/java/org/apache/spark/examples/mllib/JavaNaiveBayesExample.java +++ b/examples/src/main/java/org/apache/spark/examples/mllib/JavaNaiveBayesExample.java @@ -36,9 +36,9 @@ public class JavaNaiveBayesExample { SparkConf sparkConf = new SparkConf().setAppName("JavaNaiveBayesExample"); JavaSparkContext jsc = new JavaSparkContext(sparkConf); // $example on$ - String path = "data/mllib/sample_naive_bayes_data.txt"; + String path = "data/mllib/sample_libsvm_data.txt"; JavaRDD<LabeledPoint> inputData = MLUtils.loadLibSVMFile(jsc.sc(), path).toJavaRDD(); - JavaRDD<LabeledPoint>[] tmp = inputData.randomSplit(new double[]{0.6, 0.4}, 12345); + JavaRDD<LabeledPoint>[] tmp = inputData.randomSplit(new double[]{0.6, 0.4}); JavaRDD<LabeledPoint> training = tmp[0]; // training set JavaRDD<LabeledPoint> test = tmp[1]; // test set final NaiveBayesModel model = NaiveBayes.train(training.rdd(), 1.0); |