diff options
author | Sean Owen <sowen@cloudera.com> | 2014-07-13 19:27:43 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2014-07-13 19:27:43 -0700 |
commit | 635888cbed0e3f4127252fb84db449f0cc9ed659 (patch) | |
tree | 43433e3393c889f25a8ef4898099664a1a5ce0a7 /docs/mllib-decision-tree.md | |
parent | 4c8be64e768fe71643b37f1e82f619c8aeac6eff (diff) | |
download | spark-635888cbed0e3f4127252fb84db449f0cc9ed659.tar.gz spark-635888cbed0e3f4127252fb84db449f0cc9ed659.tar.bz2 spark-635888cbed0e3f4127252fb84db449f0cc9ed659.zip |
SPARK-2363. Clean MLlib's sample data files
(Just made a PR for this, mengxr was the reporter of:)
MLlib has sample data under serveral folders:
1) data/mllib
2) data/
3) mllib/data/*
Per previous discussion with Matei Zaharia, we want to put them under `data/mllib` and clean outdated files.
Author: Sean Owen <sowen@cloudera.com>
Closes #1394 from srowen/SPARK-2363 and squashes the following commits:
54313dd [Sean Owen] Move ML example data from /mllib/data/ and /data/ into /data/mllib/
Diffstat (limited to 'docs/mllib-decision-tree.md')
-rw-r--r-- | docs/mllib-decision-tree.md | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/docs/mllib-decision-tree.md b/docs/mllib-decision-tree.md index 3002a66a4f..9cd768599e 100644 --- a/docs/mllib-decision-tree.md +++ b/docs/mllib-decision-tree.md @@ -122,7 +122,7 @@ import org.apache.spark.mllib.tree.configuration.Algo._ import org.apache.spark.mllib.tree.impurity.Gini // Load and parse the data file -val data = sc.textFile("mllib/data/sample_tree_data.csv") +val data = sc.textFile("data/mllib/sample_tree_data.csv") val parsedData = data.map { line => val parts = line.split(',').map(_.toDouble) LabeledPoint(parts(0), Vectors.dense(parts.tail)) @@ -161,7 +161,7 @@ import org.apache.spark.mllib.tree.configuration.Algo._ import org.apache.spark.mllib.tree.impurity.Variance // Load and parse the data file -val data = sc.textFile("mllib/data/sample_tree_data.csv") +val data = sc.textFile("data/mllib/sample_tree_data.csv") val parsedData = data.map { line => val parts = line.split(',').map(_.toDouble) LabeledPoint(parts(0), Vectors.dense(parts.tail)) |