From 170b0d4200067d9015a72f5e116b9fe6f8fb4cd1 Mon Sep 17 00:00:00 2001 From: Xiangrui Meng Date: Thu, 15 Jan 2015 02:38:52 +0000 Subject: update user guide to position spark.ml better (SPARK-5254) --- site/docs/1.2.0/ml-guide.html | 19 ++++++++++--------- site/docs/1.2.0/mllib-guide.html | 18 +++++++++++------- 2 files changed, 21 insertions(+), 16 deletions(-) (limited to 'site/docs/1.2.0') diff --git a/site/docs/1.2.0/ml-guide.html b/site/docs/1.2.0/ml-guide.html index d865c263c..654809e93 100644 --- a/site/docs/1.2.0/ml-guide.html +++ b/site/docs/1.2.0/ml-guide.html @@ -123,15 +123,16 @@

Spark ML Programming Guide

-

Spark ML is Spark’s new machine learning package. It is currently an alpha component but is potentially a successor to MLlib. The spark.ml package aims to replace the old APIs with a cleaner, more uniform set of APIs which will help users create full machine learning pipelines.

- -

MLlib vs. Spark ML:

- - +

spark.ml is a new package introduced in Spark 1.2, which aims to provide a uniform set of +high-level APIs that help users create and tune practical machine learning pipelines. +It is currently an alpha component, and we would like to hear back from the community about +how it fits real-world use cases and how it could be improved.

+ +

Note that we will keep supporting and adding features to spark.mllib along with the +development of spark.ml. +Users should be comfortable using spark.mllib features and expect more features coming. +Developers should contribute new algorithms to spark.mllib and can optionally contribute +to spark.ml.

Table of Contents

diff --git a/site/docs/1.2.0/mllib-guide.html b/site/docs/1.2.0/mllib-guide.html index 573d22ed2..fdee3fd75 100644 --- a/site/docs/1.2.0/mllib-guide.html +++ b/site/docs/1.2.0/mllib-guide.html @@ -175,16 +175,20 @@ filtering, dimensionality reduction, as well as underlying optimization primitiv The APIs marked Experimental/DeveloperApi may change in future releases, and the migration guide below will explain all changes between releases.

-

spark.ml: The New ML Package

+

spark.ml: high-level APIs for ML pipelines

-

Spark 1.2 includes a new machine learning package called spark.ml, currently an alpha component but potentially a successor to spark.mllib. The spark.ml package aims to replace the old APIs with a cleaner, more uniform set of APIs which will help users create full machine learning pipelines.

+

Spark 1.2 includes a new package called spark.ml, which aims to provide a uniform set of +high-level APIs that help users create and tune practical machine learning pipelines. +It is currently an alpha component, and we would like to hear back from the community about +how it fits real-world use cases and how it could be improved.

-

See the spark.ml programming guide for more information on this package.

- -

Users can use algorithms from either of the two packages, but APIs may differ. Currently, spark.ml offers a subset of the algorithms from spark.mllib.

+

Note that we will keep supporting and adding features to spark.mllib along with the +development of spark.ml. +Users should be comfortable using spark.mllib features and expect more features coming. +Developers should contribute new algorithms to spark.mllib and can optionally contribute +to spark.ml.

-

Developers should contribute new algorithms to spark.mllib and can optionally contribute to spark.ml. -See the spark.ml programming guide linked above for more details.

+

See the spark.ml programming guide for more information on this package.

Dependencies

-- cgit v1.2.3