aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/mllib/feature.py
diff options
context:
space:
mode:
authorJoseph K. Bradley <joseph@databricks.com>2016-07-13 12:33:39 -0700
committerJoseph K. Bradley <joseph@databricks.com>2016-07-13 12:33:39 -0700
commit01f09b161217193b797c8c85969d17054c958615 (patch)
tree40d7d4f5932157f8e0f0c13228dd18063728d4d3 /python/pyspark/mllib/feature.py
parentd8220c1e5e94abbdb9643672b918f0d748206db9 (diff)
downloadspark-01f09b161217193b797c8c85969d17054c958615.tar.gz
spark-01f09b161217193b797c8c85969d17054c958615.tar.bz2
spark-01f09b161217193b797c8c85969d17054c958615.zip
[SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit for ML
## What changes were proposed in this pull request? General decisions to follow, except where noted: * spark.mllib, pyspark.mllib: Remove all Experimental annotations. Leave DeveloperApi annotations alone. * spark.ml, pyspark.ml ** Annotate Estimator-Model pairs of classes and companion objects the same way. ** For all algorithms marked Experimental with Since tag <= 1.6, remove Experimental annotation. ** For all algorithms marked Experimental with Since tag = 2.0, leave Experimental annotation. * DeveloperApi annotations are left alone, except where noted. * No changes to which types are sealed. Exceptions where I am leaving items Experimental in spark.ml, pyspark.ml, mainly because the items are new: * Model Summary classes * MLWriter, MLReader, MLWritable, MLReadable * Evaluator and subclasses: There is discussion of changes around evaluating multiple metrics at once for efficiency. * RFormula: Its behavior may need to change slightly to match R in edge cases. * AFTSurvivalRegression * MultilayerPerceptronClassifier DeveloperApi changes: * ml.tree.Node, ml.tree.Split, and subclasses should no longer be DeveloperApi ## How was this patch tested? N/A Note to reviewers: * spark.ml.clustering.LDA underwent significant changes (additional methods), so let me know if you want me to leave it Experimental. * Be careful to check for cases where a class should no longer be Experimental but has an Experimental method, val, or other feature. I did not find such cases, but please verify. Author: Joseph K. Bradley <joseph@databricks.com> Closes #14147 from jkbradley/experimental-audit.
Diffstat (limited to 'python/pyspark/mllib/feature.py')
-rw-r--r--python/pyspark/mllib/feature.py16
1 files changed, 0 insertions, 16 deletions
diff --git a/python/pyspark/mllib/feature.py b/python/pyspark/mllib/feature.py
index aef91a8ddc..c8a6e33f4d 100644
--- a/python/pyspark/mllib/feature.py
+++ b/python/pyspark/mllib/feature.py
@@ -60,8 +60,6 @@ class VectorTransformer(object):
class Normalizer(VectorTransformer):
"""
- .. note:: Experimental
-
Normalizes samples individually to unit L\ :sup:`p`\ norm
For any 1 <= `p` < float('inf'), normalizes samples using
@@ -131,8 +129,6 @@ class JavaVectorTransformer(JavaModelWrapper, VectorTransformer):
class StandardScalerModel(JavaVectorTransformer):
"""
- .. note:: Experimental
-
Represents a StandardScaler model that can transform vectors.
.. versionadded:: 1.2.0
@@ -207,8 +203,6 @@ class StandardScalerModel(JavaVectorTransformer):
class StandardScaler(object):
"""
- .. note:: Experimental
-
Standardizes features by removing the mean and scaling to unit
variance using column summary statistics on the samples in the
training set.
@@ -262,8 +256,6 @@ class StandardScaler(object):
class ChiSqSelectorModel(JavaVectorTransformer):
"""
- .. note:: Experimental
-
Represents a Chi Squared selector model.
.. versionadded:: 1.4.0
@@ -282,8 +274,6 @@ class ChiSqSelectorModel(JavaVectorTransformer):
class ChiSqSelector(object):
"""
- .. note:: Experimental
-
Creates a ChiSquared feature selector.
:param numTopFeatures: number of features that selector will select.
@@ -361,8 +351,6 @@ class PCA(object):
class HashingTF(object):
"""
- .. note:: Experimental
-
Maps a sequence of terms to their term frequencies using the hashing
trick.
@@ -448,8 +436,6 @@ class IDFModel(JavaVectorTransformer):
class IDF(object):
"""
- .. note:: Experimental
-
Inverse document frequency (IDF).
The standard formulation is used: `idf = log((m + 1) / (d(t) + 1))`,
@@ -697,8 +683,6 @@ class Word2Vec(object):
class ElementwiseProduct(VectorTransformer):
"""
- .. note:: Experimental
-
Scales each column of the vector, with the supplied weight vector.
i.e the elementwise product.