aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/ml/param/_shared_params_code_gen.py
diff options
context:
space:
mode:
authorYanbo Liang <ybliang8@gmail.com>2015-09-10 20:43:38 -0700
committerXiangrui Meng <meng@databricks.com>2015-09-10 20:43:38 -0700
commita140dd77c62255d6f7f6817a2517d47feb8540d4 (patch)
tree2fb9533b8e36ab564baeb1ec0aa3e368906e9fa0 /python/pyspark/ml/param/_shared_params_code_gen.py
parent339a527141984bfb182862b0987d3c4690c9ede1 (diff)
downloadspark-a140dd77c62255d6f7f6817a2517d47feb8540d4.tar.gz
spark-a140dd77c62255d6f7f6817a2517d47feb8540d4.tar.bz2
spark-a140dd77c62255d6f7f6817a2517d47feb8540d4.zip
[SPARK-10027] [ML] [PySpark] Add Python API missing methods for ml.feature
Missing method of ml.feature are listed here: ```StringIndexer``` lacks of parameter ```handleInvalid```. ```StringIndexerModel``` lacks of method ```labels```. ```VectorIndexerModel``` lacks of methods ```numFeatures``` and ```categoryMaps```. Author: Yanbo Liang <ybliang8@gmail.com> Closes #8313 from yanboliang/spark-10027.
Diffstat (limited to 'python/pyspark/ml/param/_shared_params_code_gen.py')
-rw-r--r--python/pyspark/ml/param/_shared_params_code_gen.py5
1 files changed, 4 insertions, 1 deletions
diff --git a/python/pyspark/ml/param/_shared_params_code_gen.py b/python/pyspark/ml/param/_shared_params_code_gen.py
index 69efc424ec..926375e448 100644
--- a/python/pyspark/ml/param/_shared_params_code_gen.py
+++ b/python/pyspark/ml/param/_shared_params_code_gen.py
@@ -121,7 +121,10 @@ if __name__ == "__main__":
("checkpointInterval", "checkpoint interval (>= 1)", None),
("seed", "random seed", "hash(type(self).__name__)"),
("tol", "the convergence tolerance for iterative algorithms", None),
- ("stepSize", "Step size to be used for each iteration of optimization.", None)]
+ ("stepSize", "Step size to be used for each iteration of optimization.", None),
+ ("handleInvalid", "how to handle invalid entries. Options are skip (which will filter " +
+ "out rows with bad values), or error (which will throw an errror). More options may be " +
+ "added later.", None)]
code = []
for name, doc, defaultValueStr in shared:
param_code = _gen_param_header(name, doc, defaultValueStr)