aboutsummaryrefslogtreecommitdiff
path: root/python
diff options
context:
space:
mode:
authorXiangrui Meng <meng@databricks.com>2015-10-20 18:38:06 -0700
committerXiangrui Meng <meng@databricks.com>2015-10-20 18:38:06 -0700
commit135ade905089fe90d3cc9e84f8fedc7637e901d8 (patch)
treed07e65eac8197689b67c6f6e72f3f1f4472b6d91 /python
parent45861693bef2619196f0fbdf5c166ad3f9b1e8d1 (diff)
downloadspark-135ade905089fe90d3cc9e84f8fedc7637e901d8.tar.gz
spark-135ade905089fe90d3cc9e84f8fedc7637e901d8.tar.bz2
spark-135ade905089fe90d3cc9e84f8fedc7637e901d8.zip
[MINOR][ML] fix doc warnings
Without an empty line, sphinx will treat doctest as docstring. holdenk ~~~ /Users/meng/src/spark/python/pyspark/ml/feature.py:docstring of pyspark.ml.feature.CountVectorizer:3: ERROR: Undefined substitution referenced: "label|raw |vectors | +-----+---------------+-------------------------+ |0 |[a, b, c] |(3,[0,1,2],[1.0,1.0,1.0])". /Users/meng/src/spark/python/pyspark/ml/feature.py:docstring of pyspark.ml.feature.CountVectorizer:3: ERROR: Undefined substitution referenced: "1 |[a, b, b, c, a]|(3,[0,1,2],[2.0,2.0,1.0])". ~~~ Author: Xiangrui Meng <meng@databricks.com> Closes #9188 from mengxr/py-count-vec-doc-fix.
Diffstat (limited to 'python')
-rw-r--r--python/pyspark/ml/feature.py1
1 files changed, 1 insertions, 0 deletions
diff --git a/python/pyspark/ml/feature.py b/python/pyspark/ml/feature.py
index a4e60f916b..55bde6d0ea 100644
--- a/python/pyspark/ml/feature.py
+++ b/python/pyspark/ml/feature.py
@@ -178,6 +178,7 @@ class CountVectorizer(JavaEstimator, HasInputCol, HasOutputCol):
.. note:: Experimental
Extracts a vocabulary from document collections and generates a :py:attr:`CountVectorizerModel`.
+
>>> df = sqlContext.createDataFrame(
... [(0, ["a", "b", "c"]), (1, ["a", "b", "b", "c", "a"])],
... ["label", "raw"])