author | Joseph K. Bradley <joseph@databricks.com> | 2016-04-15 11:50:21 -0700 |
---|---|---|
committer | Joseph K. Bradley <joseph@databricks.com> | 2016-04-15 11:50:21 -0700 |
commit | d6ae7d4637d23c57c4eeab79d1177216f380ec9c (patch) | |
tree | 0779a958d56eb4d7664c5cf3fe238aa42b94a2af /python/pyspark/ml/feature.py | |
parent | 83af297ac42546580983f91079f74e3a4cf25050 (diff) | |
[SPARK-14665][ML][PYTHON] Fixed bug with StopWordsRemover default stopwords
## What changes were proposed in this pull request?
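The default stopwords in the Python `StopWordsRemover` were stored as a Java object (the value returned from the JVM by `StopWords.English()`). They are now converted to a Python list before being set as the default.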
## How was this patch tested?
Unit test which failed before the fix
Author: Joseph K. Bradley <joseph@databricks.com>
Closes #12422 from jkbradley/pyspark-stopwords.
Diffstat (limited to 'python/pyspark/ml/feature.py')
-rw-r--r-- | python/pyspark/ml/feature.py | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/python/pyspark/ml/feature.py b/python/pyspark/ml/feature.py
index 809a513316..0d8ef1297f 100644
--- a/python/pyspark/ml/feature.py
+++ b/python/pyspark/ml/feature.py
@@ -1765,7 +1765,7 @@ class StopWordsRemover(JavaTransformer, HasInputCol, HasOutputCol, JavaMLReadabl
         self._java_obj = self._new_java_obj("org.apache.spark.ml.feature.StopWordsRemover",
                                             self.uid)
         stopWordsObj = _jvm().org.apache.spark.ml.feature.StopWords
-        defaultStopWords = stopWordsObj.English()
+        defaultStopWords = list(stopWordsObj.English())
         self._setDefault(stopWords=defaultStopWords, caseSensitive=False)
         kwargs = self.__init__._input_kwargs
         self.setParams(**kwargs)
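
The effect of wrapping the JVM return value in `list(...)` can be illustrated without a running Spark session. The sketch below is only an analogy: `FakeJavaArray` is a hypothetical stand-in for the proxy object that comes back from the JVM, not a class from PySpark or Py4J, and the three words are made-up sample data. It shows that an iterable proxy is not a Python `list` and does not compare equal to one, while `list(proxy)` yields a plain Python list.

```python
class FakeJavaArray:
    """Hypothetical stand-in for a JVM proxy object (not a real Py4J class)."""

    def __init__(self, items):
        self._items = tuple(items)

    def __iter__(self):
        # Iterable, like a proxy over a Java array, but not a Python list.
        return iter(self._items)

    def __len__(self):
        return len(self._items)


# Made-up sample data standing in for the default English stopwords.
java_words = FakeJavaArray(["a", "an", "the"])

# The same conversion the patch applies: materialize into a Python list.
python_words = list(java_words)

is_list_before = isinstance(java_words, list)
is_list_after = isinstance(python_words, list)
equal_before = java_words == ["a", "an", "the"]
equal_after = python_words == ["a", "an", "the"]

print(is_list_before, is_list_after, equal_before, equal_after)
```

The commit message does not say how the bug surfaced, so the type and equality checks here are just one plausible way a non-list default could be observed from Python code.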