author | Yuhao Yang <hhbyyh@gmail.com> | 2015-07-30 08:20:52 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2015-07-30 08:20:52 -0700 |
commit | 9c0501c5d04d83ca25ce433138bf64df6a14dc58 (patch) | |
tree | 6c69ce30a4619ce9d88e3b61c6336f16455feca0 /mllib/src | |
parent | d212a314227dec26c0dbec8ed3422d0ec8f818f9 (diff) | |
[SPARK-] [MLLIB] minor fix on tokenizer doc
A trivial fix for the doc comment of RegexTokenizer.
This may be too small to need its own issue, but I just noticed it and think it can be quite misleading. I can create a JIRA if necessary.
Author: Yuhao Yang <hhbyyh@gmail.com>
Closes #7791 from hhbyyh/docFix and squashes the following commits:
cdf2542 [Yuhao Yang] minor fix on tokenizer doc
Diffstat (limited to 'mllib/src')
-rw-r--r-- | mllib/src/main/scala/org/apache/spark/ml/feature/Tokenizer.scala | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/mllib/src/main/scala/org/apache/spark/ml/feature/Tokenizer.scala b/mllib/src/main/scala/org/apache/spark/ml/feature/Tokenizer.scala
index 0b3af4747e..248288ca73 100644
--- a/mllib/src/main/scala/org/apache/spark/ml/feature/Tokenizer.scala
+++ b/mllib/src/main/scala/org/apache/spark/ml/feature/Tokenizer.scala
@@ -50,7 +50,7 @@ class Tokenizer(override val uid: String) extends UnaryTransformer[String, Seq[S
 /**
  * :: Experimental ::
  * A regex based tokenizer that extracts tokens either by using the provided regex pattern to split
- * the text (default) or repeatedly matching the regex (if `gaps` is true).
+ * the text (default) or repeatedly matching the regex (if `gaps` is false).
  * Optional parameters also allow filtering tokens using a minimal length.
  * It returns an array of strings that can be empty.
  */
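The distinction the corrected comment draws can be illustrated with a minimal plain-Scala sketch. It does not use Spark, and `GapsSketch` and `tokenize` are hypothetical names introduced only for illustration; the sketch assumes that "splitting" corresponds to `Regex.split` and "repeatedly matching" to `Regex.findAllIn`, which is one reasonable reading of the doc comment rather than a copy of the Spark implementation.

```scala
import scala.util.matching.Regex

// Hypothetical helper, not Spark's API: mimics the two modes
// described in the RegexTokenizer doc comment.
object GapsSketch {
  def tokenize(text: String, pattern: String, gaps: Boolean): Seq[String] = {
    val re: Regex = pattern.r
    if (gaps) re.split(text).toSeq   // gaps = true: the pattern describes the separators
    else re.findAllIn(text).toSeq    // gaps = false: the pattern describes the tokens
  }

  def main(args: Array[String]): Unit = {
    val text = "a regex based tokenizer"
    // Split on runs of whitespace.
    println(tokenize(text, "\\s+", gaps = true))
    // Repeatedly match runs of word characters.
    println(tokenize(text, "\\w+", gaps = false))
  }
}
```

For this input both calls yield the same four tokens, which is why the wrong wording in the comment could mislead: the same pattern gives very different results depending on which mode `gaps` selects.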