diff options
author | Xiangrui Meng <meng@databricks.com> | 2015-05-21 17:59:03 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2015-05-21 17:59:03 -0700 |
commit | f5db4b416c922db7a8f1b0c098b4f08647106231 (patch) | |
tree | 97ddc7467b77b169324bba8b0ec93c5d76005ca9 /.rat-excludes | |
parent | 17791a58159b3e4619d0367f54a4c5332342658b (diff) | |
download | spark-f5db4b416c922db7a8f1b0c098b4f08647106231.tar.gz spark-f5db4b416c922db7a8f1b0c098b4f08647106231.tar.bz2 spark-f5db4b416c922db7a8f1b0c098b4f08647106231.zip |
[SPARK-7794] [MLLIB] update RegexTokenizer default settings
The previous default is `{gaps: false, pattern: "\\p{L}+|[^\\p{L}\\s]+"}`. The default pattern is hard to understand. This PR changes the default to `{gaps: true, pattern: "\\s+"}`. jkbradley
Author: Xiangrui Meng <meng@databricks.com>
Closes #6330 from mengxr/SPARK-7794 and squashes the following commits:
5ee7cde [Xiangrui Meng] update RegexTokenizer default settings
Diffstat (limited to '.rat-excludes')
0 files changed, 0 insertions, 0 deletions