aboutsummaryrefslogtreecommitdiff
path: root/yarn
diff options
context:
space:
mode:
authorXiangrui Meng <meng@databricks.com>2015-05-21 17:59:03 -0700
committerXiangrui Meng <meng@databricks.com>2015-05-21 17:59:03 -0700
commitf5db4b416c922db7a8f1b0c098b4f08647106231 (patch)
tree97ddc7467b77b169324bba8b0ec93c5d76005ca9 /yarn
parent17791a58159b3e4619d0367f54a4c5332342658b (diff)
downloadspark-f5db4b416c922db7a8f1b0c098b4f08647106231.tar.gz
spark-f5db4b416c922db7a8f1b0c098b4f08647106231.tar.bz2
spark-f5db4b416c922db7a8f1b0c098b4f08647106231.zip
[SPARK-7794] [MLLIB] update RegexTokenizer default settings
The previous default is `{gaps: false, pattern: "\\p{L}+|[^\\p{L}\\s]+"}`. The default pattern is hard to understand. This PR changes the default to `{gaps: true, pattern: "\\s+"}`. jkbradley Author: Xiangrui Meng <meng@databricks.com> Closes #6330 from mengxr/SPARK-7794 and squashes the following commits: 5ee7cde [Xiangrui Meng] update RegexTokenizer default settings
Diffstat (limited to 'yarn')
0 files changed, 0 insertions, 0 deletions