diff options
author | Yanbo Liang <ybliang8@gmail.com> | 2016-04-25 12:08:43 -0700 |
---|---|---|
committer | Joseph K. Bradley <joseph@databricks.com> | 2016-04-25 12:08:43 -0700 |
commit | 425f6916462ca5d0595c61101d52686006ed6b8b (patch) | |
tree | 57c8247319bf4f01c62f90272d2260a18cb0905c /sbin/slaves.sh | |
parent | 88e54218d5f0a8696563813feb387c08ec6b13d5 (diff) | |
download | spark-425f6916462ca5d0595c61101d52686006ed6b8b.tar.gz spark-425f6916462ca5d0595c61101d52686006ed6b8b.tar.bz2 spark-425f6916462ca5d0595c61101d52686006ed6b8b.zip |
[SPARK-10574][ML][MLLIB] HashingTF supports MurmurHash3
## What changes were proposed in this pull request?
As the discussion at [SPARK-10574](https://issues.apache.org/jira/browse/SPARK-10574), ```HashingTF``` should support MurmurHash3 and make it as the default hash algorithm. We should also expose set/get API for ```hashAlgorithm```, then users can choose the hash method.
Note: The problem that ```mllib.feature.HashingTF``` behaves differently between Scala/Java and Python will be resolved in the followup work.
## How was this patch tested?
unit tests.
cc jkbradley MLnick
Author: Yanbo Liang <ybliang8@gmail.com>
Author: Joseph K. Bradley <joseph@databricks.com>
Closes #12498 from yanboliang/spark-10574.
Diffstat (limited to 'sbin/slaves.sh')
0 files changed, 0 insertions, 0 deletions