aboutsummaryrefslogtreecommitdiff
path: root/common/network-shuffle
diff options
context:
space:
mode:
authorYanbo Liang <ybliang8@gmail.com>2016-04-25 12:08:43 -0700
committerJoseph K. Bradley <joseph@databricks.com>2016-04-25 12:08:43 -0700
commit425f6916462ca5d0595c61101d52686006ed6b8b (patch)
tree57c8247319bf4f01c62f90272d2260a18cb0905c /common/network-shuffle
parent88e54218d5f0a8696563813feb387c08ec6b13d5 (diff)
downloadspark-425f6916462ca5d0595c61101d52686006ed6b8b.tar.gz
spark-425f6916462ca5d0595c61101d52686006ed6b8b.tar.bz2
spark-425f6916462ca5d0595c61101d52686006ed6b8b.zip
[SPARK-10574][ML][MLLIB] HashingTF supports MurmurHash3
## What changes were proposed in this pull request? As the discussion at [SPARK-10574](https://issues.apache.org/jira/browse/SPARK-10574), ```HashingTF``` should support MurmurHash3 and make it as the default hash algorithm. We should also expose set/get API for ```hashAlgorithm```, then users can choose the hash method. Note: The problem that ```mllib.feature.HashingTF``` behaves differently between Scala/Java and Python will be resolved in the followup work. ## How was this patch tested? unit tests. cc jkbradley MLnick Author: Yanbo Liang <ybliang8@gmail.com> Author: Joseph K. Bradley <joseph@databricks.com> Closes #12498 from yanboliang/spark-10574.
Diffstat (limited to 'common/network-shuffle')
0 files changed, 0 insertions, 0 deletions