diff options
author | Xiangrui Meng <meng@databricks.com> | 2015-08-31 15:49:25 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2015-08-31 15:49:25 -0700 |
commit | 23e39cc7b1bb7f1087c4706234c9b5165a571357 (patch) | |
tree | 5780fc9bcf71cef79d053bfb34ec2f3e8f79b507 /python/pyspark | |
parent | a2d5c72091b1c602694dbca823a7b26f86b02864 (diff) | |
download | spark-23e39cc7b1bb7f1087c4706234c9b5165a571357.tar.gz spark-23e39cc7b1bb7f1087c4706234c9b5165a571357.tar.bz2 spark-23e39cc7b1bb7f1087c4706234c9b5165a571357.zip |
[SPARK-9954] [MLLIB] use first 128 nonzeros to compute Vector.hashCode
This could help reduce hash collisions, e.g., in `RDD[Vector].repartition`. jkbradley
Author: Xiangrui Meng <meng@databricks.com>
Closes #8182 from mengxr/SPARK-9954.
Diffstat (limited to 'python/pyspark')
0 files changed, 0 insertions, 0 deletions