author: Xiangrui Meng <meng@databricks.com> 2015-08-31 15:49:25 -0700
committer: Xiangrui Meng <meng@databricks.com> 2015-08-31 15:49:25 -0700
commit: 23e39cc7b1bb7f1087c4706234c9b5165a571357
tree: 5780fc9bcf71cef79d053bfb34ec2f3e8f79b507 /python/pyspark
parent: a2d5c72091b1c602694dbca823a7b26f86b02864
[SPARK-9954] [MLLIB] use first 128 nonzeros to compute Vector.hashCode
This could help reduce hash collisions, e.g., in `RDD[Vector].repartition`.

jkbradley

Author: Xiangrui Meng <meng@databricks.com>

Closes #8182 from mengxr/SPARK-9954.
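
To illustrate the idea in the commit title, here is a minimal Python sketch of a hash that mixes in only the first 128 nonzero entries of a vector. It is not the committed code: the diffstat for this path shows no changes under python/pyspark, and the actual change is on the Scala side. The names `vector_hash` and `MAX_HASH_NNZ`, the multiply-by-31 mixing step, and the 32-bit wraparound are all assumptions chosen to resemble a typical JVM-style `hashCode`.

```python
import struct

MAX_HASH_NNZ = 128  # cap from the commit title; the constant name is hypothetical


def _to_int32(x):
    """Wrap an arbitrary Python int to a signed 32-bit value, like a JVM Int."""
    x &= 0xFFFFFFFF
    return x - 0x100000000 if x >= 0x80000000 else x


def _double_hash(value):
    """Fold the 64-bit IEEE 754 pattern of a float into 32 bits (assumed mixing)."""
    bits = struct.unpack(">Q", struct.pack(">d", value))[0]
    return (bits ^ (bits >> 32)) & 0xFFFFFFFF


def vector_hash(size, entries):
    """Hash a vector from its size and its (index, value) pairs.

    `entries` must be in increasing index order. Explicit zeros are skipped,
    so a dense and a sparse encoding of the same vector hash identically.
    Only the first MAX_HASH_NNZ nonzero entries contribute.
    """
    result = _to_int32(31 + size)
    nnz = 0
    for index, value in entries:
        if nnz >= MAX_HASH_NNZ:
            break  # stop early: later entries do not affect the hash
        if value != 0.0:
            result = _to_int32(31 * result + index)
            result = _to_int32(31 * result + _double_hash(value))
            nnz += 1
    return result


# Dense and sparse encodings of the same vector [1.0, 0.0, 3.0].
dense = list(enumerate([1.0, 0.0, 3.0]))
sparse = [(0, 1.0), (2, 3.0)]
same_hash = vector_hash(3, dense) == vector_hash(3, sparse)
```

In this sketch, hashing cost is bounded by the position of the 128th nonzero rather than by the vector length, while vectors that differ within their first 128 nonzeros still feed different values into the hash. Vectors that differ only after that point collide by construction, which is the trade-off implied by capping the number of entries.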
Diffstat (limited to 'python/pyspark')
0 files changed, 0 insertions, 0 deletions