diff options
author | Erik Selin <erik.selin@jadedpixel.com> | 2014-06-03 13:31:16 -0700 |
---|---|---|
committer | Matei Zaharia <matei@databricks.com> | 2014-06-03 13:31:16 -0700 |
commit | 8edc9d0330c94b50e01956ae88693cff4e0977b2 (patch) | |
tree | 6adf5d8d5e416f5ce80647006f434bbd2802fa18 /examples | |
parent | b1f285359a12345d94ae59f979d9a75482e732ce (diff) | |
download | spark-8edc9d0330c94b50e01956ae88693cff4e0977b2.tar.gz spark-8edc9d0330c94b50e01956ae88693cff4e0977b2.tar.bz2 spark-8edc9d0330c94b50e01956ae88693cff4e0977b2.zip |
[SPARK-1468] Modify the partition function used by partitionBy.
Make partitionBy use a tweaked version of hash as its default partition function
since the python hash function does not consistently assign the same value
to None across python processes.
Associated JIRA at https://issues.apache.org/jira/browse/SPARK-1468
Author: Erik Selin <erik.selin@jadedpixel.com>
Closes #371 from tyro89/consistent_hashing and squashes the following commits:
201c301 [Erik Selin] Make partitionBy use a tweaked version of hash as its default partition function since the python hash function does not consistently assign the same value to None across python processes.
Diffstat (limited to 'examples')
0 files changed, 0 insertions, 0 deletions