[SPARK-12856] [SQL] speed up hashCode of unsafe array - spark

author	Wenchen Fan <wenchen@databricks.com>	2016-01-16 00:38:17 -0800
committer	Davies Liu <davies.liu@gmail.com>	2016-01-16 00:38:17 -0800
commit	2f7d0b68a29de9755fc9fafd9a52c048981ad880 (patch)
tree	7cda8a9bd00cf2a889d6ad5875b3f8d23e5b2275 /docs/ml-guide.md
parent	242efb7546084592a5e8122549a27117977303fb (diff)

[SPARK-12856] [SQL] speed up hashCode of unsafe array

Previously we iterated over the bytes one at a time to calculate the hashCode. Now that we have `Murmur3_x86_32.hashUnsafeBytes`, which doesn't require the bytes to be word-aligned, we should use that instead.

A simple benchmark shows it's about 3x faster. Benchmark code: https://gist.github.com/cloud-fan/fa77713ccebf0823b2ab#file-arrayhashbenchmark-scala

Author: Wenchen Fan <wenchen@databricks.com>

Closes #10784 from cloud-fan/array-hashcode.
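To illustrate the idea behind the change, here is a minimal, self-contained sketch contrasting a byte-at-a-time hash with a Murmur3-style hash that consumes four bytes per step and then handles any leftover bytes individually, so the input length need not be a multiple of the word size. This is not the code from the patch: the class `ArrayHashSketch`, both method names, and the starting value 37 are hypothetical. It operates on a plain `byte[]` rather than unsafe memory, and its output is not claimed to match `Murmur3_x86_32.hashUnsafeBytes`.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical illustration only; not Spark's implementation.
final class ArrayHashSketch {

    // Byte-at-a-time polynomial hash: one multiply-add per input byte.
    static int byteWiseHash(byte[] data) {
        int result = 37; // arbitrary starting value chosen for this sketch
        for (byte b : data) {
            result = 37 * result + b;
        }
        return result;
    }

    // Mixing steps using the constants of the Murmur3 x86_32 algorithm.
    private static int mixK1(int k1) {
        k1 *= 0xcc9e2d51;
        k1 = Integer.rotateLeft(k1, 15);
        k1 *= 0x1b873593;
        return k1;
    }

    private static int mixH1(int h1, int k1) {
        h1 ^= k1;
        h1 = Integer.rotateLeft(h1, 13);
        return h1 * 5 + 0xe6546b64;
    }

    private static int fmix(int h1, int length) {
        h1 ^= length;
        h1 ^= h1 >>> 16;
        h1 *= 0x85ebca6b;
        h1 ^= h1 >>> 13;
        h1 *= 0xc2b2ae35;
        h1 ^= h1 >>> 16;
        return h1;
    }

    // Word-at-a-time hash: mixes 4 bytes per iteration over the largest
    // prefix whose length is a multiple of 4, then mixes each remaining
    // byte on its own, so any input length is accepted.
    static int wordWiseHash(byte[] data, int seed) {
        ByteBuffer buf = ByteBuffer.wrap(data).order(ByteOrder.LITTLE_ENDIAN);
        int alignedLength = data.length - data.length % 4;
        int h1 = seed;
        for (int i = 0; i < alignedLength; i += 4) {
            h1 = mixH1(h1, mixK1(buf.getInt(i)));
        }
        for (int i = alignedLength; i < data.length; i++) {
            h1 = mixH1(h1, mixK1(data[i]));
        }
        return fmix(h1, data.length);
    }
}
```

The word-wise variant runs roughly a quarter as many loop iterations over the bulk of the input, which is the kind of saving the commit message attributes the speedup to. The sketch makes no performance claim of its own; the "about 3x" figure comes from the linked benchmark.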

Diffstat (limited to 'docs/ml-guide.md')

0 files changed, 0 insertions, 0 deletions

