diff options
author | Matei Zaharia <matei@databricks.com> | 2014-08-18 10:45:24 -0700 |
---|---|---|
committer | Michael Armbrust <michael@databricks.com> | 2014-08-18 10:45:24 -0700 |
commit | 4bf3de71074053af94f077c99e9c65a1962739e1 (patch) | |
tree | a7bc1e6385a6b4ff28da44dfa8e414b537d5b626 /docs | |
parent | 6a13dca12fac06f3af892ffcc8922cc84f91b786 (diff) | |
download | spark-4bf3de71074053af94f077c99e9c65a1962739e1.tar.gz spark-4bf3de71074053af94f077c99e9c65a1962739e1.tar.bz2 spark-4bf3de71074053af94f077c99e9c65a1962739e1.zip |
[SPARK-3085] [SQL] Use compact data structures in SQL joins
This reuses the CompactBuffer from Spark Core to save memory and pointer
dereferences. I also tried AppendOnlyMap instead of java.util.HashMap
but unfortunately that slows things down because it seems to do more
equals() calls and the equals on GenericRow, and especially JoinedRow,
is pretty expensive.
Author: Matei Zaharia <matei@databricks.com>
Closes #1993 from mateiz/spark-3085 and squashes the following commits:
188221e [Matei Zaharia] Remove unneeded import
5f903ee [Matei Zaharia] [SPARK-3085] [SQL] Use compact data structures in SQL joins
Diffstat (limited to 'docs')
0 files changed, 0 insertions, 0 deletions