[SPARK-3085] [SQL] Use compact data structures in SQL joins - spark


author	Matei Zaharia <matei@databricks.com>	2014-08-18 10:45:24 -0700
committer	Michael Armbrust <michael@databricks.com>	2014-08-18 10:45:24 -0700
commit	4bf3de71074053af94f077c99e9c65a1962739e1 (patch)
tree	a7bc1e6385a6b4ff28da44dfa8e414b537d5b626 /docs
parent	6a13dca12fac06f3af892ffcc8922cc84f91b786 (diff)

[SPARK-3085] [SQL] Use compact data structures in SQL joins

This reuses the CompactBuffer from Spark Core to save memory and pointer dereferences. I also tried AppendOnlyMap instead of java.util.HashMap but unfortunately that slows things down because it seems to do more equals() calls and the equals on GenericRow, and especially JoinedRow, is pretty expensive.

Author: Matei Zaharia <matei@databricks.com>

Closes #1993 from mateiz/spark-3085 and squashes the following commits:

188221e [Matei Zaharia] Remove unneeded import
5f903ee [Matei Zaharia] [SPARK-3085] [SQL] Use compact data structures in SQL joins
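
For illustration only, the idea described in the message can be sketched as a hash-join build table that maps each join key to a compact per-key buffer inside a java.util.HashMap. This is not the code from this commit. `SmallBuffer`, `HashJoinSketch`, `buildTable`, `innerJoin`, and the `Row = Seq[Any]` alias are hypothetical names; `SmallBuffer` is a simplified stand-in that assumes the general approach of Spark Core's CompactBuffer (keep the first couple of values in plain fields so that keys with few matches avoid a separate backing array).

```scala
import java.util.{HashMap => JavaHashMap}
import scala.collection.mutable.ArrayBuffer

// Hypothetical, simplified stand-in for a compact append-only buffer.
// The first two values live in fields; an overflow array is allocated
// only when a third value is appended.
final class SmallBuffer[T] {
  private var e0: T = _
  private var e1: T = _
  private var rest: ArrayBuffer[T] = null
  private var n = 0

  def +=(value: T): this.type = {
    if (n == 0) {
      e0 = value
    } else if (n == 1) {
      e1 = value
    } else {
      if (rest == null) rest = new ArrayBuffer[T]()
      rest.append(value)
    }
    n += 1
    this
  }

  def size: Int = n

  def apply(i: Int): T = {
    if (i < 0 || i >= n) throw new IndexOutOfBoundsException(i.toString)
    if (i == 0) e0 else if (i == 1) e1 else rest(i - 2)
  }

  def iterator: Iterator[T] = Iterator.range(0, n).map(i => apply(i))
}

object HashJoinSketch {
  // Assumption: a row is modeled as a plain Seq[Any], not Spark's Row class.
  type Row = Seq[Any]

  // Build side: group rows by join key in a java.util.HashMap.
  def buildTable(build: Iterator[Row], key: Row => Row): JavaHashMap[Row, SmallBuffer[Row]] = {
    val table = new JavaHashMap[Row, SmallBuffer[Row]]()
    build.foreach { row =>
      val k = key(row)
      var bucket = table.get(k)
      if (bucket == null) {
        bucket = new SmallBuffer[Row]
        table.put(k, bucket)
      }
      bucket += row
    }
    table
  }

  // Probe side: emit one concatenated row per match (inner join).
  def innerJoin(
      stream: Iterator[Row],
      key: Row => Row,
      table: JavaHashMap[Row, SmallBuffer[Row]]): Iterator[Row] = {
    stream.flatMap { row =>
      val bucket = table.get(key(row))
      if (bucket == null) Iterator.empty else bucket.iterator.map(b => row ++ b)
    }
  }
}
```

In this sketch a key with one or two matching rows costs a single small object and no array, which is the kind of saving in memory and pointer dereferences the message refers to. The equals() cost mentioned for AppendOnlyMap is not modeled here.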

Diffstat (limited to 'docs')

0 files changed, 0 insertions, 0 deletions

