path: root/docs/spark-standalone.md
author: Matei Zaharia <matei@databricks.com> 2014-08-04 23:41:03 -0700
committer: Matei Zaharia <matei@databricks.com> 2014-08-04 23:41:03 -0700
commit: 4fde28c2063f673ec7f51d514ba62a73321960a1 (patch)
tree: 1125522786941e5a71ea2526df435bc9f6706405 /docs/spark-standalone.md
parent: 066765d60d21b6b9943862b788e4a4bd07396e6c (diff)
SPARK-2711. Create a ShuffleMemoryManager to track memory for all spilling collections
This tracks memory properly if there are multiple spilling collections in the same task (which was a problem before), and also implements an algorithm that lets each thread grow up to 1 / 2N of the memory pool (where N is the number of threads) before spilling, which avoids an inefficiency with small spills we had before (some threads would spill many times at 0-1 MB because the pool was allocated elsewhere).

Author: Matei Zaharia <matei@databricks.com>

Closes #1707 from mateiz/spark-2711 and squashes the following commits:

debf75b [Matei Zaharia] Review comments
24f28f3 [Matei Zaharia] Small rename
c8f3a8b [Matei Zaharia] Update ShuffleMemoryManager to be able to partially grant requests
315e3a5 [Matei Zaharia] Some review comments
b810120 [Matei Zaharia] Create central manager to track memory for all spilling collections
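The policy described in the commit message (a central pool, partial grants, and a 1 / 2N share each thread may reach before spilling) can be illustrated with a small model. This is a minimal sketch, not the code from this commit: the class `MemoryPoolSketch` and its methods `register`, `tryToAcquire`, `shouldSpill`, and `release` are hypothetical names chosen for illustration. It is single-threaded and does not model blocking, synchronization, or any per-thread upper bound.

```scala
import scala.collection.mutable

// Hypothetical, simplified model of the policy in the commit message.
// It is NOT Spark's ShuffleMemoryManager; all names here are assumptions.
final class MemoryPoolSketch(val maxMemory: Long) {
  // Bytes currently held by each registered thread (keyed by a thread id).
  private val used = mutable.Map.empty[Long, Long]

  /** Share a thread may reach before it is asked to spill: 1 / (2N) of the pool. */
  def minShare: Long =
    if (used.isEmpty) maxMemory else maxMemory / (2L * used.size)

  /** Start tracking a thread; N is the number of registered threads. */
  def register(threadId: Long): Unit =
    if (!used.contains(threadId)) used(threadId) = 0L

  /** Grants up to numBytes; the grant may be partial, or 0 if the pool is full. */
  def tryToAcquire(threadId: Long, numBytes: Long): Long = {
    require(numBytes > 0, "numBytes must be positive")
    register(threadId)
    val free = maxMemory - used.values.sum
    val granted = math.min(numBytes, free)
    used(threadId) += granted
    granted
  }

  /**
   * A thread is asked to spill only if its request fell short AND it already
   * holds at least its 1 / (2N) share. Below that share, this sketch expects
   * the caller to retry later instead of spilling a tiny buffer.
   */
  def shouldSpill(threadId: Long, requested: Long, granted: Long): Boolean =
    granted < requested && used.getOrElse(threadId, 0L) >= minShare

  /** Return all of a thread's memory to the pool (e.g. when its task ends). */
  def release(threadId: Long): Unit = used.remove(threadId)
}
```

With a pool of 1000 bytes and two threads, the share is 1000 / 4 = 250 bytes. If thread 1 holds 900 bytes and thread 2 asks for 200, thread 2 receives a partial grant of 100 and is not asked to spill, because it is still below 250. This is the small-spill case the commit message says it avoids. Thread 1, already above its share, is asked to spill when a further request cannot be met.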
Diffstat (limited to 'docs/spark-standalone.md')
0 files changed, 0 insertions, 0 deletions