[SPARK-7795] [CORE] Speed up task scheduling in standalone mode by reusing serializer - spark

diff options

author	Akshat Aranya <aaranya@quantcast.com>	2015-05-22 22:03:31 -0700
committer	Josh Rosen <joshrosen@databricks.com>	2015-05-22 22:03:31 -0700
commit	a16357413d2823bcc1d1bf55b4da191dc9b1b69a (patch)
tree	55fa8fa2bc4e33267a29d9962ec3af65d70567be /streaming
parent	63a5ce75eac48a297751ac505d70ce4d47daf903 (diff)
download	spark-a16357413d2823bcc1d1bf55b4da191dc9b1b69a.tar.gz spark-a16357413d2823bcc1d1bf55b4da191dc9b1b69a.tar.bz2 spark-a16357413d2823bcc1d1bf55b4da191dc9b1b69a.zip

[SPARK-7795] [CORE] Speed up task scheduling in standalone mode by reusing serializer

My experiments with scheduling very short tasks in standalone cluster mode indicated that a significant amount of time was being spent in scheduling the tasks (>500ms for 256 tasks). I found that most of the time was being spent in creating a new instance of serializer for each task. Changing this to just one serializer brought down the scheduling time to 8ms. Author: Akshat Aranya <aaranya@quantcast.com> Closes #6323 from coolfrood/master and squashes the following commits: 12d8c9e [Akshat Aranya] Reduce visibility of serializer bd4a5dd [Akshat Aranya] Style fix 0b8ca93 [Akshat Aranya] Incorporate review comments fe530cd [Akshat Aranya] Speed up task scheduling in standalone mode by reusing serializer instead of creating a new one for each task.

Diffstat (limited to 'streaming')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: