vcluster
Script to manage a local, docker-based, Apache Spark cluster.
Requirements
- A copy of Spark's source code
- docker
Installation
Download the script and put it in your PATH.
curl https://raw.githubusercontent.com/jodersky/vcluster/master/vcluster -o ~/bin/vcluster && chmod 0755 ~/bin/vcluster
Usage
-
Create docker image from Spark's source
vcluster init <path to spark's source>
-
Spin up a "local cluster", consisting of several containers
vcluster start <number of workers, default 5>
-
Start spark-shell in a container
vcluster shell
-
Show the status of the cluster
vcluster list
-
Shut down the cluster
vcluster stop