Standalone Scheduler fault tolerance using ZooKeeper

This patch implements full distributed fault tolerance for standalone scheduler Masters. There is only one master Leader at a time, which is actively serving scheduling requests. If this Leader crashes, another master will eventually be elected, reconstruct the state from the first Master, and continue serving scheduling requests.

Leader election is performed using the ZooKeeper leader election pattern. We try to minimize the use of ZooKeeper and the assumptions about ZooKeeper's behavior, so there is a layer of retries and session monitoring on top of the ZooKeeper client.

Master failover follows directly from the single-node Master recovery via the file system (patch 194ba4b8), except that the Master state is stored in ZooKeeper instead.

Configuration:
By default, no recovery mechanism is enabled (spark.deploy.recoveryMode = NONE).
Setting spark.deploy.recoveryMode to ZOOKEEPER and spark.deploy.zookeeper.url to an appropriate ZooKeeper URL enables ZooKeeper recovery mode.
Setting spark.deploy.recoveryMode to FILESYSTEM and spark.deploy.recoveryDirectory to an appropriate directory accessible by the Master keeps the behavior from 194ba4b8.

Additionally, places where a Master could be specified by a spark:// url can now take comma-delimited lists to specify backup masters. Note that this is only used for registration of NEW Workers and application Clients. Once a Worker or Client has registered with the Master Leader, it is "in the system" and will never need to register again.

Forthcoming: Documentation, tests (! - only ad hoc testing has been performed so far)

I do not intend for this commit to be merged until tests are added, but this patch should still be mostly reviewable until then.
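
To make the three recovery modes in the Configuration section concrete, here is a minimal sketch of how the properties fit together. It is not code from this patch: the object RecoveryConfigSketch, its types, and the host names in the usage note are hypothetical, and it assumes the properties arrive as a plain key/value map. Only the property names and the values NONE, ZOOKEEPER and FILESYSTEM come from the commit message.

```scala
// Hypothetical helper, NOT part of this patch: validates the recovery
// properties named in the commit message against a plain key/value map.
object RecoveryConfigSketch {
  sealed trait RecoveryMode
  case object NoRecovery extends RecoveryMode
  final case class ZooKeeperRecovery(url: String) extends RecoveryMode
  final case class FileSystemRecovery(directory: String) extends RecoveryMode

  def parse(props: Map[String, String]): Either[String, RecoveryMode] =
    // The commit message states that NONE is the default mode.
    props.getOrElse("spark.deploy.recoveryMode", "NONE") match {
      case "NONE" =>
        Right(NoRecovery)
      case "ZOOKEEPER" =>
        // ZOOKEEPER mode additionally needs a ZooKeeper URL.
        props.get("spark.deploy.zookeeper.url")
          .map(url => ZooKeeperRecovery(url): RecoveryMode)
          .toRight("spark.deploy.zookeeper.url is required for ZOOKEEPER mode")
      case "FILESYSTEM" =>
        // FILESYSTEM mode additionally needs a recovery directory.
        props.get("spark.deploy.recoveryDirectory")
          .map(dir => FileSystemRecovery(dir): RecoveryMode)
          .toRight("spark.deploy.recoveryDirectory is required for FILESYSTEM mode")
      case other =>
        Left(s"unknown spark.deploy.recoveryMode: $other")
    }
}
```

For example, RecoveryConfigSketch.parse(Map("spark.deploy.recoveryMode" -> "ZOOKEEPER", "spark.deploy.zookeeper.url" -> "zk1:2181,zk2:2181")) yields a ZooKeeperRecovery value, where zk1 and zk2 are placeholder host names. Returning Either keeps a missing companion property visible as an error instead of silently falling back to NONE.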
author: Aaron Davidson <aaron@databricks.com> 2013-09-19 14:40:14 -0700
committer: Aaron Davidson <aaron@databricks.com> 2013-09-26 15:04:23 -0700
commit: f549ea33d3d5a584f5d9965bb8e56462a1d6528e (patch)
tree: 22d9e70e68aef097a48e6a5efc5958a3acb20b1b /project/SparkBuild.scala
parent: d5a96feccb15dd290b282af9e2f94479c8e4554e (diff)
download: spark-f549ea33d3d5a584f5d9965bb8e56462a1d6528e.tar.gz
spark-f549ea33d3d5a584f5d9965bb8e56462a1d6528e.tar.bz2
spark-f549ea33d3d5a584f5d9965bb8e56462a1d6528e.zip
1 files changed, 1 insertions, 0 deletions
diff --git a/project/SparkBuild.scala b/project/SparkBuild.scala
index 99cdadb9e7..156f501a04 100644
--- a/project/SparkBuild.scala
+++ b/project/SparkBuild.scala
@@ -211,6 +211,7 @@ object SparkBuild extends Build {
       "net.java.dev.jets3t" % "jets3t" % "0.7.1",
       "org.apache.avro" % "avro" % "1.7.4",
       "org.apache.avro" % "avro-ipc" % "1.7.4" excludeAll(excludeNetty),
+      "org.apache.zookeeper" % "zookeeper" % "3.4.5" excludeAll(excludeNetty),
       "com.codahale.metrics" % "metrics-core" % "3.0.0",
       "com.codahale.metrics" % "metrics-jvm" % "3.0.0",
       "com.codahale.metrics" % "metrics-json" % "3.0.0",