author: Andrew Or <andrew@databricks.com> 2014-12-19 19:36:20 -0800
committer: Andrew Or <andrew@databricks.com> 2014-12-19 19:36:20 -0800
commit: 15c03e1e0efac29855f32984da7c6b0321f0e37a
tree: a3c1961f9140d33e5c26820304a952a81f154808 /docs/configuration.md
parent: 7cb3f54793124c527d62906c565aba2c3544e422
[SPARK-4140] Document dynamic allocation

Once the external shuffle service is also documented, the dynamic allocation section will link to it. Let me know if the whole dynamic allocation should be moved to its separate page; I personally think the organization might be cleaner that way.

This patch builds on top of oza's work in #3689.

aarondav pwendell

Author: Andrew Or <andrew@databricks.com>
Author: Tsuyoshi Ozawa <ozawa.tsuyoshi@gmail.com>

Closes #3731 from andrewor14/document-dynamic-allocation and squashes the following commits:

1281447 [Andrew Or] Address a few comments
b9843f2 [Andrew Or] Document the configs as well
246fb44 [Andrew Or] Merge branch 'SPARK-4839' of github.com:oza/spark into document-dynamic-allocation
8c64004 [Andrew Or] Add documentation for dynamic allocation (without configs)
6827b56 [Tsuyoshi Ozawa] Fixing a documentation of spark.dynamicAllocation.enabled.
53cff58 [Tsuyoshi Ozawa] Adding a documentation about dynamic resource allocation.

Diffstat (limited to 'docs/configuration.md')
-rw-r--r-- docs/configuration.md 61
1 files changed, 61 insertions, 0 deletions
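
As a rough illustration of the rules this patch documents, the sketch below checks a plain dictionary of properties for the three settings the new text marks as required, and resolves the three timeouts using the documented defaults (60 seconds, the value of `schedulerBacklogTimeout`, and 600 seconds). The helper names `missing_dynamic_allocation_settings` and `effective_timeouts` and the `example` values are hypothetical and are not part of Spark; this is a minimal sketch under those assumptions, not how Spark itself validates its configuration.

```python
# Hypothetical helpers, NOT part of Spark: they only mirror the rules stated
# in the documentation added by this patch, applied to a plain dict.

PREFIX = "spark.dynamicAllocation."

# Settings the new documentation lists as required when the feature is on.
REQUIRED_WITH_DYNAMIC_ALLOCATION = (
    PREFIX + "minExecutors",
    PREFIX + "maxExecutors",
    "spark.shuffle.service.enabled",
)


def missing_dynamic_allocation_settings(conf):
    """Return required keys absent from conf when dynamic allocation is enabled."""
    if conf.get(PREFIX + "enabled", "false").lower() != "true":
        return []  # feature is off (the documented default), nothing is required
    return [key for key in REQUIRED_WITH_DYNAMIC_ALLOCATION if key not in conf]


def effective_timeouts(conf):
    """Resolve the three timeouts in seconds, applying the documented defaults."""
    backlog = int(conf.get(PREFIX + "schedulerBacklogTimeout", 60))
    # The sustained timeout defaults to whatever schedulerBacklogTimeout is.
    sustained = int(conf.get(PREFIX + "sustainedSchedulerBacklogTimeout", backlog))
    idle = int(conf.get(PREFIX + "executorIdleTimeout", 600))
    return {"backlog": backlog, "sustained": sustained, "idle": idle}


# Illustrative values only; the executor bounds here are assumptions.
example = {
    PREFIX + "enabled": "true",
    PREFIX + "minExecutors": "2",
    PREFIX + "maxExecutors": "20",
    PREFIX + "schedulerBacklogTimeout": "30",
}

print(missing_dynamic_allocation_settings(example))
print(effective_timeouts(example))
```

In this example the shuffle service flag is deliberately left out, so the first call reports it as missing, and the sustained timeout falls back to the overridden backlog timeout rather than to 60.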
diff --git a/docs/configuration.md b/docs/configuration.md
index 64aa94f622..2c8dea869b 100644
--- a/docs/configuration.md
+++ b/docs/configuration.md
@@ -1008,6 +1008,67 @@ Apart from these, the following properties are also available, and may be useful
</tr>
</table>
+#### Dynamic allocation
+<table class="table">
+<tr><th>Property Name</th><th>Default</th><th>Meaning</th></tr>
+<tr>
+ <td><code>spark.dynamicAllocation.enabled</code></td>
+ <td>false</td>
+ <td>
+ Whether to use dynamic resource allocation, which scales the number of executors registered
+ with this application up and down based on the workload. Note that this is currently only
+ available in YARN mode. For more detail, see the description
+ <a href="job-scheduling.html#dynamic-resource-allocation">here</a>.
+ <br><br>
+ This requires the following configurations to be set:
+ <code>spark.dynamicAllocation.minExecutors</code>,
+ <code>spark.dynamicAllocation.maxExecutors</code>, and
+ <code>spark.shuffle.service.enabled</code>
+ </td>
+</tr>
+<tr>
+ <td><code>spark.dynamicAllocation.minExecutors</code></td>
+ <td>(none)</td>
+ <td>
+ Lower bound for the number of executors if dynamic allocation is enabled (required).
+ </td>
+</tr>
+<tr>
+ <td><code>spark.dynamicAllocation.maxExecutors</code></td>
+ <td>(none)</td>
+ <td>
+ Upper bound for the number of executors if dynamic allocation is enabled (required).
+ </td>
+</tr>
+<tr>
+ <td><code>spark.dynamicAllocation.schedulerBacklogTimeout</code></td>
+ <td>60</td>
+ <td>
+ If dynamic allocation is enabled and there have been pending tasks backlogged for more than
+ this duration (in seconds), new executors will be requested. For more detail, see this
+ <a href="job-scheduling.html#resource-allocation-policy">description</a>.
+ </td>
+</tr>
+<tr>
+ <td><code>spark.dynamicAllocation.sustainedSchedulerBacklogTimeout</code></td>
+ <td><code>schedulerBacklogTimeout</code></td>
+ <td>
+ Same as <code>spark.dynamicAllocation.schedulerBacklogTimeout</code>, but used only for
+ subsequent executor requests. For more detail, see this
+ <a href="job-scheduling.html#resource-allocation-policy">description</a>.
+ </td>
+</tr>
+<tr>
+ <td><code>spark.dynamicAllocation.executorIdleTimeout</code></td>
+ <td>600</td>
+ <td>
+ If dynamic allocation is enabled and an executor has been idle for more than this duration
+ (in seconds), the executor will be removed. For more detail, see this
+ <a href="job-scheduling.html#resource-allocation-policy">description</a>.
+ </td>
+</tr>
+</table>
+
#### Security
<table class="table">
<tr><th>Property Name</th><th>Default</th><th>Meaning</th></tr>