aboutsummaryrefslogtreecommitdiff
path: root/docs/monitoring.md
diff options
context:
space:
mode:
authorMarcelo Vanzin <vanzin@cloudera.com>2014-06-23 13:53:44 -0700
committerPatrick Wendell <pwendell@gmail.com>2014-06-23 13:53:44 -0700
commit21ddd7d1e9f8e2a726427f32422c31706a20ba3f (patch)
tree0e789473fdf117d4768fca9f20b7169b3ebbc4ff /docs/monitoring.md
parent6dc6722a666ec1afc6fb13e7110907d8f5f9cd9d (diff)
downloadspark-21ddd7d1e9f8e2a726427f32422c31706a20ba3f.tar.gz
spark-21ddd7d1e9f8e2a726427f32422c31706a20ba3f.tar.bz2
spark-21ddd7d1e9f8e2a726427f32422c31706a20ba3f.zip
[SPARK-1768] History server enhancements.
Two improvements to the history server: - Separate the HTTP handling from history fetching, so that it's easy to add new backends later (thinking about SPARK-1537 in the long run) - Avoid loading all UIs in memory. Do lazy loading instead, keeping a few in memory for faster access. This allows the app limit to go away, since holding just the listing in memory shouldn't be too expensive unless the user has millions of completed apps in the history (at which point I'd expect other issues to arise aside from history server memory usage, such as FileSystem.listStatus() starting to become ridiculously expensive). I also fixed a few minor things along the way which aren't really worth mentioning. I also removed the app's log path from the UI since that information may not even exist depending on which backend is used (even though there is only one now). Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #718 from vanzin/hist-server and squashes the following commits: 53620c9 [Marcelo Vanzin] Add mima exclude, fix scaladoc wording. c21f8d8 [Marcelo Vanzin] Feedback: formatting, docs. dd8cc4b [Marcelo Vanzin] Standardize on using spark.history.* configuration. 4da3a52 [Marcelo Vanzin] Remove UI from ApplicationHistoryInfo. 2a7f68d [Marcelo Vanzin] Address review feedback. 4e72c77 [Marcelo Vanzin] Remove comment about ordering. 249bcea [Marcelo Vanzin] Remove offset / count from provider interface. ca5d320 [Marcelo Vanzin] Remove code that deals with unfinished apps. 6e2432f [Marcelo Vanzin] Second round of feedback. b2c570a [Marcelo Vanzin] Make class package-private. 4406f61 [Marcelo Vanzin] Cosmetic change to listing header. e852149 [Marcelo Vanzin] Initialize new app array to expected size. e8026f4 [Marcelo Vanzin] Review feedback. 49d2fd3 [Marcelo Vanzin] Fix a comment. 91e96ca [Marcelo Vanzin] Fix scalastyle issues. 6fbe0d8 [Marcelo Vanzin] Better handle failures when loading app info. eee2f5a [Marcelo Vanzin] Ensure server.stop() is called when shutting down. bda2fa1 [Marcelo Vanzin] Rudimentary paging support for the history UI. b284478 [Marcelo Vanzin] Separate history server from history backend.
Diffstat (limited to 'docs/monitoring.md')
-rw-r--r--docs/monitoring.md21
1 files changed, 15 insertions, 6 deletions
diff --git a/docs/monitoring.md b/docs/monitoring.md
index 2b9e9e5bd7..84073fe4d9 100644
--- a/docs/monitoring.md
+++ b/docs/monitoring.md
@@ -35,11 +35,13 @@ If Spark is run on Mesos or YARN, it is still possible to reconstruct the UI of
application through Spark's history server, provided that the application's event logs exist.
You can start a the history server by executing:
- ./sbin/start-history-server.sh <base-logging-directory>
+ ./sbin/start-history-server.sh
-The base logging directory must be supplied, and should contain sub-directories that each
-represents an application's event logs. This creates a web interface at
-`http://<server-url>:18080` by default. The history server can be configured as follows:
+When using the file-system provider class (see spark.history.provider below), the base logging
+directory must be supplied in the <code>spark.history.fs.logDirectory</code> configuration option,
+and should contain sub-directories that each represents an application's event logs. This creates a
+web interface at `http://<server-url>:18080` by default. The history server can be configured as
+follows:
<table class="table">
<tr><th style="width:21%">Environment Variable</th><th>Meaning</th></tr>
@@ -69,7 +71,14 @@ represents an application's event logs. This creates a web interface at
<table class="table">
<tr><th>Property Name</th><th>Default</th><th>Meaning</th></tr>
<tr>
- <td>spark.history.updateInterval</td>
+ <td>spark.history.provider</td>
+ <td>org.apache.spark.deploy.history.FsHistoryProvider</td>
+ <td>Name of the class implementing the application history backend. Currently there is only
+ one implementation, provided by Spark, which looks for application logs stored in the
+ file system.</td>
+ </tr>
+ <tr>
+ <td>spark.history.fs.updateInterval</td>
<td>10</td>
<td>
The period, in seconds, at which information displayed by this history server is updated.
@@ -78,7 +87,7 @@ represents an application's event logs. This creates a web interface at
</tr>
<tr>
<td>spark.history.retainedApplications</td>
- <td>250</td>
+ <td>50</td>
<td>
The number of application UIs to retain. If this cap is exceeded, then the oldest
applications will be removed.