aboutsummaryrefslogtreecommitdiff
path: root/docs
diff options
context:
space:
mode:
authortgravescs <tgraves@thatenemy-lm.champ.corp.yahoo.com>2013-10-03 11:52:16 -0500
committertgravescs <tgraves@thatenemy-lm.champ.corp.yahoo.com>2013-10-03 11:52:16 -0500
commit0fff4ee8523ff4137eedfc314b51135427137c63 (patch)
tree9f96cb33c2d5c3dc54b34a4716052957a1841633 /docs
parent9d4246863a25f7c91f324e004fe000b9848f6057 (diff)
downloadspark-0fff4ee8523ff4137eedfc314b51135427137c63.tar.gz
spark-0fff4ee8523ff4137eedfc314b51135427137c63.tar.bz2
spark-0fff4ee8523ff4137eedfc314b51135427137c63.zip
Adding in the --addJars option to make SparkContext.addJar work on yarn and cleanup
the classpaths
Diffstat (limited to 'docs')
-rw-r--r--docs/running-on-yarn.md2
1 files changed, 2 insertions, 0 deletions
diff --git a/docs/running-on-yarn.md b/docs/running-on-yarn.md
index beaae69aa2..a807ec603d 100644
--- a/docs/running-on-yarn.md
+++ b/docs/running-on-yarn.md
@@ -53,6 +53,7 @@ The command to launch the YARN Client is as follows:
--worker-memory <MEMORY_PER_WORKER> \
--worker-cores <CORES_PER_WORKER> \
--queue <queue_name> \
+ --addJars <any_local_files_used_in_SparkContext.addJar> \
--files <files_for_distributed_cache> \
--archives <archives_for_distributed_cache>
@@ -88,3 +89,4 @@ The above starts a YARN Client programs which periodically polls the Application
- We do not requesting container resources based on the number of cores. Thus the numbers of cores given via command line arguments cannot be guaranteed.
- The local directories used for spark will be the local directories configured for YARN (Hadoop Yarn config yarn.nodemanager.local-dirs). If the user specifies spark.local.dir, it will be ignored.
- The --files and --archives options support specifying file names with the # similar to Hadoop. For example you can specify: --files localtest.txt#appSees.txt and this will upload the file you have locally named localtest.txt into HDFS but this will be linked to by the name appSees.txt and your application should use the name as appSees.txt to reference it when running on YARN.
+- The --addJars option allows the SparkContext.addJar function to work if you are using it with local files. It does not need to be used if you are using it with HDFS, HTTP, HTTPS, or FTP files.