author: jerryshao <sshao@hortonworks.com> 2016-04-28 16:39:49 -0700
committer: Marcelo Vanzin <vanzin@cloudera.com> 2016-04-28 16:39:49 -0700
commit: 2398e3d69c9a675d651c192107953de8e6c2aecd
tree: fe0d38664199446c8c8e76c5a4f38196cdbeb329 /common/tags
parent: 4f4721a21cc9acc2b6f685bbfc8757d29563a775
[SPARK-14836][YARN] Zip all the jars before uploading to distributed cache
## What changes were proposed in this pull request?

<copy from JIRA>

Currently, if neither `spark.yarn.jars` nor `spark.yarn.archive` is set (the default), the Spark on YARN code uploads every jar in the folder to the distributed cache separately. This is time consuming and very verbose. Instead of uploading the jars one by one, this change zips all the jars first and then puts the single archive into the distributed cache. This significantly shortens startup time.

## How was this patch tested?

Unit tests and a local integration test. Verified with SparkPi in both cluster and client mode.

Author: jerryshao <sshao@hortonworks.com>

Closes #12597 from jerryshao/SPARK-14836.
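
The zip-then-upload idea described above can be sketched roughly as follows. This is an illustrative sketch only, not the code from this patch: the class name `JarArchiver` and the method `zipJars` are hypothetical, and storing entries without compression (level 0) is an assumption made here because jar files are already compressed.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of Spark: bundles the jars in one directory
// into a single zip so that only one file has to be uploaded.
final class JarArchiver {

    /** Zips every *.jar directly under jarsDir into archive; returns the entry count. */
    static int zipJars(Path jarsDir, Path archive) throws IOException {
        int count = 0;
        try (OutputStream out = Files.newOutputStream(archive);
             ZipOutputStream zip = new ZipOutputStream(out);
             DirectoryStream<Path> jars = Files.newDirectoryStream(jarsDir, "*.jar")) {
            // Assumption: skip recompression, since jars are already compressed.
            zip.setLevel(0);
            for (Path jar : jars) {
                if (!Files.isRegularFile(jar)) {
                    continue; // ignore directories that happen to end in .jar
                }
                zip.putNextEntry(new ZipEntry(jar.getFileName().toString()));
                Files.copy(jar, zip); // stream the jar bytes into the current entry
                zip.closeEntry();
                count++;
            }
        }
        return count;
    }
}
```

The resulting archive would then be placed in the distributed cache as a single resource, in place of one upload per jar.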
Diffstat (limited to 'common/tags')
0 files changed, 0 insertions, 0 deletions