author | Sandy Ryza <sandy@cloudera.com> | 2014-04-29 23:24:34 -0700 |
---|---|---|
committer | Patrick Wendell <pwendell@gmail.com> | 2014-04-29 23:24:34 -0700 |
commit | ff5be9a41e52454e0f9cae83dd1fd50fbeaa684a (patch) | |
tree | 5bd17eaa50b3120317370821104c9c6d9e238b56 /docs | |
parent | 7025dda8fa84b57d6f12bc770df2fa10eef21d88 (diff) | |
SPARK-1004. PySpark on YARN
This reopens https://github.com/apache/incubator-spark/pull/640 against the new repo
Author: Sandy Ryza <sandy@cloudera.com>
Closes #30 from sryza/sandy-spark-1004 and squashes the following commits:
89889d4 [Sandy Ryza] Move unzipping py4j to the generate-resources phase so that it gets included in the jar the first time
5165a02 [Sandy Ryza] Fix docs
fd0df79 [Sandy Ryza] PySpark on YARN
Diffstat (limited to 'docs')
-rw-r--r-- | docs/python-programming-guide.md | 3 |
1 files changed, 3 insertions, 0 deletions
diff --git a/docs/python-programming-guide.md b/docs/python-programming-guide.md
index 98233bf556..98c456228a 100644
--- a/docs/python-programming-guide.md
+++ b/docs/python-programming-guide.md
@@ -63,6 +63,9 @@ All of PySpark's library dependencies, including [Py4J](http://py4j.sourceforge.
 Standalone PySpark applications should be run using the `bin/pyspark` script, which automatically configures the Java and Python environment using the settings in `conf/spark-env.sh` or `.cmd`.
 The script automatically adds the `bin/pyspark` package to the `PYTHONPATH`.
 
+# Running PySpark on YARN
+
+To run PySpark against a YARN cluster, simply set the MASTER environment variable to "yarn-client".
 
 # Interactive Use
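As a rough illustration of the added "Running PySpark on YARN" section, the sketch below sets `MASTER` to `yarn-client` in a copy of the environment and builds the command line for launching a standalone application through `bin/pyspark`. The helper names (`yarn_client_env`, `pyspark_command`, `run_on_yarn`) and the example paths are hypothetical and not part of this commit; it assumes a Spark checkout of this era in which `bin/pyspark` accepts a script path.

```python
import os
import subprocess


def yarn_client_env(base_env=None):
    """Return a copy of the environment with MASTER set to "yarn-client"."""
    # Copy so the caller's environment (or os.environ) is left untouched.
    env = dict(os.environ if base_env is None else base_env)
    env["MASTER"] = "yarn-client"
    return env


def pyspark_command(spark_home, app_path):
    """Build the argv for running a standalone app through bin/pyspark."""
    return [os.path.join(spark_home, "bin", "pyspark"), app_path]


def run_on_yarn(spark_home, app_path):
    """Hypothetical launcher: run the app with MASTER=yarn-client, return exit code."""
    return subprocess.call(pyspark_command(spark_home, app_path),
                           env=yarn_client_env())
```

Calling `run_on_yarn("/path/to/spark", "my_app.py")` (both paths are placeholders) would be roughly equivalent to exporting `MASTER=yarn-client` in the shell before invoking `bin/pyspark my_app.py`.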