aboutsummaryrefslogtreecommitdiff
path: root/docs/python-programming-guide.md
diff options
context:
space:
mode:
authorSandy Ryza <sandy@cloudera.com>2014-04-29 23:24:34 -0700
committerPatrick Wendell <pwendell@gmail.com>2014-04-29 23:24:34 -0700
commitff5be9a41e52454e0f9cae83dd1fd50fbeaa684a (patch)
tree5bd17eaa50b3120317370821104c9c6d9e238b56 /docs/python-programming-guide.md
parent7025dda8fa84b57d6f12bc770df2fa10eef21d88 (diff)
downloadspark-ff5be9a41e52454e0f9cae83dd1fd50fbeaa684a.tar.gz
spark-ff5be9a41e52454e0f9cae83dd1fd50fbeaa684a.tar.bz2
spark-ff5be9a41e52454e0f9cae83dd1fd50fbeaa684a.zip
SPARK-1004. PySpark on YARN
This reopens https://github.com/apache/incubator-spark/pull/640 against the new repo Author: Sandy Ryza <sandy@cloudera.com> Closes #30 from sryza/sandy-spark-1004 and squashes the following commits: 89889d4 [Sandy Ryza] Move unzipping py4j to the generate-resources phase so that it gets included in the jar the first time 5165a02 [Sandy Ryza] Fix docs fd0df79 [Sandy Ryza] PySpark on YARN
Diffstat (limited to 'docs/python-programming-guide.md')
-rw-r--r--docs/python-programming-guide.md3
1 files changed, 3 insertions, 0 deletions
diff --git a/docs/python-programming-guide.md b/docs/python-programming-guide.md
index 98233bf556..98c456228a 100644
--- a/docs/python-programming-guide.md
+++ b/docs/python-programming-guide.md
@@ -63,6 +63,9 @@ All of PySpark's library dependencies, including [Py4J](http://py4j.sourceforge.
Standalone PySpark applications should be run using the `bin/pyspark` script, which automatically configures the Java and Python environment using the settings in `conf/spark-env.sh` or `.cmd`.
The script automatically adds the `bin/pyspark` package to the `PYTHONPATH`.
+# Running PySpark on YARN
+
+To run PySpark against a YARN cluster, simply set the MASTER environment variable to "yarn-client".
# Interactive Use