path: root/docs/python-programming-guide.md
author root <root@ip-10-165-19-218.ec2.internal> 2013-07-01 06:25:17 +0000
committer root <root@ip-10-165-19-218.ec2.internal> 2013-07-01 06:26:43 +0000
commit 7cd490ef5ba28df31f5e061eff83c855731dfca4
tree e434bfd3e9c22312db43e88fd392ff78ee86d768 /docs/python-programming-guide.md
parent ec31e68d5df259e6df001529235d8c906ff02a6f
Clarify that PySpark is not supported on Windows
Diffstat (limited to 'docs/python-programming-guide.md')
-rw-r--r-- docs/python-programming-guide.md | 5
1 files changed, 2 insertions, 3 deletions
diff --git a/docs/python-programming-guide.md b/docs/python-programming-guide.md
index 7f1e7cf93d..e8aaac74d0 100644
--- a/docs/python-programming-guide.md
+++ b/docs/python-programming-guide.md
@@ -17,10 +17,9 @@ There are a few key differences between the Python and Scala APIs:
 * Python is dynamically typed, so RDDs can hold objects of different types.
 * PySpark does not currently support the following Spark features:
     - Special functions on RDDs of doubles, such as `mean` and `stdev`
-    - `lookup`
+    - `lookup`, `sample` and `sort`
     - `persist` at storage levels other than `MEMORY_ONLY`
-    - `sample`
-    - `sort`
+    - Execution on Windows -- this is slated for a future release
 
 In PySpark, RDDs support the same methods as their Scala counterparts but take Python functions and return Python collection types.
 Short functions can be passed to RDD methods using Python's [`lambda`](http://www.diveintopython.net/power_of_introspection/lambda_functions.html) syntax: