aboutsummaryrefslogtreecommitdiff
path: root/pyspark
Commit message (Expand)AuthorAgeFilesLines
* Fix minor bugs in Python API examples.Josh Rosen2012-08-272-5/+5
* Add pipe(), saveAsTextFile(), sc.union() to Python API.Josh Rosen2012-08-272-8/+31
* Simplify Python worker; pipeline the map step of partitionBy().Josh Rosen2012-08-274-100/+52
* Use local combiners in Python API combineByKey().Josh Rosen2012-08-272-25/+24
* Add countByKey(), reduceByKeyLocally() to Python APIJosh Rosen2012-08-271-13/+39
* Add mapPartitions(), glom(), countByValue() to Python API.Josh Rosen2012-08-271-4/+28
* Add broadcast variables to Python API.Josh Rosen2012-08-274-12/+84
* Implement fold() in Python API.Josh Rosen2012-08-271-1/+19
* Refactor Python MappedRDD to use iterator pipelines.Josh Rosen2012-08-242-97/+41
* Fix options parsing in Python pi example.Josh Rosen2012-08-241-1/+1
* Use numpy in Python k-means example.Josh Rosen2012-08-223-26/+14
* Use only cPickle for serialization in Python API.Josh Rosen2012-08-216-560/+233
* Bundle cloudpickle with pyspark.Josh Rosen2012-08-194-5/+976
* Add Python API.Josh Rosen2012-08-1812-0/+1170