path: root/python/pyspark/tests.py
Commit message | Author | Age | Files | Lines
* Fixed Python API for sc.setCheckpointDir. Also other fixes based on Reynold's comments on PR 289. | Tathagata Das | 2013-12-24 | 1 | -2/+2
* Fix UnicodeEncodeError in PySpark saveAsTextFile(). | Josh Rosen | 2013-11-28 | 1 | -0/+15
    Fixes SPARK-970.
* Add custom serializer support to PySpark. | Josh Rosen | 2013-11-10 | 1 | -1/+2
    For now, this only adds MarshalSerializer, but it lays the groundwork for other supporting custom serializers. Many of these mechanisms can also be used to support deserialization of different data formats sent by Java, such as data encoded by MsgPack. This also fixes a bug in SparkContext.union().
* Implementing SPARK-878 for PySpark: adding zip and egg files to context and passing it down to workers which add these to their sys.path | Andre Schumacher | 2013-08-16 | 1 | -0/+11
* Fix PySpark unit tests on Python 2.6. | Josh Rosen | 2013-08-14 | 1 | -5/+8
* Add Apache license headers and LICENSE and NOTICE files | Matei Zaharia | 2013-07-16 | 1 | -0/+17
* Add tests and fixes for Python daemon shutdown | Jey Kottalam | 2013-06-21 | 1 | -0/+43
* Do not launch JavaGateways on workers (SPARK-674). | Josh Rosen | 2013-02-01 | 1 | -1/+1
    The problem was that the gateway was being initialized whenever the pyspark.context module was loaded. The fix uses lazy initialization that occurs only when SparkContext instances are actually constructed. I also made the gateway and jvm variables private. This change results in ~3-4x performance improvement when running the PySpark unit tests.
* Fix stdout redirection in PySpark. | Josh Rosen | 2013-02-01 | 1 | -0/+9
* Replace old 'master' term with 'driver'. | Stephen Haberman | 2013-01-25 | 1 | -1/+1
* Allow PySpark's SparkFiles to be used from driver | Josh Rosen | 2013-01-23 | 1 | -0/+23
    Fix minor documentation formatting issues.
* Fix sys.path bug in PySpark SparkContext.addPyFile | Josh Rosen | 2013-01-22 | 1 | -5/+33
* Clean up setup code in PySpark checkpointing tests | Josh Rosen | 2013-01-20 | 1 | -14/+5
* Add checkpointFile() and more tests to PySpark. | Josh Rosen | 2013-01-20 | 1 | -0/+24
* Add RDD checkpointing to Python API. | Josh Rosen | 2013-01-20 | 1 | -0/+46