Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Fix UnicodeEncodeError in PySpark saveAsTextFile(). | Josh Rosen | 2013-11-28 | 1 | -0/+15 |
| | | | Fixes SPARK-970. | ||||
* | Add custom serializer support to PySpark. | Josh Rosen | 2013-11-10 | 1 | -1/+2 |
| | | | For now, this only adds MarshalSerializer, but it lays the groundwork for other supporting custom serializers. Many of these mechanisms can also be used to support deserialization of different data formats sent by Java, such as data encoded by MsgPack. This also fixes a bug in SparkContext.union(). | ||||
* | Implementing SPARK-878 for PySpark: adding zip and egg files to context and passing it down to workers which add these to their sys.path | Andre Schumacher | 2013-08-16 | 1 | -0/+11 |
* | Fix PySpark unit tests on Python 2.6. | Josh Rosen | 2013-08-14 | 1 | -5/+8 |
* | Add Apache license headers and LICENSE and NOTICE files | Matei Zaharia | 2013-07-16 | 1 | -0/+17 |
* | Add tests and fixes for Python daemon shutdown | Jey Kottalam | 2013-06-21 | 1 | -0/+43 |
* | Do not launch JavaGateways on workers (SPARK-674). | Josh Rosen | 2013-02-01 | 1 | -1/+1 |
| | | | The problem was that the gateway was being initialized whenever the pyspark.context module was loaded. The fix uses lazy initialization that occurs only when SparkContext instances are actually constructed. I also made the gateway and jvm variables private. This change results in ~3-4x performance improvement when running the PySpark unit tests. | ||||
* | Fix stdout redirection in PySpark. | Josh Rosen | 2013-02-01 | 1 | -0/+9 |
* | Replace old 'master' term with 'driver'. | Stephen Haberman | 2013-01-25 | 1 | -1/+1 |
* | Allow PySpark's SparkFiles to be used from driver | Josh Rosen | 2013-01-23 | 1 | -0/+23 |
| | | | Fix minor documentation formatting issues. | ||||
* | Fix sys.path bug in PySpark SparkContext.addPyFile | Josh Rosen | 2013-01-22 | 1 | -5/+33 |
* | Clean up setup code in PySpark checkpointing tests | Josh Rosen | 2013-01-20 | 1 | -14/+5 |
* | Add checkpointFile() and more tests to PySpark. | Josh Rosen | 2013-01-20 | 1 | -0/+24 |
* | Add RDD checkpointing to Python API. | Josh Rosen | 2013-01-20 | 1 | -0/+46 |