aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Collapse)AuthorAgeFilesLines
* Allow PySpark's SparkFiles to be used from driverJosh Rosen2013-01-235-9/+63
| | | | Fix minor documentation formatting issues.
* Fix sys.path bug in PySpark SparkContext.addPyFileJosh Rosen2013-01-224-7/+41
|
* Don't download files to master's working directory.Josh Rosen2013-01-215-5/+70
| | | | | | | This should avoid exceptions caused by existing files with different contents. I also removed some unused code.
* Merge pull request #389 from JoshRosen/python_rdd_checkpointingMatei Zaharia2013-01-205-3/+116
|\ | | | | Add checkpointing to the Python API
| * Clean up setup code in PySpark checkpointing testsJosh Rosen2013-01-202-16/+6
| |
| * Update checkpointing API docs in Python/Java.Josh Rosen2013-01-202-16/+12
| |
| * Add checkpointFile() and more tests to PySpark.Josh Rosen2013-01-203-2/+37
| |
| * Add RDD checkpointing to Python API.Josh Rosen2013-01-205-1/+93
| |
* | Fix PythonPartitioner equality; see SPARK-654.Josh Rosen2013-01-201-6/+11
|/ | | | | | PythonPartitioner did not take the Python-side partitioning function into account when checking for equality, which might cause problems in the future.
* Add __repr__ to Accumulator; fix bug in sc.accumulatorJosh Rosen2013-01-201-1/+10
|
* Merge pull request #387 from mateiz/python-accumulatorsJosh Rosen2013-01-208-5/+238
|\ | | | | Add accumulators to PySpark
| * Add a class comment to AccumulatorMatei Zaharia2013-01-201-0/+12
| |
| * Launch accumulator tests in run-testsMatei Zaharia2013-01-201-0/+3
| |
| * Added accumulators to PySparkMatei Zaharia2013-01-207-5/+223
| |
* | Minor formatting fixesMatei Zaharia2013-01-201-1/+1
| |
* | Python ALS exampleNick Pentreath2013-01-151-0/+71
|/
* Change PYSPARK_PYTHON_EXEC to PYSPARK_PYTHON.Josh Rosen2013-01-101-1/+1
|
* Use take() instead of takeSample() in PySpark kmeans example.Josh Rosen2013-01-091-1/+3
| | | | This is a temporary change until we port takeSample().
* Indicate success/failure in PySpark test script.Josh Rosen2013-01-091-0/+17
|
* Add mapPartitionsWithSplit() to PySpark.Josh Rosen2013-01-082-12/+25
|
* Change PySpark RDD.take() to not call iterator().Josh Rosen2013-01-032-6/+6
|
* Add `pyspark` script to replace the other scripts.Josh Rosen2013-01-012-26/+19
| | | Expand the PySpark programming guide.
* Rename top-level 'pyspark' directory to 'python'Josh Rosen2013-01-0121-0/+2442
|
* Fix Python 2.6 compatibility in Python API.Josh Rosen2012-09-171-22/+0
|
* Add Python API.Josh Rosen2012-08-181-0/+22