aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark
Commit message (Expand)AuthorAgeFilesLines
* Fix PySpark docs and an overly long line of code after fdbae41eMatei Zaharia2013-10-091-8/+8
* SPARK-705: implement sortByKey() in PySparkAndre Schumacher2013-10-071-1/+47
* Fixing SPARK-602: PythonPartitionerAndre Schumacher2013-10-042-4/+10
* Update build version in masterPatrick Wendell2013-09-241-1/+1
* Whoopsy daisyAaron Davidson2013-09-081-1/+0
* Export StorageLevel and refactorAaron Davidson2013-09-075-26/+62
* Remove reflection, hard-code StorageLevelsAaron Davidson2013-09-072-24/+26
* Memoize StorageLevels read from JVMAaron Davidson2013-09-061-2/+9
* SPARK-660: Add StorageLevel support in PythonAaron Davidson2013-09-053-1/+34
* Add missing license headers found with RATMatei Zaharia2013-09-021-1/+18
* Further fixes to get PySpark to work on WindowsMatei Zaharia2013-09-021-5/+12
* Allow PySpark to launch worker.py directly on WindowsMatei Zaharia2013-09-011-4/+7
* Move some classes to more appropriate packages:Matei Zaharia2013-09-011-2/+2
* Add banner to PySpark and make wordcount output nicerMatei Zaharia2013-09-011-0/+13
* Initial work to rename package to org.apache.sparkMatei Zaharia2013-09-013-5/+5
* Merge pull request #861 from AndreSchumacher/pyspark_sampling_functionMatei Zaharia2013-08-312-7/+167
|\
| * RDD sample() and takeSample() prototypes for PySparkAndre Schumacher2013-08-282-7/+167
* | Merge pull request #870 from JoshRosen/spark-885Matei Zaharia2013-08-311-1/+5
|\ \
| * | Don't send SIGINT to Py4J gateway subprocess.Josh Rosen2013-08-281-1/+5
| |/
* | Merge pull request #869 from AndreSchumacher/subtractMatei Zaharia2013-08-301-0/+37
|\ \
| * | PySpark: implementing subtractByKey(), subtract() and keyBy()Andre Schumacher2013-08-281-0/+37
| |/
* / Change build and run instructions to use assembliesMatei Zaharia2013-08-291-1/+1
|/
* Implementing SPARK-838: Add DoubleRDDFunctions methods to PySparkAndre Schumacher2013-08-212-1/+168
* Implementing SPARK-878 for PySpark: adding zip and egg files to context and p...Andre Schumacher2013-08-164-5/+37
* Fix PySpark unit tests on Python 2.6.Josh Rosen2013-08-141-5/+8
* Merge pull request #813 from AndreSchumacher/add_files_pysparkMatei Zaharia2013-08-121-1/+6
|\
| * Implementing SPARK-865: Add the equivalent of ADD_JARS to PySparkAndre Schumacher2013-08-121-1/+6
* | Do not inherit master's PYTHONPATH on workers.Josh Rosen2013-07-291-3/+2
* | SPARK-815. Python parallelize() should split lists before batchingMatei Zaharia2013-07-291-2/+9
* | Use None instead of empty string as it's slightly smaller/fasterMatei Zaharia2013-07-291-1/+1
* | Optimize Python foreach() to not return as many objectsMatei Zaharia2013-07-291-1/+5
* | Optimize Python take() to not compute entire first partitionMatei Zaharia2013-07-291-6/+9
* | Add Apache license headers and LICENSE and NOTICE filesMatei Zaharia2013-07-1611-0/+187
* | Fixed PySpark perf regression by not using socket.makefile(), and improvedroot2013-07-011-18/+24
* | Fix reporting of PySpark exceptionsJey Kottalam2013-06-212-5/+19
* | PySpark daemon: fix deadlock, improve error handlingJey Kottalam2013-06-211-17/+50
* | Add tests and fixes for Python daemon shutdownJey Kottalam2013-06-213-22/+69
* | Prefork Python worker processesJey Kottalam2013-06-212-32/+138
* | Add Python timing instrumentationJey Kottalam2013-06-212-1/+19
* | Fix Python saveAsTextFile doctest to not expect order to be preservedJey Kottalam2013-04-021-1/+1
* | Change numSplits to numPartitions in PySpark.Josh Rosen2013-02-242-38/+38
* | Add commutative requirement for 'reduce' to Python docstring.Mark Hamstra2013-02-091-2/+2
|/
* Remove unnecessary doctest __main__ methods.Josh Rosen2013-02-032-18/+0
* Fetch fewer objects in PySpark's take() method.Josh Rosen2013-02-031-0/+4
* Fix reporting of PySpark doctest failures.Josh Rosen2013-02-032-2/+6
* Use spark.local.dir for PySpark temp files (SPARK-580).Josh Rosen2013-02-012-10/+9
* Do not launch JavaGateways on workers (SPARK-674).Josh Rosen2013-02-014-18/+25
* Fix stdout redirection in PySpark.Josh Rosen2013-02-012-2/+12
* SPARK-673: Capture and re-throw Python exceptionsPatrick Wendell2013-01-311-2/+8
* Merge pull request #430 from pwendell/pyspark-guideMatei Zaharia2013-01-301-0/+1
|\