path: root/python/pyspark
Commit message  Author  Age  Files  Lines
* Merge branch 'master' of git://github.com/mesos/spark into scala-2.10  Prashant Sharma  2013-09-15  5  -1/+78
|\
| * Whoopsy daisy  Aaron Davidson  2013-09-08  1  -1/+0
| * Export StorageLevel and refactor  Aaron Davidson  2013-09-07  5  -26/+62
| * Remove reflection, hard-code StorageLevels  Aaron Davidson  2013-09-07  2  -24/+26
| * Memoize StorageLevels read from JVM  Aaron Davidson  2013-09-06  1  -2/+9
| * SPARK-660: Add StorageLevel support in Python  Aaron Davidson  2013-09-05  3  -1/+34
* | Merged with master  Prashant Sharma  2013-09-06  14  -44/+692
|\|
| * Add missing license headers found with RAT  Matei Zaharia  2013-09-02  1  -1/+18
| * Further fixes to get PySpark to work on Windows  Matei Zaharia  2013-09-02  1  -5/+12
| * Allow PySpark to launch worker.py directly on Windows  Matei Zaharia  2013-09-01  1  -4/+7
| * Move some classes to more appropriate packages:  Matei Zaharia  2013-09-01  1  -2/+2
| * Add banner to PySpark and make wordcount output nicer  Matei Zaharia  2013-09-01  1  -0/+13
| * Initial work to rename package to org.apache.spark  Matei Zaharia  2013-09-01  3  -5/+5
| * Merge pull request #861 from AndreSchumacher/pyspark_sampling_function  Matei Zaharia  2013-08-31  2  -7/+167
| |\
| | * RDD sample() and takeSample() prototypes for PySpark  Andre Schumacher  2013-08-28  2  -7/+167
| * | Merge pull request #870 from JoshRosen/spark-885  Matei Zaharia  2013-08-31  1  -1/+5
| |\ \
| | * | Don't send SIGINT to Py4J gateway subprocess.  Josh Rosen  2013-08-28  1  -1/+5
| | |/
| * | Merge pull request #869 from AndreSchumacher/subtract  Matei Zaharia  2013-08-30  1  -0/+37
| |\ \
| | * | PySpark: implementing subtractByKey(), subtract() and keyBy()  Andre Schumacher  2013-08-28  1  -0/+37
| | |/
| * / Change build and run instructions to use assemblies  Matei Zaharia  2013-08-29  1  -1/+1
| |/
| * Implementing SPARK-838: Add DoubleRDDFunctions methods to PySpark  Andre Schumacher  2013-08-21  2  -1/+168
| * Implementing SPARK-878 for PySpark: adding zip and egg files to context and p...  Andre Schumacher  2013-08-16  4  -5/+37
| * Fix PySpark unit tests on Python 2.6.  Josh Rosen  2013-08-14  1  -5/+8
| * Merge pull request #813 from AndreSchumacher/add_files_pyspark  Matei Zaharia  2013-08-12  1  -1/+6
| |\
| | * Implementing SPARK-865: Add the equivalent of ADD_JARS to PySpark  Andre Schumacher  2013-08-12  1  -1/+6
| * | Do not inherit master's PYTHONPATH on workers.  Josh Rosen  2013-07-29  1  -3/+2
| * | SPARK-815. Python parallelize() should split lists before batching  Matei Zaharia  2013-07-29  1  -2/+9
| * | Use None instead of empty string as it's slightly smaller/faster  Matei Zaharia  2013-07-29  1  -1/+1
| * | Optimize Python foreach() to not return as many objects  Matei Zaharia  2013-07-29  1  -1/+5
| * | Optimize Python take() to not compute entire first partition  Matei Zaharia  2013-07-29  1  -6/+9
| * | Add Apache license headers and LICENSE and NOTICE files  Matei Zaharia  2013-07-16  11  -0/+187
* | | PySpark: replacing class manifest by class tag for Scala 2.10.2 inside rdd.py  Andre Schumacher  2013-08-30  1  -2/+2
|/ /
* | Fixed PySpark perf regression by not using socket.makefile(), and improved  root  2013-07-01  1  -18/+24
* | Fix reporting of PySpark exceptions  Jey Kottalam  2013-06-21  2  -5/+19
* | PySpark daemon: fix deadlock, improve error handling  Jey Kottalam  2013-06-21  1  -17/+50
* | Add tests and fixes for Python daemon shutdown  Jey Kottalam  2013-06-21  3  -22/+69
* | Prefork Python worker processes  Jey Kottalam  2013-06-21  2  -32/+138
* | Add Python timing instrumentation  Jey Kottalam  2013-06-21  2  -1/+19
* | Fix Python saveAsTextFile doctest to not expect order to be preserved  Jey Kottalam  2013-04-02  1  -1/+1
* | Change numSplits to numPartitions in PySpark.  Josh Rosen  2013-02-24  2  -38/+38
* | Add commutative requirement for 'reduce' to Python docstring.  Mark Hamstra  2013-02-09  1  -2/+2
|/
* Remove unnecessary doctest __main__ methods.  Josh Rosen  2013-02-03  2  -18/+0
* Fetch fewer objects in PySpark's take() method.  Josh Rosen  2013-02-03  1  -0/+4
* Fix reporting of PySpark doctest failures.  Josh Rosen  2013-02-03  2  -2/+6
* Use spark.local.dir for PySpark temp files (SPARK-580).  Josh Rosen  2013-02-01  2  -10/+9
* Do not launch JavaGateways on workers (SPARK-674).  Josh Rosen  2013-02-01  4  -18/+25
* Fix stdout redirection in PySpark.  Josh Rosen  2013-02-01  2  -2/+12
* SPARK-673: Capture and re-throw Python exceptions  Patrick Wendell  2013-01-31  1  -2/+8
* Merge pull request #430 from pwendell/pyspark-guide  Matei Zaharia  2013-01-30  1  -0/+1
|\
| * Make module help available in python shell.  Patrick Wendell  2013-01-30  1  -0/+1