aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark
Commit message (Expand)AuthorAgeFilesLines
* Merge branch 'master' into wip-scala-2.10Prashant Sharma2013-11-276-141/+381
|\
| * Removed unused basestring case from dump_stream.Josh Rosen2013-11-261-2/+0
| * FramedSerializer: _dumps => dumps, _loads => loads.Josh Rosen2013-11-104-18/+18
| * Send PySpark commands as bytes insetad of strings.Josh Rosen2013-11-103-16/+13
| * Add custom serializer support to PySpark.Josh Rosen2013-11-106-147/+360
| * Remove Pickle-wrapping of Java objects in PySpark.Josh Rosen2013-11-034-14/+39
| * Replace magic lengths with constants in PySpark.Josh Rosen2013-11-032-6/+13
* | Merge branch 'master' into scala-2.10Raymond Liu2013-11-132-13/+50
|\|
| * Pass self to SparkContext._ensure_initialized.Ewen Cheslack-Postava2013-10-221-1/+10
| * Add classmethod to SparkContext to set system properties.Ewen Cheslack-Postava2013-10-221-12/+29
| * Add an add() method to pyspark accumulators.Ewen Cheslack-Postava2013-10-191-1/+12
* | Merge branch 'master' of github.com:apache/incubator-spark into scala-2.10Prashant Sharma2013-10-101-7/+53
|\|
| * Fix PySpark docs and an overly long line of code after fdbae41eMatei Zaharia2013-10-091-8/+8
| * SPARK-705: implement sortByKey() in PySparkAndre Schumacher2013-10-071-1/+47
* | Merge branch 'master' into wip-merge-masterPrashant Sharma2013-10-082-4/+10
|\|
| * Fixing SPARK-602: PythonPartitionerAndre Schumacher2013-10-042-4/+10
* | Merge branch 'master' into scala-2.10Prashant Sharma2013-10-011-1/+1
|\|
| * Update build version in masterPatrick Wendell2013-09-241-1/+1
* | Merge branch 'master' of git://github.com/mesos/spark into scala-2.10Prashant Sharma2013-09-155-1/+78
|\|
| * Whoopsy daisyAaron Davidson2013-09-081-1/+0
| * Export StorageLevel and refactorAaron Davidson2013-09-075-26/+62
| * Remove reflection, hard-code StorageLevelsAaron Davidson2013-09-072-24/+26
| * Memoize StorageLevels read from JVMAaron Davidson2013-09-061-2/+9
| * SPARK-660: Add StorageLevel support in PythonAaron Davidson2013-09-053-1/+34
* | Merged with masterPrashant Sharma2013-09-0614-44/+692
|\|
| * Add missing license headers found with RATMatei Zaharia2013-09-021-1/+18
| * Further fixes to get PySpark to work on WindowsMatei Zaharia2013-09-021-5/+12
| * Allow PySpark to launch worker.py directly on WindowsMatei Zaharia2013-09-011-4/+7
| * Move some classes to more appropriate packages:Matei Zaharia2013-09-011-2/+2
| * Add banner to PySpark and make wordcount output nicerMatei Zaharia2013-09-011-0/+13
| * Initial work to rename package to org.apache.sparkMatei Zaharia2013-09-013-5/+5
| * Merge pull request #861 from AndreSchumacher/pyspark_sampling_functionMatei Zaharia2013-08-312-7/+167
| |\
| | * RDD sample() and takeSample() prototypes for PySparkAndre Schumacher2013-08-282-7/+167
| * | Merge pull request #870 from JoshRosen/spark-885Matei Zaharia2013-08-311-1/+5
| |\ \
| | * | Don't send SIGINT to Py4J gateway subprocess.Josh Rosen2013-08-281-1/+5
| | |/
| * | Merge pull request #869 from AndreSchumacher/subtractMatei Zaharia2013-08-301-0/+37
| |\ \
| | * | PySpark: implementing subtractByKey(), subtract() and keyBy()Andre Schumacher2013-08-281-0/+37
| | |/
| * / Change build and run instructions to use assembliesMatei Zaharia2013-08-291-1/+1
| |/
| * Implementing SPARK-838: Add DoubleRDDFunctions methods to PySparkAndre Schumacher2013-08-212-1/+168
| * Implementing SPARK-878 for PySpark: adding zip and egg files to context and p...Andre Schumacher2013-08-164-5/+37
| * Fix PySpark unit tests on Python 2.6.Josh Rosen2013-08-141-5/+8
| * Merge pull request #813 from AndreSchumacher/add_files_pysparkMatei Zaharia2013-08-121-1/+6
| |\
| | * Implementing SPARK-865: Add the equivalent of ADD_JARS to PySparkAndre Schumacher2013-08-121-1/+6
| * | Do not inherit master's PYTHONPATH on workers.Josh Rosen2013-07-291-3/+2
| * | SPARK-815. Python parallelize() should split lists before batchingMatei Zaharia2013-07-291-2/+9
| * | Use None instead of empty string as it's slightly smaller/fasterMatei Zaharia2013-07-291-1/+1
| * | Optimize Python foreach() to not return as many objectsMatei Zaharia2013-07-291-1/+5
| * | Optimize Python take() to not compute entire first partitionMatei Zaharia2013-07-291-6/+9
| * | Add Apache license headers and LICENSE and NOTICE filesMatei Zaharia2013-07-1611-0/+187
* | | PySpark: replacing class manifest by class tag for Scala 2.10.2 inside rdd.pyAndre Schumacher2013-08-301-2/+2
|/ /