aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* Merge branch 'master' of git://github.com/mesos/spark into scala-2.10Prashant Sharma2013-09-155-1/+78
|\
| * Whoopsy daisyAaron Davidson2013-09-081-1/+0
| * Export StorageLevel and refactorAaron Davidson2013-09-075-26/+62
| * Remove reflection, hard-code StorageLevelsAaron Davidson2013-09-072-24/+26
| * Memoize StorageLevels read from JVMAaron Davidson2013-09-061-2/+9
| * SPARK-660: Add StorageLevel support in PythonAaron Davidson2013-09-053-1/+34
* | Merged with masterPrashant Sharma2013-09-0625-98/+948
|\|
| * Add missing license headers found with RATMatei Zaharia2013-09-021-1/+18
| * Exclude some private modules in epydocMatei Zaharia2013-09-021-0/+1
| * Further fixes to get PySpark to work on WindowsMatei Zaharia2013-09-021-5/+12
| * Allow PySpark to launch worker.py directly on WindowsMatei Zaharia2013-09-011-4/+7
| * Move some classes to more appropriate packages:Matei Zaharia2013-09-011-2/+2
| * Add banner to PySpark and make wordcount output nicerMatei Zaharia2013-09-012-1/+14
| * Initial work to rename package to org.apache.sparkMatei Zaharia2013-09-013-5/+5
| * Merge pull request #861 from AndreSchumacher/pyspark_sampling_functionMatei Zaharia2013-08-312-7/+167
| |\
| | * RDD sample() and takeSample() prototypes for PySparkAndre Schumacher2013-08-282-7/+167
| * | Merge pull request #870 from JoshRosen/spark-885Matei Zaharia2013-08-311-1/+5
| |\ \
| | * | Don't send SIGINT to Py4J gateway subprocess.Josh Rosen2013-08-281-1/+5
| | |/
| * | Merge pull request #869 from AndreSchumacher/subtractMatei Zaharia2013-08-301-0/+37
| |\ \
| | * | PySpark: implementing subtractByKey(), subtract() and keyBy()Andre Schumacher2013-08-281-0/+37
| | |/
| * | Fix PySpark for assembly run and include it in distMatei Zaharia2013-08-291-0/+0
| * | Change build and run instructions to use assembliesMatei Zaharia2013-08-291-1/+1
| |/
| * Implementing SPARK-838: Add DoubleRDDFunctions methods to PySparkAndre Schumacher2013-08-212-1/+168
| * Implementing SPARK-878 for PySpark: adding zip and egg files to context and p...Andre Schumacher2013-08-165-5/+37
| * Fix PySpark unit tests on Python 2.6.Josh Rosen2013-08-142-19/+20
| * Merge pull request #802 from stayhf/SPARK-760-PythonMatei Zaharia2013-08-121-0/+70
| |\
| | * Code update for Matei's suggestionsstayhf2013-08-111-7/+9
| | * Simple PageRank algorithm implementation in Python for SPARK-760stayhf2013-08-101-0/+68
| * | Merge pull request #813 from AndreSchumacher/add_files_pysparkMatei Zaharia2013-08-121-1/+6
| |\ \
| | * | Implementing SPARK-865: Add the equivalent of ADD_JARS to PySparkAndre Schumacher2013-08-121-1/+6
| * | | Merge pull request #747 from mateiz/improved-lrMatei Zaharia2013-08-061-27/+26
| |\ \ \
| | * | | Fix string parsing and style in LRMatei Zaharia2013-07-311-1/+1
| | * | | Update the Python logistic regression example to read from a file andMatei Zaharia2013-07-291-27/+26
| * | | | Do not inherit master's PYTHONPATH on workers.Josh Rosen2013-07-291-3/+2
| |/ / /
| * | | Merge branch 'master' of github.com:mesos/sparkMatei Zaharia2013-07-296-15/+9
| |\ \ \
| | * | | Some fixes to Python examples (style and package name for LR)Matei Zaharia2013-07-276-15/+9
| | | |/ | | |/|
| * | | SPARK-815. Python parallelize() should split lists before batchingMatei Zaharia2013-07-291-2/+9
| * | | Use None instead of empty string as it's slightly smaller/fasterMatei Zaharia2013-07-291-1/+1
| * | | Allow python/run-tests to run from any directoryMatei Zaharia2013-07-291-0/+3
| * | | Optimize Python foreach() to not return as many objectsMatei Zaharia2013-07-291-1/+5
| * | | Optimize Python take() to not compute entire first partitionMatei Zaharia2013-07-291-6/+9
| |/ /
| * | Add Apache license headers and LICENSE and NOTICE filesMatei Zaharia2013-07-1619-1/+325
* | | PySpark: replacing class manifest by class tag for Scala 2.10.2 inside rdd.pyAndre Schumacher2013-08-301-2/+2
|/ /
* | Fixed PySpark perf regression by not using socket.makefile(), and improvedroot2013-07-011-18/+24
* | Fix reporting of PySpark exceptionsJey Kottalam2013-06-212-5/+19
* | PySpark daemon: fix deadlock, improve error handlingJey Kottalam2013-06-211-17/+50
* | Add tests and fixes for Python daemon shutdownJey Kottalam2013-06-213-22/+69
* | Prefork Python worker processesJey Kottalam2013-06-212-32/+138
* | Add Python timing instrumentationJey Kottalam2013-06-212-1/+19
* | Fix Python saveAsTextFile doctest to not expect order to be preservedJey Kottalam2013-04-021-1/+1