aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* Pass self to SparkContext._ensure_initialized.Ewen Cheslack-Postava2013-10-221-1/+10
* Add classmethod to SparkContext to set system properties.Ewen Cheslack-Postava2013-10-221-12/+29
* Add an add() method to pyspark accumulators.Ewen Cheslack-Postava2013-10-191-1/+12
* Fix PySpark docs and an overly long line of code after fdbae41eMatei Zaharia2013-10-091-8/+8
* SPARK-705: implement sortByKey() in PySparkAndre Schumacher2013-10-071-1/+47
* Fixing SPARK-602: PythonPartitionerAndre Schumacher2013-10-042-4/+10
* Update build version in masterPatrick Wendell2013-09-241-1/+1
* Whoopsy daisyAaron Davidson2013-09-081-1/+0
* Export StorageLevel and refactorAaron Davidson2013-09-075-26/+62
* Remove reflection, hard-code StorageLevelsAaron Davidson2013-09-072-24/+26
* Memoize StorageLevels read from JVMAaron Davidson2013-09-061-2/+9
* SPARK-660: Add StorageLevel support in PythonAaron Davidson2013-09-053-1/+34
* Add missing license headers found with RATMatei Zaharia2013-09-021-1/+18
* Exclude some private modules in epydocMatei Zaharia2013-09-021-0/+1
* Further fixes to get PySpark to work on WindowsMatei Zaharia2013-09-021-5/+12
* Allow PySpark to launch worker.py directly on WindowsMatei Zaharia2013-09-011-4/+7
* Move some classes to more appropriate packages:Matei Zaharia2013-09-011-2/+2
* Add banner to PySpark and make wordcount output nicerMatei Zaharia2013-09-012-1/+14
* Initial work to rename package to org.apache.sparkMatei Zaharia2013-09-013-5/+5
* Merge pull request #861 from AndreSchumacher/pyspark_sampling_functionMatei Zaharia2013-08-312-7/+167
|\
| * RDD sample() and takeSample() prototypes for PySparkAndre Schumacher2013-08-282-7/+167
* | Merge pull request #870 from JoshRosen/spark-885Matei Zaharia2013-08-311-1/+5
|\ \
| * | Don't send SIGINT to Py4J gateway subprocess.Josh Rosen2013-08-281-1/+5
| |/
* | Merge pull request #869 from AndreSchumacher/subtractMatei Zaharia2013-08-301-0/+37
|\ \
| * | PySpark: implementing subtractByKey(), subtract() and keyBy()Andre Schumacher2013-08-281-0/+37
| |/
* | Fix PySpark for assembly run and include it in distMatei Zaharia2013-08-291-0/+0
* | Change build and run instructions to use assembliesMatei Zaharia2013-08-291-1/+1
|/
* Implementing SPARK-838: Add DoubleRDDFunctions methods to PySparkAndre Schumacher2013-08-212-1/+168
* Implementing SPARK-878 for PySpark: adding zip and egg files to context and p...Andre Schumacher2013-08-165-5/+37
* Fix PySpark unit tests on Python 2.6.Josh Rosen2013-08-142-19/+20
* Merge pull request #802 from stayhf/SPARK-760-PythonMatei Zaharia2013-08-121-0/+70
|\
| * Code update for Matei's suggestionsstayhf2013-08-111-7/+9
| * Simple PageRank algorithm implementation in Python for SPARK-760stayhf2013-08-101-0/+68
* | Merge pull request #813 from AndreSchumacher/add_files_pysparkMatei Zaharia2013-08-121-1/+6
|\ \
| * | Implementing SPARK-865: Add the equivalent of ADD_JARS to PySparkAndre Schumacher2013-08-121-1/+6
* | | Merge pull request #747 from mateiz/improved-lrMatei Zaharia2013-08-061-27/+26
|\ \ \
| * | | Fix string parsing and style in LRMatei Zaharia2013-07-311-1/+1
| * | | Update the Python logistic regression example to read from a file andMatei Zaharia2013-07-291-27/+26
* | | | Do not inherit master's PYTHONPATH on workers.Josh Rosen2013-07-291-3/+2
|/ / /
* | | Merge branch 'master' of github.com:mesos/sparkMatei Zaharia2013-07-296-15/+9
|\ \ \
| * | | Some fixes to Python examples (style and package name for LR)Matei Zaharia2013-07-276-15/+9
| | |/ | |/|
* | | SPARK-815. Python parallelize() should split lists before batchingMatei Zaharia2013-07-291-2/+9
* | | Use None instead of empty string as it's slightly smaller/fasterMatei Zaharia2013-07-291-1/+1
* | | Allow python/run-tests to run from any directoryMatei Zaharia2013-07-291-0/+3
* | | Optimize Python foreach() to not return as many objectsMatei Zaharia2013-07-291-1/+5
* | | Optimize Python take() to not compute entire first partitionMatei Zaharia2013-07-291-6/+9
|/ /
* | Add Apache license headers and LICENSE and NOTICE filesMatei Zaharia2013-07-1619-1/+325
* | Fixed PySpark perf regression by not using socket.makefile(), and improvedroot2013-07-011-18/+24
* | Fix reporting of PySpark exceptionsJey Kottalam2013-06-212-5/+19
* | PySpark daemon: fix deadlock, improve error handlingJey Kottalam2013-06-211-17/+50