path: root/python
Commit message | Author | Age | Files | Lines
* [SPARK-2079] Support batching when serializing SchemaRDD to Python | Kan Zhang | 2014-06-14 | 1 | -1/+3
* SPARK-1939 Refactor takeSample method in RDD to use ScaSRS | Doris Xin | 2014-06-12 | 1 | -61/+106
* SPARK-554. Add aggregateByKey. | Sandy Ryza | 2014-06-12 | 2 | -1/+33
* fixed typo in docstring for min() | Jeff Thompson | 2014-06-12 | 1 | -1/+1
* HOTFIX: PySpark tests should be order insensitive. | Patrick Wendell | 2014-06-11 | 1 | -4/+4
* HOTFIX: A few PySpark tests were not actually run | Andrew Or | 2014-06-11 | 1 | -1/+4
* [SPARK-2091][MLLIB] use numpy.dot instead of ndarray.dot | Xiangrui Meng | 2014-06-11 | 1 | -3/+5
* HOTFIX: Fix Python tests on Jenkins. | Patrick Wendell | 2014-06-10 | 1 | -4/+7
* SPARK-1416: PySpark support for SequenceFile and Hadoop InputFormats | Nick Pentreath | 2014-06-09 | 2 | -0/+282
* [SPARK-1308] Add getNumPartitions to pyspark RDD | Syed Hashmi | 2014-06-09 | 1 | -18/+27
* [SPARK-1752][MLLIB] Standardize text format for vectors and labeled points | Xiangrui Meng | 2014-06-04 | 4 | -51/+129
* [SPARK-1161] Add saveAsPickleFile and SparkContext.pickleFile in Python | Kan Zhang | 2014-06-03 | 2 | -8/+39
* [SPARK-1468] Modify the partition function used by partitionBy. | Erik Selin | 2014-06-03 | 1 | -1/+4
* [SPARK-1942] Stop clearing spark.driver.port in unit tests | Syed Hashmi | 2014-06-03 | 1 | -4/+0
* SPARK-1917: fix PySpark import of scipy.special functions | Uri Laserson | 2014-05-31 | 2 | -1/+25
* SPARK-1839: PySpark RDD#take() shouldn't always read from driver | Aaron Davidson | 2014-05-31 | 2 | -21/+64
* Added doctest and method description in context.py | Jyotiska NK | 2014-05-28 | 1 | -1/+14
* Fix PEP8 violations in Python mllib. | Reynold Xin | 2014-05-25 | 8 | -88/+78
* Python docstring update for sql.py. | Reynold Xin | 2014-05-25 | 1 | -61/+63
* SPARK-1822: Some minor cleanup work on SchemaRDD.count() | Reynold Xin | 2014-05-25 | 1 | -1/+4
* [SPARK-1822] SchemaRDD.count() should use query optimizer | Kan Zhang | 2014-05-25 | 1 | -1/+13
* [SPARK-1900 / 1918] PySpark on YARN is broken | Andrew Or | 2014-05-24 | 1 | -2/+6
* [SPARK-1519] Support minPartitions param of wholeTextFiles() in PySpark | Kan Zhang | 2014-05-21 | 1 | -2/+10
* [SPARK-1808] Route bin/pyspark through Spark submit | Andrew Or | 2014-05-16 | 2 | -5/+7
* Documentation: Encourage use of reduceByKey instead of groupByKey. | Patrick Wendell | 2014-05-14 | 1 | -0/+4
* [FIX] do not load defaults when testing SparkConf in pyspark | Xiangrui Meng | 2014-05-14 | 1 | -1/+1
* [SQL] Make it possible to create Java/Python SQLContexts from an existing Sca... | Michael Armbrust | 2014-05-13 | 1 | -2/+5
* [SPARK-1690] Tolerating empty elements when saving Python RDD to text files | Kan Zhang | 2014-05-10 | 1 | -0/+8
* Add Python includes to path before depickling broadcast values | Bouke van der Bijl | 2014-05-10 | 1 | -7/+7
* [SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark | Xiangrui Meng | 2014-05-07 | 2 | -2/+178
* SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptions | Aaron Davidson | 2014-05-07 | 2 | -2/+14
* [SPARK-1460] Returning SchemaRDD instead of normal RDD on Set operations... | Kan Zhang | 2014-05-07 | 1 | -0/+29
* SPARK-1637: Clean up examples for 1.0 | Sandeep | 2014-05-06 | 10 | -574/+0
* [SPARK-1549] Add Python support to spark-submit | Matei Zaharia | 2014-05-06 | 3 | -58/+168
* [SPARK-1594][MLLIB] Cleaning up MLlib APIs and guide | Xiangrui Meng | 2014-05-05 | 1 | -2/+2
* SPARK-1004. PySpark on YARN | Sandy Ryza | 2014-04-29 | 5 | -11/+33
* [SPARK-1674] fix interrupted system call error in pyspark's RDD.pipe | Xiangrui Meng | 2014-04-29 | 1 | -3/+3
* Minor fix to python table caching API. | Michael Armbrust | 2014-04-29 | 1 | -2/+2
* SPARK-1242 Add aggregate to python rdd | Holden Karau | 2014-04-24 | 1 | -2/+29
* [SPARK-986]: Job cancelation for PySpark | Ahir Reddy | 2014-04-24 | 1 | -3/+49
* SPARK-1438 RDD.sample() make seed param optional | Arun Ramakrishnan | 2014-04-24 | 2 | -24/+20
* fix bugs of dot in python | Xusen Yin | 2014-04-22 | 2 | -5/+5
* [SPARK-1439, SPARK-1440] Generate unified Scaladoc across projects and Javadocs | Matei Zaharia | 2014-04-21 | 1 | -2/+2
* Add insertInto and saveAsTable to Python API. | Michael Armbrust | 2014-04-19 | 1 | -0/+13
* Fixed broken pyspark shell. | Reynold Xin | 2014-04-18 | 1 | -2/+2
* SPARK-1483: Rename minSplits to minPartitions in public APIs | CodingCat | 2014-04-18 | 1 | -3/+3
* FIX: Don't build Hive in assembly unless running Hive tests. | Patrick Wendell | 2014-04-17 | 1 | -1/+3
* [python alternative] pyspark require Python2, failing if system default is Py... | AbhishekKr | 2014-04-16 | 1 | -6/+14
* [SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDs | Michael Armbrust | 2014-04-15 | 1 | -4/+10
* [WIP] SPARK-1430: Support sparse data in Python MLlib | Matei Zaharia | 2014-04-15 | 12 | -139/+1178