aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* Added doctest and method description in context.pyJyotiska NK2014-05-281-1/+14
* Fix PEP8 violations in Python mllib.Reynold Xin2014-05-258-88/+78
* Python docstring update for sql.py.Reynold Xin2014-05-251-61/+63
* SPARK-1822: Some minor cleanup work on SchemaRDD.count()Reynold Xin2014-05-251-1/+4
* [SPARK-1822] SchemaRDD.count() should use query optimizerKan Zhang2014-05-251-1/+13
* [SPARK-1900 / 1918] PySpark on YARN is brokenAndrew Or2014-05-241-2/+6
* [SPARK-1519] Support minPartitions param of wholeTextFiles() in PySparkKan Zhang2014-05-211-2/+10
* [SPARK-1808] Route bin/pyspark through Spark submitAndrew Or2014-05-162-5/+7
* Documentation: Encourage use of reduceByKey instead of groupByKey.Patrick Wendell2014-05-141-0/+4
* [FIX] do not load defaults when testing SparkConf in pysparkXiangrui Meng2014-05-141-1/+1
* [SQL] Make it possible to create Java/Python SQLContexts from an existing Sca...Michael Armbrust2014-05-131-2/+5
* [SPARK-1690] Tolerating empty elements when saving Python RDD to text filesKan Zhang2014-05-101-0/+8
* Add Python includes to path before depickling broadcast valuesBouke van der Bijl2014-05-101-7/+7
* [SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pysparkXiangrui Meng2014-05-072-2/+178
* SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptionsAaron Davidson2014-05-072-2/+14
* [SPARK-1460] Returning SchemaRDD instead of normal RDD on Set operations...Kan Zhang2014-05-071-0/+29
* SPARK-1637: Clean up examples for 1.0Sandeep2014-05-0610-574/+0
* [SPARK-1549] Add Python support to spark-submitMatei Zaharia2014-05-063-58/+168
* [SPARK-1594][MLLIB] Cleaning up MLlib APIs and guideXiangrui Meng2014-05-051-2/+2
* SPARK-1004. PySpark on YARNSandy Ryza2014-04-295-11/+33
* [SPARK-1674] fix interrupted system call error in pyspark's RDD.pipeXiangrui Meng2014-04-291-3/+3
* Minor fix to python table caching API.Michael Armbrust2014-04-291-2/+2
* SPARK-1242 Add aggregate to python rddHolden Karau2014-04-241-2/+29
* [SPARK-986]: Job cancelation for PySparkAhir Reddy2014-04-241-3/+49
* SPARK-1438 RDD.sample() make seed param optionalArun Ramakrishnan2014-04-242-24/+20
* fix bugs of dot in pythonXusen Yin2014-04-222-5/+5
* [SPARK-1439, SPARK-1440] Generate unified Scaladoc across projects and JavadocsMatei Zaharia2014-04-211-2/+2
* Add insertInto and saveAsTable to Python API.Michael Armbrust2014-04-191-0/+13
* Fixed broken pyspark shell.Reynold Xin2014-04-181-2/+2
* SPARK-1483: Rename minSplits to minPartitions in public APIsCodingCat2014-04-181-3/+3
* FIX: Don't build Hive in assembly unless running Hive tests.Patrick Wendell2014-04-171-1/+3
* [python alternative] pyspark require Python2, failing if system default is Py...AbhishekKr2014-04-161-6/+14
* [SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDsMichael Armbrust2014-04-151-4/+10
* [WIP] SPARK-1430: Support sparse data in Python MLlibMatei Zaharia2014-04-1512-139/+1178
* SPARK-1426: Make MLlib work with NumPy versions older than 1.7Sandeep2014-04-152-8/+9
* SPARK-1374: PySpark API for SparkSQLAhir Reddy2014-04-154-1/+388
* Set spark.executor.uri from environment variable (needed by Mesos)Ivan Wick2014-04-101-0/+3
* SPARK-1428: MLlib should convert non-float64 NumPy arrays to float64 instead ...Sandeep2014-04-101-4/+14
* Spark 1271: Co-Group and Group-By should pass Iterable[X]Holden Karau2014-04-083-7/+41
* SPARK-1099: Introduce local[*] mode to infer number of coresAaron Davidson2014-04-071-1/+1
* SPARK-1421. Make MLlib work on Python 2.6Matei Zaharia2014-04-052-6/+11
* SPARK-1305: Support persisting RDD's directly to TachyonHaoyuan Li2014-04-043-16/+22
* SPARK-1414. Python API for SparkContext.wholeTextFilesMatei Zaharia2014-04-042-3/+43
* Spark 1162 Implemented takeOrdered in pyspark.Prashant Sharma2014-04-031-5/+102
* [SPARK-1212, Part II] Support sparse data in MLlibXiangrui Meng2014-04-021-5/+7
* SPARK-1336 Reducing the output of run-tests script.Prashant Sharma2014-03-291-7/+12
* SPARK-1322, top in pyspark should sort result in descending order.Prashant Sharma2014-03-261-3/+3
* Added doctest for map function in rdd.pyJyotiska NK2014-03-191-0/+4
* Spark 1246 add min max to stat counterDan McClary2014-03-182-3/+41
* SPARK-1240: handle the case of empty RDD when takeSampleCodingCat2014-03-161-0/+4