aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark
Commit message (Expand)AuthorAgeFilesLines
* Fix PEP8 violations in Python mllib.Reynold Xin2014-05-258-88/+78
* Python docstring update for sql.py.Reynold Xin2014-05-251-61/+63
* SPARK-1822: Some minor cleanup work on SchemaRDD.count()Reynold Xin2014-05-251-1/+4
* [SPARK-1822] SchemaRDD.count() should use query optimizerKan Zhang2014-05-251-1/+13
* [SPARK-1900 / 1918] PySpark on YARN is brokenAndrew Or2014-05-241-2/+6
* [SPARK-1519] Support minPartitions param of wholeTextFiles() in PySparkKan Zhang2014-05-211-2/+10
* [SPARK-1808] Route bin/pyspark through Spark submitAndrew Or2014-05-162-5/+7
* Documentation: Encourage use of reduceByKey instead of groupByKey.Patrick Wendell2014-05-141-0/+4
* [FIX] do not load defaults when testing SparkConf in pysparkXiangrui Meng2014-05-141-1/+1
* [SQL] Make it possible to create Java/Python SQLContexts from an existing Sca...Michael Armbrust2014-05-131-2/+5
* [SPARK-1690] Tolerating empty elements when saving Python RDD to text filesKan Zhang2014-05-101-0/+8
* Add Python includes to path before depickling broadcast valuesBouke van der Bijl2014-05-101-7/+7
* [SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pysparkXiangrui Meng2014-05-072-2/+178
* SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptionsAaron Davidson2014-05-072-2/+14
* [SPARK-1460] Returning SchemaRDD instead of normal RDD on Set operations...Kan Zhang2014-05-071-0/+29
* [SPARK-1549] Add Python support to spark-submitMatei Zaharia2014-05-063-58/+168
* [SPARK-1594][MLLIB] Cleaning up MLlib APIs and guideXiangrui Meng2014-05-051-2/+2
* SPARK-1004. PySpark on YARNSandy Ryza2014-04-293-10/+30
* [SPARK-1674] fix interrupted system call error in pyspark's RDD.pipeXiangrui Meng2014-04-291-3/+3
* Minor fix to python table caching API.Michael Armbrust2014-04-291-2/+2
* SPARK-1242 Add aggregate to python rddHolden Karau2014-04-241-2/+29
* [SPARK-986]: Job cancelation for PySparkAhir Reddy2014-04-241-3/+49
* SPARK-1438 RDD.sample() make seed param optionalArun Ramakrishnan2014-04-242-24/+20
* fix bugs of dot in pythonXusen Yin2014-04-222-5/+5
* Add insertInto and saveAsTable to Python API.Michael Armbrust2014-04-191-0/+13
* Fixed broken pyspark shell.Reynold Xin2014-04-181-2/+2
* SPARK-1483: Rename minSplits to minPartitions in public APIsCodingCat2014-04-181-3/+3
* [python alternative] pyspark require Python2, failing if system default is Py...AbhishekKr2014-04-161-6/+14
* [SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDsMichael Armbrust2014-04-151-4/+10
* [WIP] SPARK-1430: Support sparse data in Python MLlibMatei Zaharia2014-04-156-131/+1066
* SPARK-1426: Make MLlib work with NumPy versions older than 1.7Sandeep2014-04-152-8/+9
* SPARK-1374: PySpark API for SparkSQLAhir Reddy2014-04-153-1/+384
* Set spark.executor.uri from environment variable (needed by Mesos)Ivan Wick2014-04-101-0/+3
* SPARK-1428: MLlib should convert non-float64 NumPy arrays to float64 instead ...Sandeep2014-04-101-4/+14
* Spark 1271: Co-Group and Group-By should pass Iterable[X]Holden Karau2014-04-083-7/+41
* SPARK-1099: Introduce local[*] mode to infer number of coresAaron Davidson2014-04-071-1/+1
* SPARK-1421. Make MLlib work on Python 2.6Matei Zaharia2014-04-052-6/+11
* SPARK-1305: Support persisting RDD's directly to TachyonHaoyuan Li2014-04-043-16/+22
* SPARK-1414. Python API for SparkContext.wholeTextFilesMatei Zaharia2014-04-042-3/+43
* Spark 1162 Implemented takeOrdered in pyspark.Prashant Sharma2014-04-031-5/+102
* [SPARK-1212, Part II] Support sparse data in MLlibXiangrui Meng2014-04-021-5/+7
* SPARK-1322, top in pyspark should sort result in descending order.Prashant Sharma2014-03-261-3/+3
* Added doctest for map function in rdd.pyJyotiska NK2014-03-191-0/+4
* Spark 1246 add min max to stat counterDan McClary2014-03-182-3/+41
* SPARK-1240: handle the case of empty RDD when takeSampleCodingCat2014-03-161-0/+4
* SPARK-1162 Added top in python.Prashant Sharma2014-03-121-0/+25
* Spark-1163, Added missing Python RDD functionsprabinb2014-03-112-0/+46
* SPARK-1168, Added foldByKey to pyspark.Prashant Sharma2014-03-101-0/+14
* [SPARK-972] Added detailed callsite info for ValueError in context.py (resubm...jyotiska2014-03-102-8/+29
* SPARK-977 Added Python RDD.zip functionPrabin Banka2014-03-102-2/+47