spark - Mirror of Apache Spark

	Commit message (Expand)	Author	Age	Files	Lines
*	Added doctest and method description in context.py	Jyotiska NK	2014-05-28	1	-1/+14
*	Fix PEP8 violations in Python mllib.	Reynold Xin	2014-05-25	8	-88/+78
*	Python docstring update for sql.py.	Reynold Xin	2014-05-25	1	-61/+63
*	SPARK-1822: Some minor cleanup work on SchemaRDD.count()	Reynold Xin	2014-05-25	1	-1/+4
*	[SPARK-1822] SchemaRDD.count() should use query optimizer	Kan Zhang	2014-05-25	1	-1/+13
*	[SPARK-1900 / 1918] PySpark on YARN is broken	Andrew Or	2014-05-24	1	-2/+6
*	[SPARK-1519] Support minPartitions param of wholeTextFiles() in PySpark	Kan Zhang	2014-05-21	1	-2/+10
*	[SPARK-1808] Route bin/pyspark through Spark submit	Andrew Or	2014-05-16	2	-5/+7
*	Documentation: Encourage use of reduceByKey instead of groupByKey.	Patrick Wendell	2014-05-14	1	-0/+4
*	[FIX] do not load defaults when testing SparkConf in pyspark	Xiangrui Meng	2014-05-14	1	-1/+1
*	[SQL] Make it possible to create Java/Python SQLContexts from an existing Sca...	Michael Armbrust	2014-05-13	1	-2/+5
*	[SPARK-1690] Tolerating empty elements when saving Python RDD to text files	Kan Zhang	2014-05-10	1	-0/+8
*	Add Python includes to path before depickling broadcast values	Bouke van der Bijl	2014-05-10	1	-7/+7
*	[SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark	Xiangrui Meng	2014-05-07	2	-2/+178
*	SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptions	Aaron Davidson	2014-05-07	2	-2/+14
*	[SPARK-1460] Returning SchemaRDD instead of normal RDD on Set operations...	Kan Zhang	2014-05-07	1	-0/+29
*	SPARK-1637: Clean up examples for 1.0	Sandeep	2014-05-06	10	-574/+0
*	[SPARK-1549] Add Python support to spark-submit	Matei Zaharia	2014-05-06	3	-58/+168
*	[SPARK-1594][MLLIB] Cleaning up MLlib APIs and guide	Xiangrui Meng	2014-05-05	1	-2/+2
*	SPARK-1004. PySpark on YARN	Sandy Ryza	2014-04-29	5	-11/+33
*	[SPARK-1674] fix interrupted system call error in pyspark's RDD.pipe	Xiangrui Meng	2014-04-29	1	-3/+3
*	Minor fix to python table caching API.	Michael Armbrust	2014-04-29	1	-2/+2
*	SPARK-1242 Add aggregate to python rdd	Holden Karau	2014-04-24	1	-2/+29
*	[SPARK-986]: Job cancelation for PySpark	Ahir Reddy	2014-04-24	1	-3/+49
*	SPARK-1438 RDD.sample() make seed param optional	Arun Ramakrishnan	2014-04-24	2	-24/+20
*	fix bugs of dot in python	Xusen Yin	2014-04-22	2	-5/+5
*	[SPARK-1439, SPARK-1440] Generate unified Scaladoc across projects and Javadocs	Matei Zaharia	2014-04-21	1	-2/+2
*	Add insertInto and saveAsTable to Python API.	Michael Armbrust	2014-04-19	1	-0/+13
*	Fixed broken pyspark shell.	Reynold Xin	2014-04-18	1	-2/+2
*	SPARK-1483: Rename minSplits to minPartitions in public APIs	CodingCat	2014-04-18	1	-3/+3
*	FIX: Don't build Hive in assembly unless running Hive tests.	Patrick Wendell	2014-04-17	1	-1/+3
*	[python alternative] pyspark require Python2, failing if system default is Py...	AbhishekKr	2014-04-16	1	-6/+14
*	[SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDs	Michael Armbrust	2014-04-15	1	-4/+10
*	[WIP] SPARK-1430: Support sparse data in Python MLlib	Matei Zaharia	2014-04-15	12	-139/+1178
*	SPARK-1426: Make MLlib work with NumPy versions older than 1.7	Sandeep	2014-04-15	2	-8/+9
*	SPARK-1374: PySpark API for SparkSQL	Ahir Reddy	2014-04-15	4	-1/+388
*	Set spark.executor.uri from environment variable (needed by Mesos)	Ivan Wick	2014-04-10	1	-0/+3
*	SPARK-1428: MLlib should convert non-float64 NumPy arrays to float64 instead ...	Sandeep	2014-04-10	1	-4/+14
*	Spark 1271: Co-Group and Group-By should pass Iterable[X]	Holden Karau	2014-04-08	3	-7/+41
*	SPARK-1099: Introduce local[*] mode to infer number of cores	Aaron Davidson	2014-04-07	1	-1/+1
*	SPARK-1421. Make MLlib work on Python 2.6	Matei Zaharia	2014-04-05	2	-6/+11
*	SPARK-1305: Support persisting RDD's directly to Tachyon	Haoyuan Li	2014-04-04	3	-16/+22
*	SPARK-1414. Python API for SparkContext.wholeTextFiles	Matei Zaharia	2014-04-04	2	-3/+43
*	Spark 1162 Implemented takeOrdered in pyspark.	Prashant Sharma	2014-04-03	1	-5/+102
*	[SPARK-1212, Part II] Support sparse data in MLlib	Xiangrui Meng	2014-04-02	1	-5/+7
*	SPARK-1336 Reducing the output of run-tests script.	Prashant Sharma	2014-03-29	1	-7/+12
*	SPARK-1322, top in pyspark should sort result in descending order.	Prashant Sharma	2014-03-26	1	-3/+3
*	Added doctest for map function in rdd.py	Jyotiska NK	2014-03-19	1	-0/+4
*	Spark 1246 add min max to stat counter	Dan McClary	2014-03-18	2	-3/+41
*	SPARK-1240: handle the case of empty RDD when takeSample	CodingCat	2014-03-16	1	-0/+4