spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	[SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute...	Xiangrui Meng	2015-05-29	1	-25/+33
*	[SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel	Xiangrui Meng	2015-05-28	2	-3/+32
*	[MINOR] fix RegressionEvaluator doc	Xiangrui Meng	2015-05-28	1	-1/+1
*	[SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correct	linweizhong	2015-05-26	1	-4/+4
*	[SPARK-7833] [ML] Add python wrapper for RegressionEvaluator	Ram Sriharsha	2015-05-24	1	-2/+66
*	[SPARK-7840] add insertInto() to Writer	Davies Liu	2015-05-23	2	-8/+16
*	[SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...	Davies Liu	2015-05-23	8	-56/+365
*	[SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4	Xiangrui Meng	2015-05-21	4	-61/+64
*	[SPARK-7794] [MLLIB] update RegexTokenizer default settings	Xiangrui Meng	2015-05-21	1	-21/+19
*	[SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in Python	Davies Liu	2015-05-21	1	-2/+46
*	[SPARK-7711] Add a startTime property to match the corresponding one in Scala	Holden Karau	2015-05-21	2	-0/+9
*	[SPARK-7394][SQL] Add Pandas style cast (astype)	kaka1992	2015-05-21	1	-0/+2
*	[SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutative	Sean Owen	2015-05-21	1	-2/+10
*	[SPARK-7606] [SQL] [PySpark] add version to Python SQL API docs	Davies Liu	2015-05-20	7	-18/+170
*	[SPARK-7762] [MLLIB] set default value for outputCol	Xiangrui Meng	2015-05-20	2	-2/+3
*	[SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 ...	Holden Karau	2015-05-20	8	-64/+96
*	[SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlib	Yanbo Liang	2015-05-20	1	-0/+117
*	[SPARK-7738] [SQL] [PySpark] add reader and writer API in Python	Davies Liu	2015-05-19	5	-90/+421
*	[SPARK-7150] SparkContext.range() and SQLContext.range()	Daoyuan Wang	2015-05-18	4	-0/+46
*	[SPARK-6216] [PYSPARK] check python version of worker with driver	Davies Liu	2015-05-18	6	-12/+16
*	[SPARK-7380] [MLLIB] pipeline stages should be copyable in Python	Xiangrui Meng	2015-05-18	13	-254/+490
*	[SPARK-6657] [PYSPARK] Fix doc warnings	Xiangrui Meng	2015-05-18	4	-10/+11
*	[SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files	Davies Liu	2015-05-15	6	-449/+552
*	[SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in Python	Davies Liu	2015-05-15	1	-30/+46
*	[SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on...	FlytxtRnD	2015-05-15	1	-0/+6
*	[SPARK-6258] [MLLIB] GaussianMixture Python API parity check	Yanbo Liang	2015-05-15	1	-14/+53
*	[SPARK-7548] [SQL] Add explode function for DataFrames	Michael Armbrust	2015-05-14	3	-3/+44
*	[SPARK-7619] [PYTHON] fix docstring signature	Xiangrui Meng	2015-05-14	5	-55/+52
*	[SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.ml	Xiangrui Meng	2015-05-14	3	-1/+43
*	[SPARK-7278] [PySpark] DateType should find datetime.datetime acceptable	ksonj	2015-05-14	1	-1/+1
*	[SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification	Burak Yavuz	2015-05-13	3	-10/+501
*	[SPARK-7593] [ML] Python Api for ml.feature.Bucketizer	Burak Yavuz	2015-05-13	1	-0/+77
*	[SPARK-7321][SQL] Add Column expression for conditional statements (when/othe...	Reynold Xin	2015-05-12	3	-2/+57
*	[SPARK-7572] [MLLIB] do not import Param/Params under pyspark.ml	Xiangrui Meng	2015-05-12	3	-7/+11
*	[SPARK-7487] [ML] Feature Parity in PySpark for ml.regression	Burak Yavuz	2015-05-12	6	-8/+709
*	[SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark	Daoyuan Wang	2015-05-12	2	-0/+133
*	[SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.	Reynold Xin	2015-05-11	1	-1/+13
*	[SPARK-7324] [SQL] DataFrame.dropDuplicates	Reynold Xin	2015-05-11	1	-2/+34
*	[SPARK-7462][SQL] Update documentation for retaining grouping columns in Data...	Reynold Xin	2015-05-11	1	-0/+2
*	[SPARK-7462] By default retain group by columns in aggregate	Reynold Xin	2015-05-11	1	-1/+1
*	[SPARK-6092] [MLLIB] Add RankingMetrics in PySpark/MLlib	Yanbo Liang	2015-05-11	1	-2/+76
*	[SPARK-7427] [PYSPARK] Make sharedParams match in Scala, Python	Glenn Weidner	2015-05-10	3	-21/+19
*	[SPARK-7431] [ML] [PYTHON] Made CrossValidatorModel call parent init in PySpark	Joseph K. Bradley	2015-05-10	3	-3/+4
*	[SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlib	Yanbo Liang	2015-05-10	1	-0/+129
*	[SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct	Vinod K C	2015-05-09	2	-3/+0
*	[SPARK-7488] [ML] Feature Parity in PySpark for ml.recommendation	Burak Yavuz	2015-05-08	3	-0/+310
*	[SPARK-5913] [MLLIB] Python API for ChiSqSelector	Yanbo Liang	2015-05-08	1	-2/+57
*	[SPARK-7133] [SQL] Implement struct, array, and map field accessor	Wenchen Fan	2015-05-08	2	-12/+19
*	[SPARK-7474] [MLLIB] update ParamGridBuilder doctest	Xiangrui Meng	2015-05-08	1	-15/+13
*	[SPARK-7383] [ML] Feature Parity in PySpark for ml.features	Burak Yavuz	2015-05-08	3	-41/+849