spark - Mirror of Apache Spark

	Commit message (Expand)	Author	Age	Files	Lines
*	[SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe...	Mike Dusenberry	2015-06-04	1	-3/+18
*	Update documentation for [SPARK-7980] [SQL] Support SQLContext.range(end)	Reynold Xin	2015-06-03	1	-0/+2
*	[SPARK-7980] [SQL] Support SQLContext.range(end)	animesh	2015-06-03	2	-2/+12
*	[SPARK-8060] Improve DataFrame Python test coverage and documentation.	Reynold Xin	2015-06-03	19	-227/+179
*	[SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust	MechCoder	2015-06-02	1	-1/+3
*	[SPARK-8038] [SQL] [PYSPARK] fix Column.when() and otherwise()	Davies Liu	2015-06-02	1	-3/+28
*	[SPARK-7432] [MLLIB] fix flaky CrossValidator doctest	Xiangrui Meng	2015-06-02	1	-10/+9
*	[SPARK-8021] [SQL] [PYSPARK] make Python read/write API consistent with Scala	Davies Liu	2015-06-02	1	-27/+94
*	[minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem...	Reynold Xin	2015-06-01	1	-0/+3
*	[SPARK-7497] [PYSPARK] [STREAMING] fix streaming flaky tests	Davies Liu	2015-06-01	1	-8/+8
*	[SPARK-7978] [SQL] [PYSPARK] DecimalType should not be singleton	Davies Liu	2015-05-31	2	-2/+25
*	[MINOR] Enable PySpark SQL readerwriter and window tests	Josh Rosen	2015-05-31	1	-0/+2
*	[SPARK-7918] [MLLIB] MLlib Python doc parity check for evaluation and feature	Yanbo Liang	2015-05-30	2	-39/+36
*	[SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflict	Michael Nazario	2015-05-29	6	-58/+42
*	[SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute...	Xiangrui Meng	2015-05-29	1	-25/+33
*	[SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel	Xiangrui Meng	2015-05-28	2	-3/+32
*	[MINOR] fix RegressionEvaluator doc	Xiangrui Meng	2015-05-28	1	-1/+1
*	[SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correct	linweizhong	2015-05-26	1	-4/+4
*	[SPARK-7833] [ML] Add python wrapper for RegressionEvaluator	Ram Sriharsha	2015-05-24	1	-2/+66
*	[SPARK-7840] add insertInto() to Writer	Davies Liu	2015-05-23	2	-8/+16
*	[SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...	Davies Liu	2015-05-23	8	-56/+365
*	[SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4	Xiangrui Meng	2015-05-21	4	-61/+64
*	[SPARK-7794] [MLLIB] update RegexTokenizer default settings	Xiangrui Meng	2015-05-21	1	-21/+19
*	[SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in Python	Davies Liu	2015-05-21	1	-2/+46
*	[SPARK-7711] Add a startTime property to match the corresponding one in Scala	Holden Karau	2015-05-21	2	-0/+9
*	[SPARK-7394][SQL] Add Pandas style cast (astype)	kaka1992	2015-05-21	1	-0/+2
*	[SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutative	Sean Owen	2015-05-21	1	-2/+10
*	[SPARK-7606] [SQL] [PySpark] add version to Python SQL API docs	Davies Liu	2015-05-20	7	-18/+170
*	[SPARK-7762] [MLLIB] set default value for outputCol	Xiangrui Meng	2015-05-20	2	-2/+3
*	[SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 ...	Holden Karau	2015-05-20	8	-64/+96
*	[SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlib	Yanbo Liang	2015-05-20	1	-0/+117
*	[SPARK-7738] [SQL] [PySpark] add reader and writer API in Python	Davies Liu	2015-05-19	5	-90/+421
*	[SPARK-7150] SparkContext.range() and SQLContext.range()	Daoyuan Wang	2015-05-18	4	-0/+46
*	[SPARK-6216] [PYSPARK] check python version of worker with driver	Davies Liu	2015-05-18	6	-12/+16
*	[SPARK-7380] [MLLIB] pipeline stages should be copyable in Python	Xiangrui Meng	2015-05-18	13	-254/+490
*	[SPARK-6657] [PYSPARK] Fix doc warnings	Xiangrui Meng	2015-05-18	4	-10/+11
*	[SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files	Davies Liu	2015-05-15	6	-449/+552
*	[SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in Python	Davies Liu	2015-05-15	1	-30/+46
*	[SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on...	FlytxtRnD	2015-05-15	1	-0/+6
*	[SPARK-6258] [MLLIB] GaussianMixture Python API parity check	Yanbo Liang	2015-05-15	1	-14/+53
*	[SPARK-7548] [SQL] Add explode function for DataFrames	Michael Armbrust	2015-05-14	3	-3/+44
*	[SPARK-7619] [PYTHON] fix docstring signature	Xiangrui Meng	2015-05-14	5	-55/+52
*	[SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.ml	Xiangrui Meng	2015-05-14	3	-1/+43
*	[SPARK-7278] [PySpark] DateType should find datetime.datetime acceptable	ksonj	2015-05-14	1	-1/+1
*	[SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification	Burak Yavuz	2015-05-13	3	-10/+501
*	[SPARK-7593] [ML] Python Api for ml.feature.Bucketizer	Burak Yavuz	2015-05-13	1	-0/+77
*	[SPARK-7321][SQL] Add Column expression for conditional statements (when/othe...	Reynold Xin	2015-05-12	3	-2/+57
*	[SPARK-7572] [MLLIB] do not import Param/Params under pyspark.ml	Xiangrui Meng	2015-05-12	3	-7/+11
*	[SPARK-7487] [ML] Feature Parity in PySpark for ml.regression	Burak Yavuz	2015-05-12	6	-8/+709
*	[SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark	Daoyuan Wang	2015-05-12	2	-0/+133