spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
...
*	[SPARK-7916] [MLLIB] MLlib Python doc parity check for classification and reg...	Yanbo Liang	2015-06-16	2	-107/+247
*	[SPARK-6411] [SQL] [PySpark] support date/datetime with timezone in Python	Davies Liu	2015-06-11	2	-9/+50
*	[SPARK-8189] [SQL] use Long for TimestampType in SQL	Davies Liu	2015-06-10	1	-0/+11
*	[SPARK-7886] Add built-in expressions to FunctionRegistry.	Reynold Xin	2015-06-09	1	-1/+1
*	[SPARK-7990][SQL] Add methods to facilitate equi-join on multiple joining keys	Liang-Chi Hsieh	2015-06-08	1	-13/+32
*	[SPARK-2808] [STREAMING] [KAFKA] cleanup tests from	cody koeninger	2015-06-07	1	-5/+0
*	[SPARK-8146] DataFrame Python API: Alias replace in df.na	Reynold Xin	2015-06-07	2	-26/+22
*	[SPARK-7639] [PYSPARK] [MLLIB] Python API for KernelDensity	MechCoder	2015-06-06	2	-1/+63
*	[SPARK-7991] [PySpark] Adding support for passing lists to describe.	amey	2015-06-05	1	-0/+12
*	[SPARK-8116][PYSPARK] Allow sc.range() to take a single argument.	Ted Blackman	2015-06-04	1	-2/+12
*	[SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe...	Mike Dusenberry	2015-06-04	1	-3/+18
*	Update documentation for [SPARK-7980] [SQL] Support SQLContext.range(end)	Reynold Xin	2015-06-03	1	-0/+2
*	[SPARK-7980] [SQL] Support SQLContext.range(end)	animesh	2015-06-03	2	-2/+12
*	[SPARK-8060] Improve DataFrame Python test coverage and documentation.	Reynold Xin	2015-06-03	5	-227/+176
*	[SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust	MechCoder	2015-06-02	1	-1/+3
*	[SPARK-8038] [SQL] [PYSPARK] fix Column.when() and otherwise()	Davies Liu	2015-06-02	1	-3/+28
*	[SPARK-7432] [MLLIB] fix flaky CrossValidator doctest	Xiangrui Meng	2015-06-02	1	-10/+9
*	[SPARK-8021] [SQL] [PYSPARK] make Python read/write API consistent with Scala	Davies Liu	2015-06-02	1	-27/+94
*	[minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem...	Reynold Xin	2015-06-01	1	-0/+3
*	[SPARK-7497] [PYSPARK] [STREAMING] fix streaming flaky tests	Davies Liu	2015-06-01	1	-8/+8
*	[SPARK-7978] [SQL] [PYSPARK] DecimalType should not be singleton	Davies Liu	2015-05-31	2	-2/+25
*	[SPARK-7918] [MLLIB] MLlib Python doc parity check for evaluation and feature	Yanbo Liang	2015-05-30	2	-39/+36
*	[SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflict	Michael Nazario	2015-05-29	5	-20/+4
*	[SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute...	Xiangrui Meng	2015-05-29	1	-25/+33
*	[SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel	Xiangrui Meng	2015-05-28	2	-3/+32
*	[MINOR] fix RegressionEvaluator doc	Xiangrui Meng	2015-05-28	1	-1/+1
*	[SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correct	linweizhong	2015-05-26	1	-4/+4
*	[SPARK-7833] [ML] Add python wrapper for RegressionEvaluator	Ram Sriharsha	2015-05-24	1	-2/+66
*	[SPARK-7840] add insertInto() to Writer	Davies Liu	2015-05-23	2	-8/+16
*	[SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...	Davies Liu	2015-05-23	8	-56/+365
*	[SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4	Xiangrui Meng	2015-05-21	4	-61/+64
*	[SPARK-7794] [MLLIB] update RegexTokenizer default settings	Xiangrui Meng	2015-05-21	1	-21/+19
*	[SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in Python	Davies Liu	2015-05-21	1	-2/+46
*	[SPARK-7711] Add a startTime property to match the corresponding one in Scala	Holden Karau	2015-05-21	2	-0/+9
*	[SPARK-7394][SQL] Add Pandas style cast (astype)	kaka1992	2015-05-21	1	-0/+2
*	[SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutative	Sean Owen	2015-05-21	1	-2/+10
*	[SPARK-7606] [SQL] [PySpark] add version to Python SQL API docs	Davies Liu	2015-05-20	7	-18/+170
*	[SPARK-7762] [MLLIB] set default value for outputCol	Xiangrui Meng	2015-05-20	2	-2/+3
*	[SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 ...	Holden Karau	2015-05-20	8	-64/+96
*	[SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlib	Yanbo Liang	2015-05-20	1	-0/+117
*	[SPARK-7738] [SQL] [PySpark] add reader and writer API in Python	Davies Liu	2015-05-19	5	-90/+421
*	[SPARK-7150] SparkContext.range() and SQLContext.range()	Daoyuan Wang	2015-05-18	4	-0/+46
*	[SPARK-6216] [PYSPARK] check python version of worker with driver	Davies Liu	2015-05-18	6	-12/+16
*	[SPARK-7380] [MLLIB] pipeline stages should be copyable in Python	Xiangrui Meng	2015-05-18	13	-254/+490
*	[SPARK-6657] [PYSPARK] Fix doc warnings	Xiangrui Meng	2015-05-18	4	-10/+11
*	[SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files	Davies Liu	2015-05-15	5	-449/+550
*	[SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in Python	Davies Liu	2015-05-15	1	-30/+46
*	[SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on...	FlytxtRnD	2015-05-15	1	-0/+6
*	[SPARK-6258] [MLLIB] GaussianMixture Python API parity check	Yanbo Liang	2015-05-15	1	-14/+53
*	[SPARK-7548] [SQL] Add explode function for DataFrames	Michael Armbrust	2015-05-14	3	-3/+44