spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	[SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.	Reynold Xin	2015-05-11	1	-1/+13
*	[SPARK-7324] [SQL] DataFrame.dropDuplicates	Reynold Xin	2015-05-11	1	-2/+34
*	[SPARK-7462][SQL] Update documentation for retaining grouping columns in Data...	Reynold Xin	2015-05-11	1	-0/+2
*	[SPARK-7462] By default retain group by columns in aggregate	Reynold Xin	2015-05-11	1	-1/+1
*	[SPARK-7133] [SQL] Implement struct, array, and map field accessor	Wenchen Fan	2015-05-08	2	-12/+19
*	[SPARK-7118] [Python] Add the coalesce Spark SQL function available in PySpark	Olivier Girardot	2015-05-07	1	-0/+37
*	[SPARK-7295][SQL] bitwise operations for DataFrame DSL	Shiti	2015-05-07	3	-0/+20
*	[SPARK-7358][SQL] Move DataFrame mathfunctions into functions	Burak Yavuz	2015-05-05	3	-102/+53
*	[SPARK-7294][SQL] ADD BETWEEN	云峤	2015-05-05	2	-0/+15
*	[SPARK-7333] [MLLIB] Add BinaryClassificationEvaluator to PySpark	Xiangrui Meng	2015-05-05	1	-1/+1
*	[SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames	Burak Yavuz	2015-05-05	1	-4/+5
*	[SPARK-7243][SQL] Contingency Tables for DataFrames	Burak Yavuz	2015-05-04	2	-0/+34
*	[SPARK-7319][SQL] Improve the output from DataFrame.show()	云峤	2015-05-04	1	-36/+69
*	[SPARK-7241] Pearson correlation for DataFrames	Burak Yavuz	2015-05-03	2	-0/+32
*	[SPARK-3444] Fix typo in Dataframes.py introduced in []	Dean Chen	2015-05-02	1	-1/+1
*	[SPARK-7242] added python api for freqItems in DataFrames	Burak Yavuz	2015-05-01	2	-0/+32
*	[SPARK-3444] Provide an easy way to change log level	Holden Karau	2015-05-01	1	-1/+1
*	[SPARK-7240][SQL] Single pass covariance calculation for dataframes	Burak Yavuz	2015-05-01	3	-2/+43
*	[SPARK-7274] [SQL] Create Column expression for array/struct creation.	Reynold Xin	2015-05-01	1	-19/+61
*	[SPARK-7248] implemented random number generators for DataFrames	Burak Yavuz	2015-04-30	2	-1/+34
*	[SPARK-7156][SQL] Addressed follow up comments for randomSplit	Burak Yavuz	2015-04-29	1	-1/+6
*	[SPARK-7156][SQL] support RandomSplit in DataFrames	Burak Yavuz	2015-04-29	1	-1/+17
*	Better error message on access to non-existing attribute	ksonj	2015-04-29	1	-1/+2
*	[SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations	Patrick Wendell	2015-04-29	1	-1/+2
*	[SPARK-7188] added python support for math DataFrame functions	Burak Yavuz	2015-04-29	3	-1/+131
*	[SPARK-7135][SQL] DataFrame expression for monotonically increasing IDs.	Reynold Xin	2015-04-28	1	-1/+21
*	[SPARK-7152][SQL] Add a Column expression for partition ID.	Reynold Xin	2015-04-26	1	-9/+21
*	[SPARK-7060][SQL] Add alias function to python dataframe	Yin Huai	2015-04-23	1	-0/+14
*	[SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.	Reynold Xin	2015-04-22	1	-1/+8
*	[SPARK-6953] [PySpark] speed up python tests	Reynold Xin	2015-04-21	1	-2/+2
*	[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression	Davies Liu	2015-04-21	4	-23/+46
*	[SPARK-6661] Python type errors should print type, not object	Elisey Zanko	2015-04-20	3	-11/+11
*	Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort.	Reynold Xin	2015-04-17	1	-3/+4
*	[SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandas	Davies Liu	2015-04-17	3	-39/+70
*	[SPARK-6911] [SQL] improve accessor for nested types	Davies Liu	2015-04-16	2	-5/+62
*	[SPARK-4897] [PySpark] Python 3 support	Davies Liu	2015-04-16	6	-56/+120
*	[SPARK-6638] [SQL] Improve performance of StringType in SQL	Davies Liu	2015-04-15	1	-5/+5
*	[SPARK-6677] [SQL] [PySpark] fix cached classes	Davies Liu	2015-04-11	1	-19/+20
*	[SPARK-6696] [SQL] Adds HiveContext.refreshTable to PySpark	Cheng Lian	2015-04-08	1	-0/+9
*	[SPARK-6781] [SQL] use sqlContext in python shell	Davies Liu	2015-04-08	4	-46/+45
*	[SPARK-6553] [pyspark] Support functools.partial as UDF	ksonj	2015-04-01	2	-1/+33
*	[Doc] Improve Python DataFrame documentation	Reynold Xin	2015-03-31	5	-390/+250
*	[SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.	Reynold Xin	2015-03-31	2	-6/+45
*	[SPARK-6119][SQL] DataFrame support for missing data handling	Reynold Xin	2015-03-30	2	-0/+182
*	[SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() a...	Davies Liu	2015-03-30	1	-27/+60
*	[DOC] Improvements to Python docs.	Reynold Xin	2015-03-28	2	-14/+9
*	[SPARK-6117] [SQL] Improvements to DataFrame.describe()	Reynold Xin	2015-03-26	1	-0/+19
*	[SPARK-6536] [PySpark] Column.inSet() in Python	Davies Liu	2015-03-26	1	-0/+17
*	[SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTab...	Yin Huai	2015-03-18	1	-2/+2
*	[SPARK-6210] [SQL] use prettyString as column name in agg()	Davies Liu	2015-03-14	1	-16/+16