spark - Mirror of Apache Spark

	Commit message (Expand)	Author	Age	Files	Lines
...
*	[SPARK-18766][SQL] Push Down Filter Through BatchEvalPython (Python UDF)	gatorsmile	2016-12-10	1	-0/+9
*	[SPARK-16589] [PYTHON] Chained cartesian produces incorrect number of records	Andrew Ray	2016-12-08	2	-23/+53
*	[SPARK-18667][PYSPARK][SQL] Change the way to group row in BatchEvalPythonExe...	Liang-Chi Hsieh	2016-12-08	1	-0/+8
*	[SPARK-18754][SS] Rename recentProgresses to recentProgress	Michael Armbrust	2016-12-07	2	-5/+5
*	[SPARK-18652][PYTHON] Include the example data and third-party licenses in py...	Shuai Lin	2016-12-07	2	-1/+21
*	[SPARK-18657][SPARK-18668] Make StreamingQuery.id persists across restart and...	Tathagata Das	2016-12-05	1	-2/+17
*	[SPARK-18634][PYSPARK][SQL] Corruption and Correctness issues with exploding ...	Liang-Chi Hsieh	2016-12-05	1	-0/+20
*	[SPARK-18694][SS] Add StreamingQuery.explain and exception to Python and fix ...	Shixiong Zhu	2016-12-05	2	-0/+69
*	[SPARK-18690][PYTHON][SQL] Backward compatibility of unbounded frames	zero323	2016-12-02	2	-14/+51
*	[SPARK-18274][ML][PYSPARK] Memory leak in PySpark JavaWrapper	Sandeep Singh	2016-12-01	2	-18/+41
*	[SPARK-18366][PYSPARK][ML] Add handleInvalid to Pyspark for QuantileDiscretiz...	Sandeep Singh	2016-11-30	1	-14/+71
*	[SPARK-18516][STRUCTURED STREAMING] Follow up PR to add StreamingQuery.status...	Tathagata Das	2016-11-29	2	-0/+13
*	[SPARK-15819][PYSPARK][ML] Add KMeanSummary in KMeans of PySpark	Jeff Zhang	2016-11-29	2	-0/+56
*	[SPARK-18319][ML][QA2.1] 2.1 QA: API: Experimental, DeveloperApi, final, seal...	Yuhao	2016-11-29	4	-32/+0
*	[SPARK-18516][SQL] Split state and progress in streaming	Tathagata Das	2016-11-29	2	-304/+44
*	[SPARK-18523][PYSPARK] Make SparkContext.stop more reliable	Alexander Shorin	2016-11-28	1	-2/+15
*	[SPARK-18481][ML] ML 2.1 QA: Remove deprecated methods for ML	Yanbo Liang	2016-11-26	1	-4/+36
*	[SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P...	hyukjinkwon	2016-11-22	20	-146/+157
*	[SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframe	Burak Yavuz	2016-11-21	1	-3/+54
*	[SPARK-18361][PYSPARK] Expose RDD localCheckpoint in PySpark	Gabriel Huang	2016-11-21	2	-1/+49
*	[SPARK-18282][ML][PYSPARK] Add python clustering summaries for GMM and BKM	sethah	2016-11-21	4	-13/+212
*	[SPARK-18445][BUILD][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that`/`...	hyukjinkwon	2016-11-19	4	-6/+6
*	[SPARK-18365][DOCS] Improve Sample Method Documentation	anabranch	2016-11-17	2	-0/+10
*	[SPARK-1267][SPARK-18129] Allow PySpark to be pip installed	Holden Karau	2016-11-16	8	-1/+381
*	[SPARK-18459][SPARK-18460][STRUCTUREDSTREAMING] Rename triggerId to batchId a...	Tathagata Das	2016-11-16	1	-3/+3
*	[MINOR][PYSPARK] Improve error message when running PySpark with different mi...	Liang-Chi Hsieh	2016-11-10	1	-1/+3
*	[SPARK-17829][SQL] Stable format for offset log	Tyson Condie	2016-11-09	1	-6/+6
*	[SPARK-18239][SPARKR] Gradient Boosted Tree for R	Felix Cheung	2016-11-08	1	-5/+5
*	[MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently ...	hyukjinkwon	2016-11-05	1	-15/+20
*	[SPARK-14393][SQL][DOC] update doc for python and R	Felix Cheung	2016-11-03	1	-1/+1
*	[SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2....	Sean Owen	2016-11-03	1	-0/+4
*	[SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassi...	Zheng RuiFeng	2016-11-03	1	-5/+5
*	[SPARK-18088][ML] Various ChiSqSelector cleanups	Joseph K. Bradley	2016-11-01	2	-49/+46
*	[SPARK-17764][SQL] Add `to_json` supporting to convert nested struct column t...	hyukjinkwon	2016-11-01	3	-2/+25
*	[SPARK-18110][PYTHON][ML] add missing parameter in Python for RandomForest re...	Felix Cheung	2016-10-30	2	-11/+12
*	[SPARK-17219][ML] enhanced NaN value handling in Bucketizer	VinceShieh	2016-10-27	1	-5/+0
*	[SQL][DOC] updating doc for JSON source to link to jsonlines.org	Felix Cheung	2016-10-26	2	-3/+5
*	[SPARK-17926][SQL][STREAMING] Added json for statuses	Tathagata Das	2016-10-21	1	-6/+5
*	[SPARK-17960][PYSPARK][UPGRADE TO PY4J 0.10.4]	Jagadeesan	2016-10-21	3	-1/+1
*	[SPARK-17817] [PYSPARK] [FOLLOWUP] PySpark RDD Repartitioning Results in High...	Liang-Chi Hsieh	2016-10-18	1	-6/+6
*	[SPARK-17946][PYSPARK] Python crossJoin API similar to Scala	Srinath Shankar	2016-10-14	2	-6/+35
*	[SPARK-11775][PYSPARK][SQL] Allow PySpark to register Java UDF	Jeff Zhang	2016-10-14	1	-1/+27
*	[SPARK-16063][SQL] Add storageLevel to Dataset	Nick Pentreath	2016-10-14	1	-6/+30
*	[SPARK-17870][MLLIB][ML] Change statistic to pValue for SelectKBest and Selec...	Peng	2016-10-14	2	-6/+6
*	[SPARK-15402][ML][PYSPARK] PySpark ml.evaluation should support save/load	Yanbo Liang	2016-10-14	1	-9/+36
*	[SPARK-15957][FOLLOW-UP][ML][PYSPARK] Add Python API for RFormula forceIndexL...	Yanbo Liang	2016-10-13	2	-4/+43
*	[SPARK-17731][SQL][STREAMING] Metrics for structured streaming	Tathagata Das	2016-10-13	1	-0/+301
*	[SPARK-17745][ML][PYSPARK] update NB python api - add weight col parameter	WeichenXu	2016-10-12	1	-13/+13
*	[SPARK-17845] [SQL] More self-evident window function frame boundary API	Reynold Xin	2016-10-12	2	-30/+84
*	[SPARK-14761][SQL] Reject invalid join methods when join columns are not spec...	Bijay Pathak	2016-10-12	2	-16/+21