spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	[SPARK-17264][SQL] DataStreamWriter should document that it only supports Par...	Sean Owen	2016-08-30	1	-1/+1
*	[SPARK-17001][ML] Enable standardScaler to standardize sparse vectors when wi...	Sean Owen	2016-08-27	1	-3/+2
*	[SPARK-17197][ML][PYSPARK] PySpark LiR/LoR supports tree aggregation level co...	Yanbo Liang	2016-08-25	4	-11/+42
*	[SPARK-17215][SQL] Method `SQLContext.parseDataType(dataTypeString: String)` ...	jiangxingbo	2016-08-24	6	-13/+16
*	[SPARK-16216][SQL] Read/write timestamps and dates in ISO 8601 and dateFormat...	hyukjinkwon	2016-08-24	2	-20/+66
*	[SPARK-15113][PYSPARK][ML] Add missing num features num classes	Holden Karau	2016-08-22	3	-11/+64
*	[SPARK-15018][PYSPARK][ML] Improve handling of PySpark Pipeline when used wit...	Bryan Cutler	2016-08-19	2	-8/+14
*	[SPARK-16965][MLLIB][PYSPARK] Fix bound checking for SparseVector.	Jeff Zhang	2016-08-19	1	-0/+15
*	[SPARK-16961][CORE] Fixed off-by-one error that biased randomizeInPlace	Nick Lavers	2016-08-19	3	-8/+8
*	[MINOR][DOC] Fix the descriptions for `properties` argument in the documenati...	mvervuurt	2016-08-16	1	-5/+6
*	[SPARK-17035] [SQL] [PYSPARK] Improve Timestamp not to lose precision for all...	Dongjoon Hyun	2016-08-16	2	-1/+6
*	[SPARK-16700][PYSPARK][SQL] create DataFrame from dict/Row with schema	Davies Liu	2016-08-15	4	-28/+62
*	[MINOR][ML] Rename TreeEnsembleModels to TreeEnsembleModel for PySpark	Yanbo Liang	2016-08-11	2	-6/+6
*	[SPARK-16324][SQL] regexp_extract should doc that it returns empty string whe...	Sean Owen	2016-08-10	1	-1/+5
*	[SPARK-16950] [PYSPARK] fromOffsets parameter support in KafkaUtils.createDir...	Mariusz Strzelecki	2016-08-09	2	-9/+6
*	[SPARK-16409][SQL] regexp_extract with optional groups causes NPE	Sean Owen	2016-08-07	1	-0/+3
*	[SPARK-16772][PYTHON][DOCS] Fix API doc references to UDFRegistration + Updat...	Nicholas Chammas	2016-08-06	1	-6/+5
*	[SPARK-16831][PYTHON] Fixed bug in CrossValidator.avgMetrics	=^_^=	2016-08-03	1	-1/+3
*	[SPARK-16062] [SPARK-15989] [SQL] Fix two bugs of Python-only UDTs	Liang-Chi Hsieh	2016-08-02	1	-0/+35
*	[SPARK-16772][PYTHON][DOCS] Restore "datatype string" to Python API docstrings	Nicholas Chammas	2016-07-29	2	-12/+8
*	[SPARK-16772] Correct API doc references to PySpark classes + formatting fixes	Nicholas Chammas	2016-07-28	8	-58/+75
*	[SPARK-15254][DOC] Improve ML pipeline Cross Validation Scaladoc & PyDoc	krishnakalyan3	2016-07-27	1	-2/+11
*	[SPARK-16653][ML][OPTIMIZER] update ANN convergence tolerance param default t...	WeichenXu	2016-07-25	1	-4/+4
*	[PYSPARK] add picklable SparseMatrix in pyspark.ml.common	WeichenXu	2016-07-24	1	-0/+1
*	[SPARK-16662][PYSPARK][SQL] fix HiveContext warning bug	WeichenXu	2016-07-23	1	-5/+4
*	[SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co...	Dongjoon Hyun	2016-07-22	1	-0/+2
*	[SPARK-16494][ML] Upgrade breeze version to 0.12	Yanbo Liang	2016-07-19	1	-1/+1
*	[DOC] improve python doc for rdd.histogram and dataframe.join	Mortada Mehyar	2016-07-18	2	-14/+14
*	[SPARK-14817][ML][MLLIB][DOC] Made DataFrame-based API primary in MLlib guide	Joseph K. Bradley	2016-07-15	3	-4/+7
*	[SPARK-16546][SQL][PYSPARK] update python dataframe.drop	WeichenXu	2016-07-14	1	-8/+19
*	[SPARK-16503] SparkSession should provide Spark version	Liwei Lin	2016-07-13	1	-0/+6
*	[SPARK-16536][SQL][PYSPARK][MINOR] Expose `sql` in PySpark Shell	Dongjoon Hyun	2016-07-13	1	-0/+1
*	[SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit ...	Joseph K. Bradley	2016-07-13	13	-197/+11
*	[SPARK-16429][SQL] Include `StringType` columns in `describe()`	Dongjoon Hyun	2016-07-08	1	-4/+4
*	[SPARK-13638][SQL] Add quoteAll option to CSV DataFrameWriter	Jurriaan Pruis	2016-07-08	1	-2/+5
*	[SPARK-16052][SQL] Improve `CollapseRepartition` optimizer for Repartition/Re...	Dongjoon Hyun	2016-07-08	1	-2/+2
*	[MINOR][PYSPARK][DOC] Fix wrongly formatted examples in PySpark documentation	hyukjinkwon	2016-07-06	6	-23/+26
*	[SPARK-16348][ML][MLLIB][PYTHON] Use full classpaths for pyspark ML JVM calls	Joseph K. Bradley	2016-07-05	8	-26/+28
*	[SPARK-16335][SQL] Structured streaming should fail if source directory does ...	Reynold Xin	2016-07-01	1	-7/+4
*	[SPARK-15954][SQL] Disable loading test tables in Python tests	Reynold Xin	2016-06-30	1	-1/+1
*	[SPARK-16328][ML][MLLIB][PYSPARK] Add 'asML' and 'fromML' conversion methods ...	Nick Pentreath	2016-06-30	2	-0/+168
*	[SPARK-16313][SQL] Spark should not silently drop exceptions in file listing	Reynold Xin	2016-06-30	2	-2/+2
*	[SPARK-16289][SQL] Implement posexplode table generating function	Dongjoon Hyun	2016-06-30	1	-0/+21
*	[SPARK-15820][PYSPARK][SQL] Add Catalog.refreshTable into python API	WeichenXu	2016-06-30	1	-0/+5
*	[TRIVIAL] [PYSPARK] Clean up orc compression option as well	hyukjinkwon	2016-06-29	1	-2/+1
*	[SPARK-16236][SQL][FOLLOWUP] Add Path Option back to Load API in DataFrameReader	gatorsmile	2016-06-29	1	-1/+3
*	[SPARK-16266][SQL][STREAING] Moved DataStreamReader/Writer from pyspark.sql t...	Tathagata Das	2016-06-28	5	-499/+505
*	[SPARK-16268][PYSPARK] SQLContext should import DataStreamReader	Shixiong Zhu	2016-06-28	1	-2/+9
*	[MINOR][DOCS][STRUCTURED STREAMING] Minor doc fixes around `DataFrameWriter` ...	Burak Yavuz	2016-06-28	1	-2/+2
*	[SPARK-16175] [PYSPARK] handle None for UDT	Davies Liu	2016-06-28	2	-2/+16