spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	[SPARK-20132][DOCS] Add documentation for column string functions	Michael Patterson	2017-04-22	1	-6/+64
*	[SPARK-20360][PYTHON] reprs for interpreters	Kyle Kelley	2017-04-18	2	-0/+37
*	[SPARK-20232][PYTHON] Improve combineByKey docs	David Gingrich	2017-04-13	1	-5/+19
*	[SPARK-19570][PYSPARK] Allow to disable hive in pyspark shell	Jeff Zhang	2017-04-12	1	-6/+16
*	[MINOR][DOCS] JSON APIs related documentation fixes	hyukjinkwon	2017-04-12	2	-5/+7
*	[SPARK-19505][PYTHON] AttributeError on Exception.message in Python3	David Gingrich	2017-04-11	3	-5/+53
*	[SPARK-20285][TESTS] Increase the pyspark streaming test timeout to 30 seconds	Shixiong Zhu	2017-04-10	1	-1/+1
*	[SPARK-20076][ML][PYSPARK] Add Python interface for ml.stats.Correlation	Liang-Chi Hsieh	2017-04-07	1	-0/+61
*	[SPARK-20196][PYTHON][SQL] update doc for catalog functions for all languages...	Felix Cheung	2017-04-06	2	-9/+20
*	[SPARK-20064][PYSPARK] Bump the PySpark version number to 2.2	setjet	2017-04-06	1	-1/+1
*	[SPARK-20214][ML] Make sure converted csc matrix has sorted indices	Liang-Chi Hsieh	2017-04-05	3	-0/+17
*	[SPARK-19454][PYTHON][SQL] DataFrame.replace improvements	zero323	2017-04-05	2	-25/+128
*	[SPARK-20166][SQL] Use XXX for ISO 8601 timezone instead of ZZ (FastDateForma...	hyukjinkwon	2017-04-03	2	-6/+6
*	[SPARK-19955][PYSPARK] Jenkins Python Conda based test.	Holden Karau	2017-03-29	1	-3/+3
*	[SPARK-20040][ML][PYTHON] pyspark wrapper for ChiSquareTest	Bago Amirbekian	2017-03-28	3	-9/+123
*	[SPARK-20102] Fix nightly packaging and RC packaging scripts w/ two minor bui...	Josh Rosen	2017-03-27	1	-1/+0
*	[MINOR][DOCS] Match several documentation changes in Scala to R/Python	hyukjinkwon	2017-03-26	2	-4/+12
*	[SPARK-19281][PYTHON][ML] spark.ml Python API for FPGrowth	zero323	2017-03-26	3	-7/+270
*	[SPARK-15040][ML][PYSPARK] Add Imputer to PySpark	Nick Pentreath	2017-03-24	2	-0/+170
*	[SPARK-19876][SS][WIP] OneTime Trigger Executor	Tyson Condie	2017-03-23	2	-48/+32
*	[SPARK-18579][SQL] Use ignoreLeadingWhiteSpace and ignoreTrailingWhiteSpace o...	hyukjinkwon	2017-03-23	3	-16/+37
*	[SPARK-19949][SQL][FOLLOW-UP] Clean up parse modes and update related comments	hyukjinkwon	2017-03-22	2	-4/+4
*	[SPARK-20041][DOC] Update docs for NaN handling in approxQuantile	Zheng RuiFeng	2017-03-21	1	-1/+2
*	[SPARK-20011][ML][DOCS] Clarify documentation for ALS 'rank' parameter	christopher snow	2017-03-21	1	-2/+2
*	[SPARK-19849][SQL] Support ArrayType in to_json to produce JSON array	hyukjinkwon	2017-03-19	1	-5/+10
*	[SPARK-19986][TESTS] Make pyspark.streaming.tests.CheckpointTests more stable	Shixiong Zhu	2017-03-17	1	-5/+6
*	[SPARK-19872] [PYTHON] Use the correct deserializer for RDD construction for ...	hyukjinkwon	2017-03-15	2	-1/+9
*	[SPARK-19817][SS] Make it clear that `timeZone` is a general option in DataSt...	Liwei Lin	2017-03-14	2	-12/+28
*	[SPARK-19817][SQL] Make it clear that `timeZone` option is a general option i...	Takuya UESHIN	2017-03-14	1	-18/+28
*	[SPARK-12334][SQL][PYSPARK] Support read from multiple input paths for orc fi...	Jeff Zhang	2017-03-09	2	-6/+13
*	[SPARK-19561][SQL] add int case handling for TimestampType	Jason White	2017-03-09	1	-0/+8
*	[SPARK-19806][ML][PYSPARK] PySpark GeneralizedLinearRegression supports tweed...	Yanbo Liang	2017-03-08	2	-8/+73
*	Revert "[SPARK-19561] [PYTHON] cast TimestampType.toInternal output to long"	Wenchen Fan	2017-03-07	2	-7/+1
*	[SPARK-19561] [PYTHON] cast TimestampType.toInternal output to long	Jason White	2017-03-07	2	-1/+7
*	[SPARK-19701][SQL][PYTHON] Throws a correct exception for 'in' operator again...	hyukjinkwon	2017-03-05	2	-1/+6
*	[SPARK-19595][SQL] Support json array in from_json	hyukjinkwon	2017-03-05	1	-3/+8
*	[SPARK-19348][PYTHON] PySpark keyword_only decorator is not thread-safe	Bryan Cutler	2017-03-03	11	-120/+161
*	[SPARK-18352][DOCS] wholeFile JSON update doc and programming guide	Felix Cheung	2017-03-02	2	-4/+4
*	[SPARK-19734][PYTHON][ML] Correct OneHotEncoder doc string to say dropLast	Mark Grover	2017-03-01	1	-1/+1
*	[MINOR][ML] Fix comments in LSH Examples and Python API	Yun Ni	2017-03-01	1	-1/+1
*	[SPARK-19610][SQL] Support parsing multiline CSV files	hyukjinkwon	2017-02-28	4	-5/+22
*	[SPARK-14489][ML][PYSPARK] ALS unknown user/item prediction strategy	Nick Pentreath	2017-02-28	1	-5/+25
*	[SPARK-19660][CORE][SQL] Replace the configuration property names that are de...	Yuming Wang	2017-02-28	1	-23/+24
*	[SPARK-13330][PYSPARK] PYTHONHASHSEED is not propagated to python worker	Jeff Zhang	2017-02-24	2	-5/+4
*	[SPARK-19161][PYTHON][SQL] Improving UDF Docstrings	zero323	2017-02-24	2	-11/+25
*	[SPARK-14772][PYTHON][ML] Fixed Params.copy method to match Scala implementation	Bryan Cutler	2017-02-23	2	-6/+27
*	[SPARK-19706][PYSPARK] add Column.contains in pyspark	Wenchen Fan	2017-02-23	2	-1/+3
*	[SPARK-18699][SQL] Put malformed tokens into a new field when parsing CSV data	Takeshi Yamamuro	2017-02-23	2	-16/+48
*	[SPARK-19497][SS] Implement streaming deduplication	Shixiong Zhu	2017-02-23	1	-0/+6
*	[SPARK-19405][STREAMING] Support for cross-account Kinesis reads via STS	Adam Budde	2017-02-22	1	-2/+10