spark - Mirror of Apache Spark

	Commit message (Expand)	Author	Age	Files	Lines
*	[SPARK-19454][PYTHON][SQL] DataFrame.replace improvements	zero323	2017-04-05	1	-25/+56
*	[SPARK-20041][DOC] Update docs for NaN handling in approxQuantile	Zheng RuiFeng	2017-03-21	1	-1/+2
*	[SPARK-19497][SS] Implement streaming deduplication	Shixiong Zhu	2017-02-23	1	-0/+6
*	[SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column	Felix Cheung	2017-02-15	1	-1/+9
*	[SPARK-19453][PYTHON][SQL][DOC] Correct and extend DataFrame.replace docstring	zero323	2017-02-14	1	-6/+12
*	[SPARK-14352][SQL] approxQuantile should support multi columns	Zheng RuiFeng	2017-02-01	1	-7/+30
*	[SPARK-19126][DOCS] Update Join Documentation Across Languages	anabranch	2017-01-08	1	-2/+3
*	[SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P...	hyukjinkwon	2016-11-22	1	-15/+13
*	[SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframe	Burak Yavuz	2016-11-21	1	-3/+54
*	[SPARK-18365][DOCS] Improve Sample Method Documentation	anabranch	2016-11-17	1	-0/+5
*	[SPARK-17946][PYSPARK] Python crossJoin API similar to Scala	Srinath Shankar	2016-10-14	1	-5/+21
*	[SPARK-16063][SQL] Add storageLevel to Dataset	Nick Pentreath	2016-10-14	1	-6/+30
*	[SPARK-14761][SQL] Reject invalid join methods when join columns are not spec...	Bijay Pathak	2016-10-12	1	-16/+15
*	[SPARK-17338][SQL] add global temp view	Wenchen Fan	2016-10-10	1	-2/+23
*	[MINOR][PYSPARK][DOCS] Fix examples in PySpark documentation	hyukjinkwon	2016-09-28	1	-1/+1
*	[SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in...	Josh Rosen	2016-09-14	1	-4/+1
*	[SPARK-17298][SQL] Require explicit CROSS join for cartesian products	Srinath Shankar	2016-09-03	1	-1/+1
*	[SPARK-16772] Correct API doc references to PySpark classes + formatting fixes	Nicholas Chammas	2016-07-28	1	-1/+1
*	[SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co...	Dongjoon Hyun	2016-07-22	1	-0/+2
*	[DOC] improve python doc for rdd.histogram and dataframe.join	Mortada Mehyar	2016-07-18	1	-5/+5
*	[SPARK-16546][SQL][PYSPARK] update python dataframe.drop	WeichenXu	2016-07-14	1	-8/+19
*	[SPARK-16429][SQL] Include `StringType` columns in `describe()`	Dongjoon Hyun	2016-07-08	1	-4/+4
*	[SPARK-16052][SQL] Improve `CollapseRepartition` optimizer for Repartition/Re...	Dongjoon Hyun	2016-07-08	1	-2/+2
*	[MINOR][PYSPARK][DOC] Fix wrongly formatted examples in PySpark documentation	hyukjinkwon	2016-07-06	1	-4/+4
*	[SPARK-16266][SQL][STREAING] Moved DataStreamReader/Writer from pyspark.sql t...	Tathagata Das	2016-06-28	1	-1/+2
*	[MINOR][DOCS][STRUCTURED STREAMING] Minor doc fixes around `DataFrameWriter` ...	Burak Yavuz	2016-06-28	1	-2/+2
*	[SPARK-16128][SQL] Allow setting length of characters to be truncated to, in ...	Prashant Sharma	2016-06-28	1	-3/+15
*	[SPARK-15953][WIP][STREAMING] Renamed ContinuousQuery to StreamingQuery	Tathagata Das	2016-06-15	1	-1/+1
*	[SPARK-15933][SQL][STREAMING] Refactored DF reader-writer to use readStream a...	Tathagata Das	2016-06-14	1	-2/+16
*	[SPARK-15392][SQL] fix default value of size estimation of logical plan	Davies Liu	2016-05-19	1	-1/+1
*	[SPARK-14603][SQL][FOLLOWUP] Verification of Metadata Operations by Session C...	gatorsmile	2016-05-19	1	-2/+1
*	[SPARK-15171][SQL] Deprecate registerTempTable and add dataset.createTempView	Sean Zhong	2016-05-12	1	-3/+48
*	[SPARK-15278] [SQL] Remove experimental tag from Python DataFrame	Reynold Xin	2016-05-11	1	-2/+2
*	[MINOR] remove dead code	Davies Liu	2016-05-04	1	-9/+0
*	[SPARK-14555] First cut of Python API for Structured Streaming	Burak Yavuz	2016-04-20	1	-0/+12
*	[SPARK-14717] [PYTHON] Scala, Python APIs for Dataset.unpersist differ in def...	felixcheung	2016-04-19	1	-1/+3
*	[SPARK-14573][PYSPARK][BUILD] Fix PyDoc Makefile & highlighting issues	Holden Karau	2016-04-14	1	-1/+1
*	[SPARK-14334] [SQL] add toLocalIterator for Dataset/DataFrame	Davies Liu	2016-04-04	1	-0/+14
*	[SPARK-14142][SQL] Replace internal use of unionAll with union	Reynold Xin	2016-03-24	1	-2/+2
*	[SPARK-14088][SQL] Some Dataset API touch-up	Reynold Xin	2016-03-22	1	-2/+12
*	[SPARK-10380][SQL] Fix confusing documentation examples for astype/drop_dupli...	Reynold Xin	2016-03-14	1	-5/+15
*	[SPARK-13671] [SPARK-13311] [SQL] Use different physical plans for RDD and da...	Davies Liu	2016-03-12	1	-2/+1
*	[SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python Dat...	Wenchen Fan	2016-03-02	1	-40/+2
*	[SPARK-13479][SQL][PYTHON] Added Python API for approxQuantile	Joseph K. Bradley	2016-02-24	1	-0/+54
*	[SPARK-13250] [SQL] Update PhysicallRDD to convert to UnsafeRow if using the ...	Nong Li	2016-02-24	1	-1/+2
*	[SPARK-13329] [SQL] considering output for statistics of logical plan	Davies Liu	2016-02-23	1	-2/+2
*	[SPARK-13296][SQL] Move UserDefinedFunction into sql.expressions.	Reynold Xin	2016-02-13	1	-1/+1
*	[SPARK-12706] [SQL] grouping() and grouping_id()	Davies Liu	2016-02-10	1	-11/+11
*	[SPARK-5865][API DOC] Add doc warnings for methods that return local data str...	Tommy YU	2016-02-06	1	-0/+6
*	[SPARK-12756][SQL] use hash expression in Exchange	Wenchen Fan	2016-01-13	1	-13/+13