spark - Mirror of Apache Spark

	Commit message (Expand)	Author	Age	Files	Lines
*	[SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in...	Josh Rosen	2016-09-14	1	-4/+1
*	[SPARK-17298][SQL] Require explicit CROSS join for cartesian products	Srinath Shankar	2016-09-03	1	-1/+1
*	[SPARK-16772] Correct API doc references to PySpark classes + formatting fixes	Nicholas Chammas	2016-07-28	1	-1/+1
*	[SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co...	Dongjoon Hyun	2016-07-22	1	-0/+2
*	[DOC] improve python doc for rdd.histogram and dataframe.join	Mortada Mehyar	2016-07-18	1	-5/+5
*	[SPARK-16546][SQL][PYSPARK] update python dataframe.drop	WeichenXu	2016-07-14	1	-8/+19
*	[SPARK-16429][SQL] Include `StringType` columns in `describe()`	Dongjoon Hyun	2016-07-08	1	-4/+4
*	[SPARK-16052][SQL] Improve `CollapseRepartition` optimizer for Repartition/Re...	Dongjoon Hyun	2016-07-08	1	-2/+2
*	[MINOR][PYSPARK][DOC] Fix wrongly formatted examples in PySpark documentation	hyukjinkwon	2016-07-06	1	-4/+4
*	[SPARK-16266][SQL][STREAING] Moved DataStreamReader/Writer from pyspark.sql t...	Tathagata Das	2016-06-28	1	-1/+2
*	[MINOR][DOCS][STRUCTURED STREAMING] Minor doc fixes around `DataFrameWriter` ...	Burak Yavuz	2016-06-28	1	-2/+2
*	[SPARK-16128][SQL] Allow setting length of characters to be truncated to, in ...	Prashant Sharma	2016-06-28	1	-3/+15
*	[SPARK-15953][WIP][STREAMING] Renamed ContinuousQuery to StreamingQuery	Tathagata Das	2016-06-15	1	-1/+1
*	[SPARK-15933][SQL][STREAMING] Refactored DF reader-writer to use readStream a...	Tathagata Das	2016-06-14	1	-2/+16
*	[SPARK-15392][SQL] fix default value of size estimation of logical plan	Davies Liu	2016-05-19	1	-1/+1
*	[SPARK-14603][SQL][FOLLOWUP] Verification of Metadata Operations by Session C...	gatorsmile	2016-05-19	1	-2/+1
*	[SPARK-15171][SQL] Deprecate registerTempTable and add dataset.createTempView	Sean Zhong	2016-05-12	1	-3/+48
*	[SPARK-15278] [SQL] Remove experimental tag from Python DataFrame	Reynold Xin	2016-05-11	1	-2/+2
*	[MINOR] remove dead code	Davies Liu	2016-05-04	1	-9/+0
*	[SPARK-14555] First cut of Python API for Structured Streaming	Burak Yavuz	2016-04-20	1	-0/+12
*	[SPARK-14717] [PYTHON] Scala, Python APIs for Dataset.unpersist differ in def...	felixcheung	2016-04-19	1	-1/+3
*	[SPARK-14573][PYSPARK][BUILD] Fix PyDoc Makefile & highlighting issues	Holden Karau	2016-04-14	1	-1/+1
*	[SPARK-14334] [SQL] add toLocalIterator for Dataset/DataFrame	Davies Liu	2016-04-04	1	-0/+14
*	[SPARK-14142][SQL] Replace internal use of unionAll with union	Reynold Xin	2016-03-24	1	-2/+2
*	[SPARK-14088][SQL] Some Dataset API touch-up	Reynold Xin	2016-03-22	1	-2/+12
*	[SPARK-10380][SQL] Fix confusing documentation examples for astype/drop_dupli...	Reynold Xin	2016-03-14	1	-5/+15
*	[SPARK-13671] [SPARK-13311] [SQL] Use different physical plans for RDD and da...	Davies Liu	2016-03-12	1	-2/+1
*	[SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python Dat...	Wenchen Fan	2016-03-02	1	-40/+2
*	[SPARK-13479][SQL][PYTHON] Added Python API for approxQuantile	Joseph K. Bradley	2016-02-24	1	-0/+54
*	[SPARK-13250] [SQL] Update PhysicallRDD to convert to UnsafeRow if using the ...	Nong Li	2016-02-24	1	-1/+2
*	[SPARK-13329] [SQL] considering output for statistics of logical plan	Davies Liu	2016-02-23	1	-2/+2
*	[SPARK-13296][SQL] Move UserDefinedFunction into sql.expressions.	Reynold Xin	2016-02-13	1	-1/+1
*	[SPARK-12706] [SQL] grouping() and grouping_id()	Davies Liu	2016-02-10	1	-11/+11
*	[SPARK-5865][API DOC] Add doc warnings for methods that return local data str...	Tommy YU	2016-02-06	1	-0/+6
*	[SPARK-12756][SQL] use hash expression in Exchange	Wenchen Fan	2016-01-13	1	-13/+13
*	[SPARK-12600][SQL] Remove deprecated methods in Spark SQL	Reynold Xin	2016-01-04	1	-47/+1
*	[SPARK-12520] [PYSPARK] Correct Descriptions and Add Use Cases in Equi-Join	gatorsmile	2015-12-27	1	-1/+4
*	[SQL] Fix mistake doc of join type for dataframe.join	Yanbo Liang	2015-12-19	1	-1/+1
*	[SPARK-12091] [PYSPARK] Deprecate the JAVA-specific deserialized storage levels	gatorsmile	2015-12-18	1	-3/+3
*	[SPARK-12012][SQL] Show more comprehensive PhysicalRDD metadata when visualiz...	Cheng Lian	2015-12-09	1	-1/+1
*	[SPARK-11969] [SQL] [PYSPARK] visualization of SQL query for pyspark	Davies Liu	2015-11-25	1	-1/+1
*	[SPARK-11720][SQL][ML] Handle edge cases when count = 0 or 1 for Stats function	JihongMa	2015-11-18	1	-1/+1
*	[SPARK-11420] Updating Stddev support via Imperative Aggregate	JihongMa	2015-11-12	1	-1/+1
*	[SPARK-9830][SQL] Remove AggregateExpression1 and Aggregate Operator used to ...	Yin Huai	2015-11-10	1	-1/+1
*	[SPARK-11410] [PYSPARK] Add python bindings for repartition and sortW…	Nong Li	2015-11-06	1	-16/+101
*	[SPARK-10116][CORE] XORShiftRandom.hashSeed is random in high bits	Imran Rashid	2015-11-06	1	-3/+3
*	[SPARK-11279][PYSPARK] Add DataFrame#toDF in PySpark	Jeff Zhang	2015-10-26	1	-0/+12
*	[SPARK-11205][PYSPARK] Delegate to scala DataFrame API rather than p…	Jeff Zhang	2015-10-20	1	-1/+2
*	[SPARK-10782] [PYTHON] Update dropDuplicates documentation	asokadiggs	2015-09-29	1	-0/+2
*	[SPARK-10731] [SQL] Delegate to Scala's DataFrame.take implementation in Pyth...	Reynold Xin	2015-09-23	1	-1/+4