spark - Mirror of Apache Spark

	Commit message (Expand)	Author	Age	Files	Lines
*	[SPARK-13467] [PYSPARK] abstract python function to simplify pyspark code	Wenchen Fan	2016-02-24	1	-9/+14
*	[SPARK-13339][DOCS] Clarify commutative / associative operator requirements f...	Sean Owen	2016-02-19	1	-4/+3
*	[SPARK-5865][API DOC] Add doc warnings for methods that return local data str...	Tommy YU	2016-02-06	1	-0/+17
*	[SPARK-7683][PYSPARK] Confusing behavior of fold function of RDD in pyspark	Sean Owen	2016-01-19	1	-1/+1
*	[SPARK-12091] [PYSPARK] Deprecate the JAVA-specific deserialized storage levels	gatorsmile	2015-12-18	1	-4/+4
*	[SPARK-12090] [PYSPARK] consider shuffle in coalesce()	Davies Liu	2015-12-01	1	-1/+1
*	[SPARK-11658] simplify documentation for PySpark combineByKey	Chris Snow	2015-11-12	1	-1/+0
*	[SPARK-9821] [PYSPARK] pyspark-reduceByKey-should-take-a-custom-partitioner	Holden Karau	2015-09-21	1	-13/+16
*	[SPARK-10710] Remove ability to disable spilling in core and SQL	Josh Rosen	2015-09-19	1	-18/+7
*	[SPARK-10642] [PYSPARK] Fix crash when calling rdd.lookup() on tuple keys	Liang-Chi Hsieh	2015-09-17	1	-1/+4
*	[SPARK-9828] [PYSPARK] Mutable values should not be default arguments	MechCoder	2015-08-14	1	-1/+4
*	[SPARK-9144] Remove DAGScheduler.runLocallyWithinThread and spark.localExecut...	Josh Rosen	2015-07-22	1	-2/+2
*	[SPARK-9021] [PYSPARK] Change RDD.aggregate() to do reduce(mapPartitions()) i...	Nicholas Hwang	2015-07-19	1	-2/+8
*	[SPARK-7735] [PYSPARK] Raise Exception on non-zero exit from pipe commands	Scott Taylor	2015-07-10	1	-2/+14
*	[SPARK-8738] [SQL] [PYSPARK] capture SQL AnalysisException in Python API	Davies Liu	2015-06-30	1	-1/+2
*	[SPARK-7810] [PYSPARK] solve python rdd socket connection problem	Ai He	2015-06-29	1	-3/+15
*	[SPARK-8541] [PYSPARK] test the absolute error in approx doctests	Scott Taylor	2015-06-22	1	-2/+2
*	[SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling...	zsxwing	2015-06-17	1	-1/+1
*	[SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutative	Sean Owen	2015-05-21	1	-2/+10
*	[SPARK-6216] [PYSPARK] check python version of worker with driver	Davies Liu	2015-05-18	1	-2/+2
*	[SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct	Vinod K C	2015-05-09	1	-2/+0
*	[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression	Davies Liu	2015-04-21	1	-0/+3
*	[SPARK-4897] [PySpark] Python 3 support	Davies Liu	2015-04-16	1	-80/+109
*	[SPARK-6886] [PySpark] fix big closure with shuffle	Davies Liu	2015-04-15	1	-10/+5
*	[SPARK-6216] [PySpark] check the python version in worker	Davies Liu	2015-04-10	1	-1/+1
*	[SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey.	Milan Straka	2015-04-10	1	-1/+1
*	[SPARK-3074] [PySpark] support groupByKey() with single huge key	Davies Liu	2015-04-09	1	-12/+36
*	[SPARK-6667] [PySpark] remove setReuseAddress	Davies Liu	2015-04-02	1	-0/+1
*	[SPARK-6370][core] Documentation: Improve all 3 docs for RDD.sample	mbonaci	2015-03-20	1	-0/+6
*	[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()	Davies Liu	2015-03-09	1	-16/+14
*	[SPARK-5944] [PySpark] fix version in Python API docs	Davies Liu	2015-02-25	1	-0/+4
*	[SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer	Davies Liu	2015-02-24	1	-1/+1
*	[SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark	Davies Liu	2015-02-17	1	-16/+33
*	SPARK-5633 pyspark saveAsTextFile support for compression codec	Vladimir Vladimirov	2015-02-06	1	-2/+20
*	[SPARK-5577] Python udf for DataFrame	Davies Liu	2015-02-04	1	-16/+22
*	[SPARK-5430] move treeReduce and treeAggregate from mllib to core	Xiangrui Meng	2015-01-28	1	-1/+90
*	[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible	Yandu Oppacher	2015-01-28	1	-6/+9
*	[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd	Michael Nazario	2015-01-28	1	-0/+14
*	SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs	Sandy Ryza	2015-01-28	1	-2/+2
*	[SPARK-5063] More helpful error messages for several invalid operations	Josh Rosen	2015-01-23	1	-0/+11
*	SPARK-5270 [CORE] Provide isEmpty() function in RDD API	Sean Owen	2015-01-19	1	-0/+12
*	[SPARK-4822] Use sphinx tags for Python doc annotations	lewuathe	2014-12-17	1	-4/+4
*	[SPARK-4841] fix zip with textFile()	Davies Liu	2014-12-15	1	-14/+11
*	[SPARK-4477] [PySpark] remove numpy from RDDSampler	Davies Liu	2014-11-20	1	-4/+6
*	[SPARK-4327] [PySpark] Python API for RDD.randomSplit()	Davies Liu	2014-11-18	1	-3/+27
*	[SPARK-4304] [PySpark] Fix sort on empty RDD	Davies Liu	2014-11-07	1	-0/+2
*	[SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa...	Davies Liu	2014-11-03	1	-54/+37
*	[SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sample	Xiangrui Meng	2014-11-03	1	-3/+0
*	[SPARK-4150][PySpark] return self in rdd.setName	Xiangrui Meng	2014-10-31	1	-2/+2
*	[Spark] RDD take() method: overestimate too much	yingjieMiao	2014-10-13	1	-1/+4