aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/rdd.py
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-13467] [PYSPARK] abstract python function to simplify pyspark codeWenchen Fan2016-02-241-9/+14
* [SPARK-13339][DOCS] Clarify commutative / associative operator requirements f...Sean Owen2016-02-191-4/+3
* [SPARK-5865][API DOC] Add doc warnings for methods that return local data str...Tommy YU2016-02-061-0/+17
* [SPARK-7683][PYSPARK] Confusing behavior of fold function of RDD in pysparkSean Owen2016-01-191-1/+1
* [SPARK-12091] [PYSPARK] Deprecate the JAVA-specific deserialized storage levelsgatorsmile2015-12-181-4/+4
* [SPARK-12090] [PYSPARK] consider shuffle in coalesce()Davies Liu2015-12-011-1/+1
* [SPARK-11658] simplify documentation for PySpark combineByKeyChris Snow2015-11-121-1/+0
* [SPARK-9821] [PYSPARK] pyspark-reduceByKey-should-take-a-custom-partitionerHolden Karau2015-09-211-13/+16
* [SPARK-10710] Remove ability to disable spilling in core and SQLJosh Rosen2015-09-191-18/+7
* [SPARK-10642] [PYSPARK] Fix crash when calling rdd.lookup() on tuple keysLiang-Chi Hsieh2015-09-171-1/+4
* [SPARK-9828] [PYSPARK] Mutable values should not be default argumentsMechCoder2015-08-141-1/+4
* [SPARK-9144] Remove DAGScheduler.runLocallyWithinThread and spark.localExecut...Josh Rosen2015-07-221-2/+2
* [SPARK-9021] [PYSPARK] Change RDD.aggregate() to do reduce(mapPartitions()) i...Nicholas Hwang2015-07-191-2/+8
* [SPARK-7735] [PYSPARK] Raise Exception on non-zero exit from pipe commandsScott Taylor2015-07-101-2/+14
* [SPARK-8738] [SQL] [PYSPARK] capture SQL AnalysisException in Python APIDavies Liu2015-06-301-1/+2
* [SPARK-7810] [PYSPARK] solve python rdd socket connection problemAi He2015-06-291-3/+15
* [SPARK-8541] [PYSPARK] test the absolute error in approx doctestsScott Taylor2015-06-221-2/+2
* [SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling...zsxwing2015-06-171-1/+1
* [SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutativeSean Owen2015-05-211-2/+10
* [SPARK-6216] [PYSPARK] check python version of worker with driverDavies Liu2015-05-181-2/+2
* [SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinctVinod K C2015-05-091-2/+0
* [SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expressionDavies Liu2015-04-211-0/+3
* [SPARK-4897] [PySpark] Python 3 supportDavies Liu2015-04-161-80/+109
* [SPARK-6886] [PySpark] fix big closure with shuffleDavies Liu2015-04-151-10/+5
* [SPARK-6216] [PySpark] check the python version in workerDavies Liu2015-04-101-1/+1
* [SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey.Milan Straka2015-04-101-1/+1
* [SPARK-3074] [PySpark] support groupByKey() with single huge keyDavies Liu2015-04-091-12/+36
* [SPARK-6667] [PySpark] remove setReuseAddressDavies Liu2015-04-021-0/+1
* [SPARK-6370][core] Documentation: Improve all 3 docs for RDD.samplembonaci2015-03-201-0/+6
* [SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()Davies Liu2015-03-091-16/+14
* [SPARK-5944] [PySpark] fix version in Python API docsDavies Liu2015-02-251-0/+4
* [SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializerDavies Liu2015-02-241-1/+1
* [SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySparkDavies Liu2015-02-171-16/+33
* SPARK-5633 pyspark saveAsTextFile support for compression codecVladimir Vladimirov2015-02-061-2/+20
* [SPARK-5577] Python udf for DataFrameDavies Liu2015-02-041-16/+22
* [SPARK-5430] move treeReduce and treeAggregate from mllib to coreXiangrui Meng2015-01-281-1/+90
* [SPARK-4387][PySpark] Refactoring python profiling code to make it extensibleYandu Oppacher2015-01-281-6/+9
* [SPARK-5440][pyspark] Add toLocalIterator to pyspark rddMichael Nazario2015-01-281-0/+14
* SPARK-5458. Refer to aggregateByKey instead of combineByKey in docsSandy Ryza2015-01-281-2/+2
* [SPARK-5063] More helpful error messages for several invalid operationsJosh Rosen2015-01-231-0/+11
* SPARK-5270 [CORE] Provide isEmpty() function in RDD APISean Owen2015-01-191-0/+12
* [SPARK-4822] Use sphinx tags for Python doc annotationslewuathe2014-12-171-4/+4
* [SPARK-4841] fix zip with textFile()Davies Liu2014-12-151-14/+11
* [SPARK-4477] [PySpark] remove numpy from RDDSamplerDavies Liu2014-11-201-4/+6
* [SPARK-4327] [PySpark] Python API for RDD.randomSplit()Davies Liu2014-11-181-3/+27
* [SPARK-4304] [PySpark] Fix sort on empty RDDDavies Liu2014-11-071-0/+2
* [SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa...Davies Liu2014-11-031-54/+37
* [SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sampleXiangrui Meng2014-11-031-3/+0
* [SPARK-4150][PySpark] return self in rdd.setNameXiangrui Meng2014-10-311-2/+2
* [Spark] RDD take() method: overestimate too muchyingjieMiao2014-10-131-1/+4