aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/rdd.py
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-20232][PYTHON] Improve combineByKey docsDavid Gingrich2017-04-131-5/+19
* [SPARK-19872] [PYTHON] Use the correct deserializer for RDD construction for ...hyukjinkwon2017-03-151-1/+3
* [SPARK-13330][PYSPARK] PYTHONHASHSEED is not propgated to python workerJeff Zhang2017-02-241-1/+2
* [SPARK-18281] [SQL] [PYSPARK] Remove timeout for reading data through socket ...Liang-Chi Hsieh2016-12-201-6/+5
* [SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P...hyukjinkwon2016-11-221-26/+28
* [SPARK-18361][PYSPARK] Expose RDD localCheckpoint in PySparkGabriel Huang2016-11-211-1/+32
* [SPARK-18445][BUILD][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that`/`...hyukjinkwon2016-11-191-2/+2
* [SPARK-18365][DOCS] Improve Sample Method Documentationanabranch2016-11-171-0/+5
* [SPARK-17817] [PYSPARK] [FOLLOWUP] PySpark RDD Repartitioning Results in High...Liang-Chi Hsieh2016-10-181-6/+6
* [SPARK-17817][PYSPARK] PySpark RDD Repartitioning Results in Highly Skewed Pa...Liang-Chi Hsieh2016-10-111-3/+10
* [SPARK-17679] [PYSPARK] remove unnecessary Py4J ListConverter patchJason White2016-10-031-11/+2
* [MINOR][PYSPARK][DOCS] Fix examples in PySpark documentationhyukjinkwon2016-09-281-2/+2
* [DOC] improve python doc for rdd.histogram and dataframe.joinMortada Mehyar2016-07-181-9/+9
* [MINOR] Fix Typos 'an -> a'Zheng RuiFeng2016-06-061-2/+2
* [SPARK-15136][PYSPARK][DOC] Fix links to sphinx style and add a default param...Holden Karau2016-05-091-3/+3
* [SPARK-14368][PYSPARK] Support python.spark.worker.memory with upper-case unit.Yong Tang2016-04-051-1/+1
* [SPARK-14334] [SQL] add toLocalIterator for Dataset/DataFrameDavies Liu2016-04-041-4/+4
* [SPARK-13467] [PYSPARK] abstract python function to simplify pyspark codeWenchen Fan2016-02-241-9/+14
* [SPARK-13339][DOCS] Clarify commutative / associative operator requirements f...Sean Owen2016-02-191-4/+3
* [SPARK-5865][API DOC] Add doc warnings for methods that return local data str...Tommy YU2016-02-061-0/+17
* [SPARK-7683][PYSPARK] Confusing behavior of fold function of RDD in pysparkSean Owen2016-01-191-1/+1
* [SPARK-12091] [PYSPARK] Deprecate the JAVA-specific deserialized storage levelsgatorsmile2015-12-181-4/+4
* [SPARK-12090] [PYSPARK] consider shuffle in coalesce()Davies Liu2015-12-011-1/+1
* [SPARK-11658] simplify documentation for PySpark combineByKeyChris Snow2015-11-121-1/+0
* [SPARK-9821] [PYSPARK] pyspark-reduceByKey-should-take-a-custom-partitionerHolden Karau2015-09-211-13/+16
* [SPARK-10710] Remove ability to disable spilling in core and SQLJosh Rosen2015-09-191-18/+7
* [SPARK-10642] [PYSPARK] Fix crash when calling rdd.lookup() on tuple keysLiang-Chi Hsieh2015-09-171-1/+4
* [SPARK-9828] [PYSPARK] Mutable values should not be default argumentsMechCoder2015-08-141-1/+4
* [SPARK-9144] Remove DAGScheduler.runLocallyWithinThread and spark.localExecut...Josh Rosen2015-07-221-2/+2
* [SPARK-9021] [PYSPARK] Change RDD.aggregate() to do reduce(mapPartitions()) i...Nicholas Hwang2015-07-191-2/+8
* [SPARK-7735] [PYSPARK] Raise Exception on non-zero exit from pipe commandsScott Taylor2015-07-101-2/+14
* [SPARK-8738] [SQL] [PYSPARK] capture SQL AnalysisException in Python APIDavies Liu2015-06-301-1/+2
* [SPARK-7810] [PYSPARK] solve python rdd socket connection problemAi He2015-06-291-3/+15
* [SPARK-8541] [PYSPARK] test the absolute error in approx doctestsScott Taylor2015-06-221-2/+2
* [SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling...zsxwing2015-06-171-1/+1
* [SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutativeSean Owen2015-05-211-2/+10
* [SPARK-6216] [PYSPARK] check python version of worker with driverDavies Liu2015-05-181-2/+2
* [SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinctVinod K C2015-05-091-2/+0
* [SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expressionDavies Liu2015-04-211-0/+3
* [SPARK-4897] [PySpark] Python 3 supportDavies Liu2015-04-161-80/+109
* [SPARK-6886] [PySpark] fix big closure with shuffleDavies Liu2015-04-151-10/+5
* [SPARK-6216] [PySpark] check the python version in workerDavies Liu2015-04-101-1/+1
* [SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey.Milan Straka2015-04-101-1/+1
* [SPARK-3074] [PySpark] support groupByKey() with single huge keyDavies Liu2015-04-091-12/+36
* [SPARK-6667] [PySpark] remove setReuseAddressDavies Liu2015-04-021-0/+1
* [SPARK-6370][core] Documentation: Improve all 3 docs for RDD.samplembonaci2015-03-201-0/+6
* [SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()Davies Liu2015-03-091-16/+14
* [SPARK-5944] [PySpark] fix version in Python API docsDavies Liu2015-02-251-0/+4
* [SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializerDavies Liu2015-02-241-1/+1
* [SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySparkDavies Liu2015-02-171-16/+33