index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
/
rdd.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-11658] simplify documentation for PySpark combineByKey
Chris Snow
2015-11-12
1
-1
/
+0
*
[SPARK-9821] [PYSPARK] pyspark-reduceByKey-should-take-a-custom-partitioner
Holden Karau
2015-09-21
1
-13
/
+16
*
[SPARK-10710] Remove ability to disable spilling in core and SQL
Josh Rosen
2015-09-19
1
-18
/
+7
*
[SPARK-10642] [PYSPARK] Fix crash when calling rdd.lookup() on tuple keys
Liang-Chi Hsieh
2015-09-17
1
-1
/
+4
*
[SPARK-9828] [PYSPARK] Mutable values should not be default arguments
MechCoder
2015-08-14
1
-1
/
+4
*
[SPARK-9144] Remove DAGScheduler.runLocallyWithinThread and spark.localExecut...
Josh Rosen
2015-07-22
1
-2
/
+2
*
[SPARK-9021] [PYSPARK] Change RDD.aggregate() to do reduce(mapPartitions()) i...
Nicholas Hwang
2015-07-19
1
-2
/
+8
*
[SPARK-7735] [PYSPARK] Raise Exception on non-zero exit from pipe commands
Scott Taylor
2015-07-10
1
-2
/
+14
*
[SPARK-8738] [SQL] [PYSPARK] capture SQL AnalysisException in Python API
Davies Liu
2015-06-30
1
-1
/
+2
*
[SPARK-7810] [PYSPARK] solve python rdd socket connection problem
Ai He
2015-06-29
1
-3
/
+15
*
[SPARK-8541] [PYSPARK] test the absolute error in approx doctests
Scott Taylor
2015-06-22
1
-2
/
+2
*
[SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling...
zsxwing
2015-06-17
1
-1
/
+1
*
[SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutative
Sean Owen
2015-05-21
1
-2
/
+10
*
[SPARK-6216] [PYSPARK] check python version of worker with driver
Davies Liu
2015-05-18
1
-2
/
+2
*
[SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct
Vinod K C
2015-05-09
1
-2
/
+0
*
[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression
Davies Liu
2015-04-21
1
-0
/
+3
*
[SPARK-4897] [PySpark] Python 3 support
Davies Liu
2015-04-16
1
-80
/
+109
*
[SPARK-6886] [PySpark] fix big closure with shuffle
Davies Liu
2015-04-15
1
-10
/
+5
*
[SPARK-6216] [PySpark] check the python version in worker
Davies Liu
2015-04-10
1
-1
/
+1
*
[SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey.
Milan Straka
2015-04-10
1
-1
/
+1
*
[SPARK-3074] [PySpark] support groupByKey() with single huge key
Davies Liu
2015-04-09
1
-12
/
+36
*
[SPARK-6667] [PySpark] remove setReuseAddress
Davies Liu
2015-04-02
1
-0
/
+1
*
[SPARK-6370][core] Documentation: Improve all 3 docs for RDD.sample
mbonaci
2015-03-20
1
-0
/
+6
*
[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
Davies Liu
2015-03-09
1
-16
/
+14
*
[SPARK-5944] [PySpark] fix version in Python API docs
Davies Liu
2015-02-25
1
-0
/
+4
*
[SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer
Davies Liu
2015-02-24
1
-1
/
+1
*
[SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark
Davies Liu
2015-02-17
1
-16
/
+33
*
SPARK-5633 pyspark saveAsTextFile support for compression codec
Vladimir Vladimirov
2015-02-06
1
-2
/
+20
*
[SPARK-5577] Python udf for DataFrame
Davies Liu
2015-02-04
1
-16
/
+22
*
[SPARK-5430] move treeReduce and treeAggregate from mllib to core
Xiangrui Meng
2015-01-28
1
-1
/
+90
*
[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible
Yandu Oppacher
2015-01-28
1
-6
/
+9
*
[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd
Michael Nazario
2015-01-28
1
-0
/
+14
*
SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs
Sandy Ryza
2015-01-28
1
-2
/
+2
*
[SPARK-5063] More helpful error messages for several invalid operations
Josh Rosen
2015-01-23
1
-0
/
+11
*
SPARK-5270 [CORE] Provide isEmpty() function in RDD API
Sean Owen
2015-01-19
1
-0
/
+12
*
[SPARK-4822] Use sphinx tags for Python doc annotations
lewuathe
2014-12-17
1
-4
/
+4
*
[SPARK-4841] fix zip with textFile()
Davies Liu
2014-12-15
1
-14
/
+11
*
[SPARK-4477] [PySpark] remove numpy from RDDSampler
Davies Liu
2014-11-20
1
-4
/
+6
*
[SPARK-4327] [PySpark] Python API for RDD.randomSplit()
Davies Liu
2014-11-18
1
-3
/
+27
*
[SPARK-4304] [PySpark] Fix sort on empty RDD
Davies Liu
2014-11-07
1
-0
/
+2
*
[SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa...
Davies Liu
2014-11-03
1
-54
/
+37
*
[SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sample
Xiangrui Meng
2014-11-03
1
-3
/
+0
*
[SPARK-4150][PySpark] return self in rdd.setName
Xiangrui Meng
2014-10-31
1
-2
/
+2
*
[Spark] RDD take() method: overestimate too much
yingjieMiao
2014-10-13
1
-1
/
+4
*
[SPARK-3909][PySpark][Doc] A corrupted format in Sphinx documents and buildin...
cocoatomo
2014-10-11
1
-1
/
+1
*
[SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs
Davies Liu
2014-10-07
1
-26
/
+26
*
[SPARK-3773][PySpark][Doc] Sphinx build warning
cocoatomo
2014-10-06
1
-0
/
+1
*
[SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDD
Davies Liu
2014-10-01
1
-3
/
+9
*
[SPARK-3478] [PySpark] Profile the Python tasks
Davies Liu
2014-09-30
1
-2
/
+8
*
Revert "[SPARK-3478] [PySpark] Profile the Python tasks"
Josh Rosen
2014-09-26
1
-8
/
+2
[next]