index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
/
rdd.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
Fix PySpark docs and an overly long line of code after fdbae41e
Matei Zaharia
2013-10-09
1
-8
/
+8
*
SPARK-705: implement sortByKey() in PySpark
Andre Schumacher
2013-10-07
1
-1
/
+47
*
Fixing SPARK-602: PythonPartitioner
Andre Schumacher
2013-10-04
1
-4
/
+6
*
Export StorageLevel and refactor
Aaron Davidson
2013-09-07
1
-1
/
+2
*
SPARK-660: Add StorageLevel support in Python
Aaron Davidson
2013-09-05
1
-0
/
+18
*
Merge pull request #861 from AndreSchumacher/pyspark_sampling_function
Matei Zaharia
2013-08-31
1
-7
/
+55
|
\
|
*
RDD sample() and takeSample() prototypes for PySpark
Andre Schumacher
2013-08-28
1
-7
/
+55
*
|
PySpark: implementing subtractByKey(), subtract() and keyBy()
Andre Schumacher
2013-08-28
1
-0
/
+37
|
/
*
Implementing SPARK-838: Add DoubleRDDFunctions methods to PySpark
Andre Schumacher
2013-08-21
1
-1
/
+59
*
Implementing SPARK-878 for PySpark: adding zip and egg files to context and p...
Andre Schumacher
2013-08-16
1
-1
/
+3
*
Do not inherit master's PYTHONPATH on workers.
Josh Rosen
2013-07-29
1
-3
/
+2
*
Use None instead of empty string as it's slightly smaller/faster
Matei Zaharia
2013-07-29
1
-1
/
+1
*
Optimize Python foreach() to not return as many objects
Matei Zaharia
2013-07-29
1
-1
/
+5
*
Optimize Python take() to not compute entire first partition
Matei Zaharia
2013-07-29
1
-6
/
+9
*
Add Apache license headers and LICENSE and NOTICE files
Matei Zaharia
2013-07-16
1
-0
/
+17
*
Fix Python saveAsTextFile doctest to not expect order to be preserved
Jey Kottalam
2013-04-02
1
-1
/
+1
*
Change numSplits to numPartitions in PySpark.
Josh Rosen
2013-02-24
1
-28
/
+28
*
Add commutative requirement for 'reduce' to Python docstring.
Mark Hamstra
2013-02-09
1
-2
/
+2
*
Fetch fewer objects in PySpark's take() method.
Josh Rosen
2013-02-03
1
-0
/
+4
*
Fix reporting of PySpark doctest failures.
Josh Rosen
2013-02-03
1
-1
/
+3
*
Use spark.local.dir for PySpark temp files (SPARK-580).
Josh Rosen
2013-02-01
1
-6
/
+1
*
Do not launch JavaGateways on workers (SPARK-674).
Josh Rosen
2013-02-01
1
-6
/
+6
*
Merge pull request #389 from JoshRosen/python_rdd_checkpointing
Matei Zaharia
2013-01-20
1
-1
/
+34
|
\
|
*
Clean up setup code in PySpark checkpointing tests
Josh Rosen
2013-01-20
1
-2
/
+1
|
*
Update checkpointing API docs in Python/Java.
Josh Rosen
2013-01-20
1
-12
/
+5
|
*
Add checkpointFile() and more tests to PySpark.
Josh Rosen
2013-01-20
1
-1
/
+8
|
*
Add RDD checkpointing to Python API.
Josh Rosen
2013-01-20
1
-0
/
+34
*
|
Fix PythonPartitioner equality; see SPARK-654.
Josh Rosen
2013-01-20
1
-6
/
+11
|
/
*
Added accumulators to PySpark
Matei Zaharia
2013-01-20
1
-1
/
+1
*
Add mapPartitionsWithSplit() to PySpark.
Josh Rosen
2013-01-08
1
-11
/
+22
*
Change PySpark RDD.take() to not call iterator().
Josh Rosen
2013-01-03
1
-6
/
+5
*
Rename top-level 'pyspark' directory to 'python'
Josh Rosen
2013-01-01
1
-0
/
+713