index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
/
tests.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-3786] [PySpark] speedup tests
Davies Liu
2014-10-06
1
-50
/
+42
*
[SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDD
Davies Liu
2014-10-01
1
-2
/
+6
*
[SPARK-3478] [PySpark] Profile the Python tasks
Davies Liu
2014-09-30
1
-0
/
+30
*
[SPARK-3681] [SQL] [PySpark] fix serialization of List and Map in SchemaRDD
Davies Liu
2014-09-27
1
-0
/
+21
*
Revert "[SPARK-3478] [PySpark] Profile the Python tasks"
Josh Rosen
2014-09-26
1
-30
/
+0
*
[SPARK-3478] [PySpark] Profile the Python tasks
Davies Liu
2014-09-26
1
-0
/
+30
*
[SPARK-3679] [PySpark] pickle the exact globals of functions
Davies Liu
2014-09-24
1
-0
/
+18
*
[SPARK-3634] [PySpark] User's module should take precedence over system modules
Davies Liu
2014-09-24
1
-0
/
+12
*
[PySpark] remove unnecessary use of numSlices from pyspark tests
Matthew Farrellee
2014-09-20
1
-2
/
+2
*
[SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row
Davies Liu
2014-09-19
1
-1
/
+10
*
[SPARK-3554] [PySpark] use broadcast automatically for large closure
Davies Liu
2014-09-18
1
-0
/
+6
*
[SPARK-3519] add distinct(n) to PySpark
Matthew Farrellee
2014-09-16
1
-0
/
+17
*
[SPARK-2951] [PySpark] support unpickle array.array for Python 2.6
Davies Liu
2014-09-15
1
-2
/
+0
*
[SPARK-3463] [PySpark] aggregate and show spilled bytes in Python
Davies Liu
2014-09-13
1
-7
/
+8
*
[SPARK-3030] [PySpark] Reuse Python worker
Davies Liu
2014-09-13
1
-0
/
+35
*
[SPARK-3500] [SQL] use JavaSchemaRDD as SchemaRDD._jschema_rdd
Davies Liu
2014-09-12
1
-0
/
+28
*
[SPARK-3094] [PySpark] compatitable with PyPy
Davies Liu
2014-09-12
1
-9
/
+76
*
[SPARK-3458] enable python "with" statements for SparkContext
Matthew Farrellee
2014-09-09
1
-0
/
+29
*
SPARK-2978. Transformation with MR shuffle semantics
Sandy Ryza
2014-09-08
1
-0
/
+8
*
[SPARK-3415] [PySpark] removes SerializingAdapter code
Ward Viaene
2014-09-07
1
-0
/
+11
*
[SPARK-2334] fix AttributeError when call PipelineRDD.id()
Davies Liu
2014-09-06
1
-0
/
+9
*
[SPARK-3335] [SQL] [PySpark] support broadcast in Python UDF
Davies Liu
2014-09-03
1
-0
/
+22
*
[SPARK-2871] [PySpark] add countApproxDistinct() API
Davies Liu
2014-09-02
1
-0
/
+16
*
[SPARK-3073] [PySpark] use external sort in sortBy() and sortByKey()
Davies Liu
2014-08-26
1
-1
/
+41
*
[SPARK-2871] [PySpark] add histgram() API
Davies Liu
2014-08-26
1
-0
/
+104
*
[SPARK-2790] [PySpark] fix zip with serializers which have different batch si...
Davies Liu
2014-08-19
1
-1
/
+26
*
[SPARK-3103] [PySpark] fix saveAsTextFile() with utf-8
Davies Liu
2014-08-18
1
-0
/
+9
*
[SPARK-1065] [PySpark] improve supporting for large broadcast
Davies Liu
2014-08-16
1
-0
/
+7
*
[PySpark] [SPARK-2954] [SPARK-2948] [SPARK-2910] [SPARK-2101] Python 2.6 Fixes
Josh Rosen
2014-08-11
1
-3
/
+10
*
[SPARK-2627] [PySpark] have the build enforce PEP 8 automatically
Nicholas Chammas
2014-08-06
1
-62
/
+81
*
[SPARK-1687] [PySpark] pickable namedtuple
Davies Liu
2014-08-04
1
-0
/
+19
*
[SPARK-1740] [PySpark] kill the python worker
Davies Liu
2014-08-03
1
-0
/
+51
*
StatCounter on NumPy arrays [PYSPARK][SPARK-2012]
Jeremy Freeman
2014-08-01
1
-0
/
+24
*
[SPARK-2024] Add saveAsSequenceFile to PySpark
Kan Zhang
2014-07-30
1
-13
/
+304
*
[SPARK-791] [PySpark] fix pickle itemgetter with cloudpickle
Davies Liu
2014-07-29
1
-0
/
+6
*
[SPARK-2580] [PySpark] keep silent in worker if JVM close the socket
Davies Liu
2014-07-29
1
-0
/
+6
*
[SPARK-1550] [PySpark] Allow SparkContext creation after failed attempts
Josh Rosen
2014-07-27
1
-0
/
+6
*
[SPARK-2601] [PySpark] Fix Py4J error when transforming pickleFiles
Josh Rosen
2014-07-26
1
-0
/
+9
*
[SPARK-2538] [PySpark] Hash based disk spilling aggregation
Davies Liu
2014-07-24
1
-0
/
+57
*
[SPARK-2470] PEP8 fixes to PySpark
Nicholas Chammas
2014-07-21
1
-3
/
+7
*
SPARK-554. Add aggregateByKey.
Sandy Ryza
2014-06-12
1
-0
/
+15
*
SPARK-1416: PySpark support for SequenceFile and Hadoop InputFormats
Nick Pentreath
2014-06-09
1
-0
/
+145
*
[SPARK-1942] Stop clearing spark.driver.port in unit tests
Syed Hashmi
2014-06-03
1
-4
/
+0
*
SPARK-1917: fix PySpark import of scipy.special functions
Uri Laserson
2014-05-31
1
-0
/
+24
*
[SPARK-1549] Add Python support to spark-submit
Matei Zaharia
2014-05-06
1
-4
/
+127
*
SPARK-1004. PySpark on YARN
Sandy Ryza
2014-04-29
1
-1
/
+3
*
Fix for SPARK-1025: PySpark hang on missing files.
Josh Rosen
2014-01-23
1
-0
/
+11
*
Fix SPARK-978: ClassCastException in PySpark cartesian.
Josh Rosen
2014-01-23
1
-0
/
+9
*
Fix SPARK-1034: Py4JException on PySpark Cartesian Result
Josh Rosen
2014-01-23
1
-0
/
+7
*
Fixed Python API for sc.setCheckpointDir. Also other fixes based on Reynold's...
Tathagata Das
2013-12-24
1
-2
/
+2
[next]