index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
Commit message (
Expand
)
Author
Age
Files
Lines
...
*
[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib
Davies Liu
2014-09-19
16
-917
/
+650
*
[SPARK-3554] [PySpark] use broadcast automatically for large closure
Davies Liu
2014-09-18
4
-3
/
+19
*
[SPARK-3430] [PySpark] [Doc] generate PySpark API docs using Sphinx
Davies Liu
2014-09-16
13
-5
/
+944
*
[SPARK-2314][SQL] Override collect and take in python library, and count in j...
Aaron Staple
2014-09-16
1
-5
/
+42
*
[SPARK-3519] add distinct(n) to PySpark
Matthew Farrellee
2014-09-16
3
-4
/
+24
*
[SPARK-1087] Move python traceback utilities into new traceback_utils.py file.
Aaron Staple
2014-09-15
3
-61
/
+83
*
[SPARK-2951] [PySpark] support unpickle array.array for Python 2.6
Davies Liu
2014-09-15
2
-2
/
+1
*
[SPARK-3516] [mllib] DecisionTree: Add minInstancesPerNode, minInfoGain param...
qiping.lqp
2014-09-15
1
-4
/
+12
*
[SPARK-3463] [PySpark] aggregate and show spilled bytes in Python
Davies Liu
2014-09-13
3
-14
/
+34
*
[SPARK-3030] [PySpark] Reuse Python worker
Davies Liu
2014-09-13
5
-28
/
+70
*
[SPARK-3500] [SQL] use JavaSchemaRDD as SchemaRDD._jschema_rdd
Davies Liu
2014-09-12
2
-20
/
+46
*
[SPARK-3094] [PySpark] compatitable with PyPy
Davies Liu
2014-09-12
5
-118
/
+172
*
[PySpark] Add blank line so that Python RDD.top() docstring renders correctly
RJ Nowling
2014-09-12
1
-0
/
+1
*
[SPARK-3047] [PySpark] add an option to use str in textFileRDD
Davies Liu
2014-09-11
2
-11
/
+23
*
[SPARK-3458] enable python "with" statements for SparkContext
Matthew Farrellee
2014-09-09
2
-0
/
+43
*
[SPARK-3443][MLLIB] update default values of tree:
Xiangrui Meng
2014-09-08
1
-2
/
+2
*
[SPARK-3417] Use new-style classes in PySpark
Matthew Rocklin
2014-09-08
4
-4
/
+4
*
Provide a default PYSPARK_PYTHON for python/run_tests
Matthew Farrellee
2014-09-08
1
-0
/
+2
*
SPARK-2978. Transformation with MR shuffle semantics
Sandy Ryza
2014-09-08
2
-0
/
+32
*
SPARK-3337 Paranoid quoting in shell to allow install dirs with spaces within.
Prashant Sharma
2014-09-08
1
-2
/
+4
*
[SPARK-3415] [PySpark] removes SerializingAdapter code
Ward Viaene
2014-09-07
2
-5
/
+12
*
[SPARK-2334] fix AttributeError when call PipelineRDD.id()
Davies Liu
2014-09-06
3
-4
/
+20
*
[SPARK-3273][SPARK-3301]We should read the version information from the same ...
GuoQiang Li
2014-09-06
1
-2
/
+2
*
Spark-3406 add a default storage level to python RDD persist API
Holden Karau
2014-09-06
2
-2
/
+8
*
SPARK-3211 .take() is OOM-prone with empty partitions
Andrew Ash
2014-09-05
1
-4
/
+4
*
[SPARK-3378] [DOCS] Replace the word "SparkSQL" with right word "Spark SQL"
Kousuke Saruta
2014-09-04
2
-4
/
+4
*
[SPARK-3401][PySpark] Wrong usage of tee command in python/run-tests
Kousuke Saruta
2014-09-04
1
-1
/
+1
*
[SPARK-2435] Add shutdown hook to pyspark
Matthew Farrellee
2014-09-03
1
-0
/
+2
*
[SPARK-3335] [SQL] [PySpark] support broadcast in Python UDF
Davies Liu
2014-09-03
2
-8
/
+31
*
[SPARK-3309] [PySpark] Put all public API in __all__
Davies Liu
2014-09-03
17
-26
/
+81
*
[SPARK-2871] [PySpark] add countApproxDistinct() API
Davies Liu
2014-09-02
2
-5
/
+50
*
SPARK-3318: Documentation update in addFile on how to use SparkFiles.get
Holden Karau
2014-08-30
1
-2
/
+2
*
[SPARK-3307] [PySpark] Fix doc string of SparkContext.broadcast()
Davies Liu
2014-08-29
1
-2
/
+0
*
[SPARK-2871] [PySpark] add RDD.lookup(key)
Davies Liu
2014-08-27
1
-132
/
+79
*
[SPARK-3167] Handle special driver configs in Windows
Andrew Or
2014-08-26
1
-0
/
+17
*
[SPARK-3073] [PySpark] use external sort in sortBy() and sortByKey()
Davies Liu
2014-08-26
4
-11
/
+1021
*
[SPARK-2969][SQL] Make ScalaReflection be able to handle ArrayType.containsNu...
Takuya UESHIN
2014-08-26
1
-3
/
+3
*
[SPARK-2871] [PySpark] add histgram() API
Davies Liu
2014-08-26
2
-1
/
+232
*
[SPARK-2871] [PySpark] add zipWithIndex() and zipWithUniqueId()
Davies Liu
2014-08-24
1
-0
/
+47
*
[SPARK-2871] [PySpark] add approx API for RDD
Davies Liu
2014-08-23
1
-0
/
+81
*
[SPARK-2871] [PySpark] add `key` argument for max(), min() and top(n)
Davies Liu
2014-08-23
1
-17
/
+27
*
[SPARK-3140] Clarify confusing PySpark exception message
Andrew Or
2014-08-20
1
-3
/
+10
*
[SPARK-3141] [PySpark] fix sortByKey() with take()
Davies Liu
2014-08-19
1
-10
/
+8
*
[SPARK-2974] [SPARK-2975] Fix two bugs related to spark.local.dirs
Josh Rosen
2014-08-19
1
-1
/
+1
*
[SPARK-3136][MLLIB] Create Java-friendly methods in RandomRDDs
Xiangrui Meng
2014-08-19
1
-10
/
+10
*
[SPARK-2790] [PySpark] fix zip with serializers which have different batch si...
Davies Liu
2014-08-19
3
-1
/
+54
*
[SPARK-3114] [PySpark] Fix Python UDFs in Spark SQL.
Josh Rosen
2014-08-18
3
-5
/
+3
*
[SPARK-2850] [SPARK-2626] [mllib] MLlib stats examples + small fixes
Joseph K. Bradley
2014-08-18
3
-10
/
+23
*
[mllib] DecisionTree: treeAggregate + Python example bug fix
Joseph K. Bradley
2014-08-18
2
-6
/
+9
*
[SPARK-3103] [PySpark] fix saveAsTextFile() with utf-8
Davies Liu
2014-08-18
2
-1
/
+12
[prev]
[next]