index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
Commit message (
Expand
)
Author
Age
Files
Lines
*
SPARK-4022 [CORE] [MLLIB] Replace colt dependency (LGPL) with commons-math
Sean Owen
2014-10-27
1
-1
/
+1
*
[SPARK-4088] [PySpark] Python worker should exit after socket is closed by JVM
Davies Liu
2014-10-25
1
-5
/
+7
*
[SPARK-4051] [SQL] [PySpark] Convert Row into dictionary
Davies Liu
2014-10-24
2
-0
/
+21
*
[SPARK-2652] [PySpark] donot use KyroSerializer as default serializer
Davies Liu
2014-10-23
1
-1
/
+0
*
[SPARK-3993] [PySpark] fix bug while reuse worker after take()
Davies Liu
2014-10-23
4
-4
/
+32
*
Fix for sampling error in NumPy v1.9 [SPARK-3995][PYSPARK]
freeman
2014-10-22
2
-2
/
+8
*
SPARK-3770: Make userFeatures accessible from python
Michelangelo D'Agostino
2014-10-21
1
-0
/
+31
*
replace awaitTransformation with awaitTermination in scaladoc/javadoc
Holden Karau
2014-10-21
1
-1
/
+1
*
[SPARK-4023] [MLlib] [PySpark] convert rdd into RDD of Vector
Davies Liu
2014-10-21
2
-4
/
+24
*
[SPARK-3207][MLLIB]Choose splits for continuous features in DecisionTree more...
Qiping Li
2014-10-20
1
-2
/
+2
*
[SPARK-3952] [Streaming] [PySpark] add Python examples in Streaming Programmi...
Davies Liu
2014-10-18
1
-3
/
+5
*
[SPARK-3855][SQL] Preserve the result attribute of python UDFs though transfo...
Michael Armbrust
2014-10-17
1
-0
/
+6
*
[SPARK-3971] [MLLib] [PySpark] hotfix: Customized pickler should work in clus...
Davies Liu
2014-10-16
11
-25
/
+37
*
[Spark] RDD take() method: overestimate too much
yingjieMiao
2014-10-13
1
-1
/
+4
*
[SPARK-2377] Python API for Streaming
giwa
2014-10-12
7
-4
/
+1647
*
[SPARK-3909][PySpark][Doc] A corrupted format in Sphinx documents and buildin...
cocoatomo
2014-10-11
3
-6
/
+8
*
[SPARK-3867][PySpark] ./python/run-tests failed when it run with Python 2.6 a...
cocoatomo
2014-10-11
2
-2
/
+10
*
[SPARK-3886] [PySpark] use AutoBatchedSerializer by default
Davies Liu
2014-10-10
2
-6
/
+9
*
[SPARK-3713][SQL] Uses JSON to serialize DataType objects
Cheng Lian
2014-10-08
1
-78
/
+75
*
[SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs
Davies Liu
2014-10-07
9
-145
/
+142
*
[SPARK-3486][MLlib][PySpark] PySpark support for Word2Vec
Liquan Pei
2014-10-07
1
-0
/
+193
*
[SPARK-3773][PySpark][Doc] Sphinx build warning
cocoatomo
2014-10-06
5
-16
/
+28
*
[SPARK-3786] [PySpark] speedup tests
Davies Liu
2014-10-06
3
-54
/
+45
*
[SPARK-2461] [PySpark] Add a toString method to GeneralizedLinearModel
Sandy Ryza
2014-10-06
3
-4
/
+7
*
[SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDD
Davies Liu
2014-10-01
3
-6
/
+16
*
[SPARK-3751] [mllib] DecisionTree: example update + print options
Joseph K. Bradley
2014-10-01
1
-2
/
+8
*
[SPARK-3478] [PySpark] Profile the Python tasks
Davies Liu
2014-09-30
6
-7
/
+108
*
[SPARK-3701][MLLIB] update python linalg api and small fixes
Xiangrui Meng
2014-09-30
1
-29
/
+121
*
[SPARK-3681] [SQL] [PySpark] fix serialization of List and Map in SchemaRDD
Davies Liu
2014-09-27
2
-27
/
+34
*
Revert "[SPARK-3478] [PySpark] Profile the Python tasks"
Josh Rosen
2014-09-26
6
-108
/
+7
*
[SPARK-3478] [PySpark] Profile the Python tasks
Davies Liu
2014-09-26
6
-7
/
+108
*
[SPARK-546] Add full outer join to RDD and DStream.
Aaron Staple
2014-09-24
2
-2
/
+39
*
[SPARK-3679] [PySpark] pickle the exact globals of functions
Davies Liu
2014-09-24
2
-6
/
+54
*
[SPARK-3634] [PySpark] User's module should take precedence over system modules
Davies Liu
2014-09-24
3
-8
/
+26
*
[PySpark] remove unnecessary use of numSlices from pyspark tests
Matthew Farrellee
2014-09-20
1
-2
/
+2
*
[SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row
Davies Liu
2014-09-19
2
-4
/
+20
*
[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib
Davies Liu
2014-09-19
14
-915
/
+649
*
[SPARK-3554] [PySpark] use broadcast automatically for large closure
Davies Liu
2014-09-18
4
-3
/
+19
*
[SPARK-3430] [PySpark] [Doc] generate PySpark API docs using Sphinx
Davies Liu
2014-09-16
4
-5
/
+15
*
[SPARK-2314][SQL] Override collect and take in python library, and count in j...
Aaron Staple
2014-09-16
1
-5
/
+42
*
[SPARK-3519] add distinct(n) to PySpark
Matthew Farrellee
2014-09-16
3
-4
/
+24
*
[SPARK-1087] Move python traceback utilities into new traceback_utils.py file.
Aaron Staple
2014-09-15
3
-61
/
+83
*
[SPARK-2951] [PySpark] support unpickle array.array for Python 2.6
Davies Liu
2014-09-15
2
-2
/
+1
*
[SPARK-3516] [mllib] DecisionTree: Add minInstancesPerNode, minInfoGain param...
qiping.lqp
2014-09-15
1
-4
/
+12
*
[SPARK-3463] [PySpark] aggregate and show spilled bytes in Python
Davies Liu
2014-09-13
3
-14
/
+34
*
[SPARK-3030] [PySpark] Reuse Python worker
Davies Liu
2014-09-13
5
-28
/
+70
*
[SPARK-3500] [SQL] use JavaSchemaRDD as SchemaRDD._jschema_rdd
Davies Liu
2014-09-12
2
-20
/
+46
*
[SPARK-3094] [PySpark] compatitable with PyPy
Davies Liu
2014-09-12
4
-118
/
+151
*
[PySpark] Add blank line so that Python RDD.top() docstring renders correctly
RJ Nowling
2014-09-12
1
-0
/
+1
*
[SPARK-3047] [PySpark] add an option to use str in textFileRDD
Davies Liu
2014-09-11
2
-11
/
+23
[next]