index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on...
FlytxtRnD
2015-05-15
1
-0
/
+6
*
[SPARK-6258] [MLLIB] GaussianMixture Python API parity check
Yanbo Liang
2015-05-15
1
-14
/
+53
*
[SPARK-7548] [SQL] Add explode function for DataFrames
Michael Armbrust
2015-05-14
3
-3
/
+44
*
[SPARK-7619] [PYTHON] fix docstring signature
Xiangrui Meng
2015-05-14
5
-55
/
+52
*
[SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.ml
Xiangrui Meng
2015-05-14
3
-1
/
+43
*
[SPARK-7278] [PySpark] DateType should find datetime.datetime acceptable
ksonj
2015-05-14
1
-1
/
+1
*
[SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification
Burak Yavuz
2015-05-13
3
-10
/
+501
*
[SPARK-7593] [ML] Python Api for ml.feature.Bucketizer
Burak Yavuz
2015-05-13
1
-0
/
+77
*
[SPARK-7321][SQL] Add Column expression for conditional statements (when/othe...
Reynold Xin
2015-05-12
3
-2
/
+57
*
[SPARK-7572] [MLLIB] do not import Param/Params under pyspark.ml
Xiangrui Meng
2015-05-12
3
-7
/
+11
*
[SPARK-7487] [ML] Feature Parity in PySpark for ml.regression
Burak Yavuz
2015-05-12
6
-8
/
+709
*
[SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark
Daoyuan Wang
2015-05-12
2
-0
/
+133
*
[SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.
Reynold Xin
2015-05-11
1
-1
/
+13
*
[SPARK-7324] [SQL] DataFrame.dropDuplicates
Reynold Xin
2015-05-11
1
-2
/
+34
*
[SPARK-7462][SQL] Update documentation for retaining grouping columns in Data...
Reynold Xin
2015-05-11
1
-0
/
+2
*
[SPARK-7462] By default retain group by columns in aggregate
Reynold Xin
2015-05-11
1
-1
/
+1
*
[SPARK-6092] [MLLIB] Add RankingMetrics in PySpark/MLlib
Yanbo Liang
2015-05-11
1
-2
/
+76
*
[SPARK-7427] [PYSPARK] Make sharedParams match in Scala, Python
Glenn Weidner
2015-05-10
3
-21
/
+19
*
[SPARK-7431] [ML] [PYTHON] Made CrossValidatorModel call parent init in PySpark
Joseph K. Bradley
2015-05-10
3
-3
/
+4
*
[SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlib
Yanbo Liang
2015-05-10
1
-0
/
+129
*
[SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct
Vinod K C
2015-05-09
2
-3
/
+0
*
[SPARK-7488] [ML] Feature Parity in PySpark for ml.recommendation
Burak Yavuz
2015-05-08
3
-0
/
+310
*
[SPARK-5913] [MLLIB] Python API for ChiSqSelector
Yanbo Liang
2015-05-08
1
-2
/
+57
*
[SPARK-7133] [SQL] Implement struct, array, and map field accessor
Wenchen Fan
2015-05-08
2
-12
/
+19
*
[SPARK-7474] [MLLIB] update ParamGridBuilder doctest
Xiangrui Meng
2015-05-08
1
-15
/
+13
*
[SPARK-7383] [ML] Feature Parity in PySpark for ml.features
Burak Yavuz
2015-05-08
3
-41
/
+849
*
[SPARK-6948] [MLLIB] compress vectors in VectorAssembler
Xiangrui Meng
2015-05-07
1
-3
/
+3
*
[SPARK-7328] [MLLIB] [PYSPARK] Pyspark.mllib.linalg.Vectors: Missing items
MechCoder
2015-05-07
2
-2
/
+171
*
[SPARK-6093] [MLLIB] Add RegressionMetrics in PySpark/MLlib
Yanbo Liang
2015-05-07
1
-2
/
+76
*
[SPARK-7118] [Python] Add the coalesce Spark SQL function available in PySpark
Olivier Girardot
2015-05-07
1
-0
/
+37
*
[SPARK-7388] [SPARK-7383] wrapper for VectorAssembler in Python
Burak Yavuz
2015-05-07
4
-8
/
+78
*
[SPARK-7295][SQL] bitwise operations for DataFrame DSL
Shiti
2015-05-07
3
-0
/
+20
*
[SPARK-7432] [MLLIB] disable cv doctest
Xiangrui Meng
2015-05-06
1
-4
/
+4
*
[SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API
Xiangrui Meng
2015-05-06
3
-6
/
+194
*
[SPARK-6267] [MLLIB] Python API for IsotonicRegression
Yanbo Liang
2015-05-05
1
-2
/
+71
*
[SPARK-7358][SQL] Move DataFrame mathfunctions into functions
Burak Yavuz
2015-05-05
3
-102
/
+53
*
[SPARK-7294][SQL] ADD BETWEEN
云峤
2015-05-05
2
-0
/
+15
*
[SPARK-7333] [MLLIB] Add BinaryClassificationEvaluator to PySpark
Xiangrui Meng
2015-05-05
8
-3
/
+193
*
[SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames
Burak Yavuz
2015-05-05
1
-4
/
+5
*
[SPARK-6612] [MLLIB] [PYSPARK] Python KMeans parity
Hrishikesh Subramonian
2015-05-05
2
-7
/
+31
*
[SPARK-7202] [MLLIB] [PYSPARK] Add SparseMatrixPickler to SerDe
MechCoder
2015-05-05
2
-2
/
+5
*
[SPARK-7243][SQL] Contingency Tables for DataFrames
Burak Yavuz
2015-05-04
2
-0
/
+34
*
[SPARK-7319][SQL] Improve the output from DataFrame.show()
云峤
2015-05-04
1
-36
/
+69
*
[SPARK-7241] Pearson correlation for DataFrames
Burak Yavuz
2015-05-03
2
-0
/
+32
*
[SPARK-7329] [MLLIB] simplify ParamGridBuilder impl
Xiangrui Meng
2015-05-03
1
-19
/
+9
*
[SPARK-7022] [PYSPARK] [ML] Add ML.Tuning.ParamGridBuilder to PySpark
Omede Firouz
2015-05-03
2
-0
/
+95
*
[SPARK-3444] Fix typo in Dataframes.py introduced in []
Dean Chen
2015-05-02
1
-1
/
+1
*
[SPARK-7242] added python api for freqItems in DataFrames
Burak Yavuz
2015-05-01
2
-0
/
+32
*
[SPARK-3444] Provide an easy way to change log level
Holden Karau
2015-05-01
2
-1
/
+8
*
[SPARK-2808][Streaming][Kafka] update kafka to 0.8.2
cody koeninger
2015-05-01
1
-3
/
+5
[next]