index : spark
Mirror of Apache Spark
path: root/python/pyspark
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| [SPARK-7991] [PySpark] Adding support for passing lists to describe. | amey | 2015-06-05 | 1 | -0/+12 |
| [SPARK-8116][PYSPARK] Allow sc.range() to take a single argument. | Ted Blackman | 2015-06-04 | 1 | -2/+12 |
| [SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe... | Mike Dusenberry | 2015-06-04 | 1 | -3/+18 |
| Update documentation for [SPARK-7980] [SQL] Support SQLContext.range(end) | Reynold Xin | 2015-06-03 | 1 | -0/+2 |
| [SPARK-7980] [SQL] Support SQLContext.range(end) | animesh | 2015-06-03 | 2 | -2/+12 |
| [SPARK-8060] Improve DataFrame Python test coverage and documentation. | Reynold Xin | 2015-06-03 | 5 | -227/+176 |
| [SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust | MechCoder | 2015-06-02 | 1 | -1/+3 |
| [SPARK-8038] [SQL] [PYSPARK] fix Column.when() and otherwise() | Davies Liu | 2015-06-02 | 1 | -3/+28 |
| [SPARK-7432] [MLLIB] fix flaky CrossValidator doctest | Xiangrui Meng | 2015-06-02 | 1 | -10/+9 |
| [SPARK-8021] [SQL] [PYSPARK] make Python read/write API consistent with Scala | Davies Liu | 2015-06-02 | 1 | -27/+94 |
| [minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem... | Reynold Xin | 2015-06-01 | 1 | -0/+3 |
| [SPARK-7497] [PYSPARK] [STREAMING] fix streaming flaky tests | Davies Liu | 2015-06-01 | 1 | -8/+8 |
| [SPARK-7978] [SQL] [PYSPARK] DecimalType should not be singleton | Davies Liu | 2015-05-31 | 2 | -2/+25 |
| [SPARK-7918] [MLLIB] MLlib Python doc parity check for evaluation and feature | Yanbo Liang | 2015-05-30 | 2 | -39/+36 |
| [SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflict | Michael Nazario | 2015-05-29 | 5 | -20/+4 |
| [SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute... | Xiangrui Meng | 2015-05-29 | 1 | -25/+33 |
| [SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel | Xiangrui Meng | 2015-05-28 | 2 | -3/+32 |
| [MINOR] fix RegressionEvaluator doc | Xiangrui Meng | 2015-05-28 | 1 | -1/+1 |
| [SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correct | linweizhong | 2015-05-26 | 1 | -4/+4 |
| [SPARK-7833] [ML] Add python wrapper for RegressionEvaluator | Ram Sriharsha | 2015-05-24 | 1 | -2/+66 |
| [SPARK-7840] add insertInto() to Writer | Davies Liu | 2015-05-23 | 2 | -8/+16 |
| [SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u... | Davies Liu | 2015-05-23 | 8 | -56/+365 |
| [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4 | Xiangrui Meng | 2015-05-21 | 4 | -61/+64 |
| [SPARK-7794] [MLLIB] update RegexTokenizer default settings | Xiangrui Meng | 2015-05-21 | 1 | -21/+19 |
| [SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in Python | Davies Liu | 2015-05-21 | 1 | -2/+46 |
| [SPARK-7711] Add a startTime property to match the corresponding one in Scala | Holden Karau | 2015-05-21 | 2 | -0/+9 |
| [SPARK-7394][SQL] Add Pandas style cast (astype) | kaka1992 | 2015-05-21 | 1 | -0/+2 |
| [SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutative | Sean Owen | 2015-05-21 | 1 | -2/+10 |
| [SPARK-7606] [SQL] [PySpark] add version to Python SQL API docs | Davies Liu | 2015-05-20 | 7 | -18/+170 |
| [SPARK-7762] [MLLIB] set default value for outputCol | Xiangrui Meng | 2015-05-20 | 2 | -2/+3 |
| [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 ... | Holden Karau | 2015-05-20 | 8 | -64/+96 |
| [SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlib | Yanbo Liang | 2015-05-20 | 1 | -0/+117 |
| [SPARK-7738] [SQL] [PySpark] add reader and writer API in Python | Davies Liu | 2015-05-19 | 5 | -90/+421 |
| [SPARK-7150] SparkContext.range() and SQLContext.range() | Daoyuan Wang | 2015-05-18 | 4 | -0/+46 |
| [SPARK-6216] [PYSPARK] check python version of worker with driver | Davies Liu | 2015-05-18 | 6 | -12/+16 |
| [SPARK-7380] [MLLIB] pipeline stages should be copyable in Python | Xiangrui Meng | 2015-05-18 | 13 | -254/+490 |
| [SPARK-6657] [PYSPARK] Fix doc warnings | Xiangrui Meng | 2015-05-18 | 4 | -10/+11 |
| [SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files | Davies Liu | 2015-05-15 | 5 | -449/+550 |
| [SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in Python | Davies Liu | 2015-05-15 | 1 | -30/+46 |
| [SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on... | FlytxtRnD | 2015-05-15 | 1 | -0/+6 |
| [SPARK-6258] [MLLIB] GaussianMixture Python API parity check | Yanbo Liang | 2015-05-15 | 1 | -14/+53 |
| [SPARK-7548] [SQL] Add explode function for DataFrames | Michael Armbrust | 2015-05-14 | 3 | -3/+44 |
| [SPARK-7619] [PYTHON] fix docstring signature | Xiangrui Meng | 2015-05-14 | 4 | -48/+45 |
| [SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.ml | Xiangrui Meng | 2015-05-14 | 3 | -1/+43 |
| [SPARK-7278] [PySpark] DateType should find datetime.datetime acceptable | ksonj | 2015-05-14 | 1 | -1/+1 |
| [SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification | Burak Yavuz | 2015-05-13 | 3 | -10/+501 |
| [SPARK-7593] [ML] Python Api for ml.feature.Bucketizer | Burak Yavuz | 2015-05-13 | 1 | -0/+77 |
| [SPARK-7321][SQL] Add Column expression for conditional statements (when/othe... | Reynold Xin | 2015-05-12 | 3 | -2/+57 |
| [SPARK-7572] [MLLIB] do not import Param/Params under pyspark.ml | Xiangrui Meng | 2015-05-12 | 2 | -6/+2 |
| [SPARK-7487] [ML] Feature Parity in PySpark for ml.regression | Burak Yavuz | 2015-05-12 | 4 | -8/+691 |