index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-7633] [MLLIB] [PYSPARK] Python bindings for StreamingLogisticRegressio...
MechCoder
2015-06-24
2
-2
/
+229
*
Revert "[SPARK-7157][SQL] add sampleBy to DataFrame"
Reynold Xin
2015-06-23
1
-40
/
+0
*
[SPARK-7157][SQL] add sampleBy to DataFrame
Xiangrui Meng
2015-06-23
1
-0
/
+40
*
[SPARK-8573] [SPARK-8568] [SQL] [PYSPARK] raise Exception if column is used i...
Davies Liu
2015-06-23
2
-1
/
+14
*
[SPARK-8265] [MLLIB] [PYSPARK] Add LinearDataGenerator to pyspark.mllib.utils
MechCoder
2015-06-23
2
-2
/
+55
*
[SPARK-8541] [PYSPARK] test the absolute error in approx doctests
Scott Taylor
2015-06-22
1
-2
/
+2
*
[SPARK-7781] [MLLIB] gradient boosted trees.train regressor missing max bins
Holden Karau
2015-06-22
2
-8
/
+21
*
[SPARK-8532] [SQL] In Python's DataFrameWriter, save/saveAsTable/json/parquet...
Yin Huai
2015-06-22
2
-11
/
+51
*
[SPARK-8104] [SQL] auto alias expressions in analyzer
Wenchen Fan
2015-06-22
1
-4
/
+5
*
[SPARK-8511] [PYSPARK] Modify a test to remove a saved model in `regression.py`
Yu ISHIKAWA
2015-06-22
5
-11
/
+21
*
[SPARK-7604] [MLLIB] Python API for PCA and PCAModel
Yanbo Liang
2015-06-21
1
-0
/
+35
*
[SPARK-8468] [ML] Take the negative of some metrics in RegressionEvaluator to...
Liang-Chi Hsieh
2015-06-20
1
-3
/
+5
*
[SPARK-4118] [MLLIB] [PYSPARK] Python bindings for StreamingKMeans
MechCoder
2015-06-19
2
-5
/
+352
*
[SPARK-8207] [SQL] Add math function bin
Liang-Chi Hsieh
2015-06-19
1
-0
/
+14
*
[SPARK-8339] [PYSPARK] integer division for python 3
Kevin Conor
2015-06-19
1
-1
/
+1
*
[SPARK-8444] [STREAMING] Adding Python streaming example for queueStream
Bryan Cutler
2015-06-19
1
-1
/
+1
*
[SPARK-8218][SQL] Binary log math function update.
Reynold Xin
2015-06-18
1
-4
/
+9
*
[SPARK-8202] [PYSPARK] fix infinite loop during external sort in PySpark
Davies Liu
2015-06-18
2
-5
/
+5
*
[SPARK-8218][SQL] Add binary log math function
Liang-Chi Hsieh
2015-06-17
1
-1
/
+17
*
[SPARK-7605] [MLLIB] [PYSPARK] Python API for ElementwiseProduct
MechCoder
2015-06-17
2
-2
/
+48
*
[SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling...
zsxwing
2015-06-17
3
-1
/
+15
*
[SPARK-6390] [SQL] [MLlib] Port MatrixUDT to PySpark
MechCoder
2015-06-17
3
-4
/
+95
*
[SPARK-7916] [MLLIB] MLlib Python doc parity check for classification and reg...
Yanbo Liang
2015-06-16
2
-107
/
+247
*
[SPARK-6411] [SQL] [PySpark] support date/datetime with timezone in Python
Davies Liu
2015-06-11
2
-9
/
+50
*
[SPARK-8189] [SQL] use Long for TimestampType in SQL
Davies Liu
2015-06-10
1
-0
/
+11
*
[SPARK-7886] Add built-in expressions to FunctionRegistry.
Reynold Xin
2015-06-09
1
-1
/
+1
*
[SPARK-7990][SQL] Add methods to facilitate equi-join on multiple joining keys
Liang-Chi Hsieh
2015-06-08
1
-13
/
+32
*
[SPARK-2808] [STREAMING] [KAFKA] cleanup tests from
cody koeninger
2015-06-07
1
-5
/
+0
*
[SPARK-8146] DataFrame Python API: Alias replace in df.na
Reynold Xin
2015-06-07
2
-26
/
+22
*
[SPARK-7639] [PYSPARK] [MLLIB] Python API for KernelDensity
MechCoder
2015-06-06
2
-1
/
+63
*
[SPARK-7991] [PySpark] Adding support for passing lists to describe.
amey
2015-06-05
1
-0
/
+12
*
[SPARK-8116][PYSPARK] Allow sc.range() to take a single argument.
Ted Blackman
2015-06-04
1
-2
/
+12
*
[SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe...
Mike Dusenberry
2015-06-04
1
-3
/
+18
*
Update documentation for [SPARK-7980] [SQL] Support SQLContext.range(end)
Reynold Xin
2015-06-03
1
-0
/
+2
*
[SPARK-7980] [SQL] Support SQLContext.range(end)
animesh
2015-06-03
2
-2
/
+12
*
[SPARK-8060] Improve DataFrame Python test coverage and documentation.
Reynold Xin
2015-06-03
5
-227
/
+176
*
[SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust
MechCoder
2015-06-02
1
-1
/
+3
*
[SPARK-8038] [SQL] [PYSPARK] fix Column.when() and otherwise()
Davies Liu
2015-06-02
1
-3
/
+28
*
[SPARK-7432] [MLLIB] fix flaky CrossValidator doctest
Xiangrui Meng
2015-06-02
1
-10
/
+9
*
[SPARK-8021] [SQL] [PYSPARK] make Python read/write API consistent with Scala
Davies Liu
2015-06-02
1
-27
/
+94
*
[minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem...
Reynold Xin
2015-06-01
1
-0
/
+3
*
[SPARK-7497] [PYSPARK] [STREAMING] fix streaming flaky tests
Davies Liu
2015-06-01
1
-8
/
+8
*
[SPARK-7978] [SQL] [PYSPARK] DecimalType should not be singleton
Davies Liu
2015-05-31
2
-2
/
+25
*
[SPARK-7918] [MLLIB] MLlib Python doc parity check for evaluation and feature
Yanbo Liang
2015-05-30
2
-39
/
+36
*
[SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflict
Michael Nazario
2015-05-29
5
-20
/
+4
*
[SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute...
Xiangrui Meng
2015-05-29
1
-25
/
+33
*
[SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel
Xiangrui Meng
2015-05-28
2
-3
/
+32
*
[MINOR] fix RegressionEvaluator doc
Xiangrui Meng
2015-05-28
1
-1
/
+1
*
[SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correct
linweizhong
2015-05-26
1
-4
/
+4
*
[SPARK-7833] [ML] Add python wrapper for RegressionEvaluator
Ram Sriharsha
2015-05-24
1
-2
/
+66
[next]