Mirror of Apache Spark
path: root/python/pyspark
| Commit message | Author | Date | Files | Lines (-/+) |
|---|---|---|---|---|
| [SPARK-7156][SQL] Addressed follow up comments for randomSplit | Burak Yavuz | 2015-04-29 | 1 | -1/+6 |
| [SPARK-7156][SQL] support RandomSplit in DataFrames | Burak Yavuz | 2015-04-29 | 1 | -1/+17 |
| Better error message on access to non-existing attribute | ksonj | 2015-04-29 | 1 | -1/+2 |
| [SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations | Patrick Wendell | 2015-04-29 | 1 | -1/+2 |
| [SPARK-7188] added python support for math DataFrame functions | Burak Yavuz | 2015-04-29 | 3 | -1/+131 |
| [SPARK-7208] [ML] [PYTHON] Added Matrix, SparseMatrix to __all__ list in lina... | Joseph K. Bradley | 2015-04-28 | 1 | -1/+2 |
| [SPARK-7135][SQL] DataFrame expression for monotonically increasing IDs. | Reynold Xin | 2015-04-28 | 1 | -1/+21 |
| [SPARK-5946] [STREAMING] Add Python API for direct Kafka stream | jerryshao | 2015-04-27 | 2 | -14/+237 |
| [SPARK-7152][SQL] Add a Column expression for partition ID. | Reynold Xin | 2015-04-26 | 1 | -9/+21 |
| [SPARK-7060][SQL] Add alias function to python dataframe | Yin Huai | 2015-04-23 | 1 | -0/+14 |
| [SPARK-6827] [MLLIB] Wrap FPGrowthModel.freqItemsets and make it consistent w... | Yanbo Liang | 2015-04-22 | 1 | -3/+12 |
| [SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin. | Reynold Xin | 2015-04-22 | 1 | -1/+8 |
| [SPARK-6953] [PySpark] speed up python tests | Reynold Xin | 2015-04-21 | 8 | -122/+174 |
| [SPARK-7036][MLLIB] ALS.train should support DataFrames in PySpark | Xiangrui Meng | 2015-04-21 | 1 | -10/+26 |
| [SPARK-6845] [MLlib] [PySpark] Add isTranposed flag to DenseMatrix | MechCoder | 2015-04-21 | 2 | -16/+49 |
| [SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression | Davies Liu | 2015-04-21 | 10 | -47/+70 |
| [SPARK-6661] Python type errors should print type, not object | Elisey Zanko | 2015-04-20 | 10 | -21/+23 |
| Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort. | Reynold Xin | 2015-04-17 | 1 | -3/+4 |
| [SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandas | Davies Liu | 2015-04-17 | 3 | -39/+70 |
| [SPARK-6911] [SQL] improve accessor for nested types | Davies Liu | 2015-04-16 | 2 | -5/+62 |
| [SPARK-4897] [PySpark] Python 3 support | Davies Liu | 2015-04-16 | 46 | -1090/+1018 |
| [SPARK-6893][ML] default pipeline parameter handling in python | Xiangrui Meng | 2015-04-15 | 9 | -114/+266 |
| [SPARK-6638] [SQL] Improve performance of StringType in SQL | Davies Liu | 2015-04-15 | 1 | -5/+5 |
| [SPARK-6886] [PySpark] fix big closure with shuffle | Davies Liu | 2015-04-15 | 2 | -14/+7 |
| [SPARK-6643][MLLIB] Implement StandardScalerModel missing methods | lewuathe | 2015-04-12 | 2 | -0/+40 |
| [SPARK-6677] [SQL] [PySpark] fix cached classes | Davies Liu | 2015-04-11 | 1 | -19/+20 |
| [SPARK-6216] [PySpark] check the python version in worker | Davies Liu | 2015-04-10 | 3 | -2/+22 |
| [SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey. | Milan Straka | 2015-04-10 | 2 | -1/+12 |
| [SPARK-6211][Streaming] Add Python Kafka API unit test | jerryshao | 2015-04-09 | 1 | -1/+42 |
| [SPARK-6577] [MLlib] [PySpark] SparseMatrix should be supported in PySpark | MechCoder | 2015-04-09 | 2 | -8/+154 |
| [SPARK-3074] [PySpark] support groupByKey() with single huge key | Davies Liu | 2015-04-09 | 6 | -143/+531 |
| [SPARK-6264] [MLLIB] Support FPGrowth algorithm in Python API | Yanbo Liang | 2015-04-09 | 2 | -1/+82 |
| [SPARK-6696] [SQL] Adds HiveContext.refreshTable to PySpark | Cheng Lian | 2015-04-08 | 1 | -0/+9 |
| [SPARK-6781] [SQL] use sqlContext in python shell | Davies Liu | 2015-04-08 | 7 | -53/+52 |
| [SPARK-6506] [pyspark] Do not try to retrieve SPARK_HOME when not needed... | Marcelo Vanzin | 2015-04-08 | 1 | -2/+1 |
| [SPARK-6720][MLLIB] PySpark MultivariateStatisticalSummary unit test for norm... | lewuathe | 2015-04-07 | 1 | -0/+7 |
| [SPARK-6262][MLLIB]Implement missing methods for MultivariateStatisticalSummary | lewuathe | 2015-04-05 | 2 | -0/+12 |
| [SPARK-6615][MLLIB] Python API for Word2Vec | lewuathe | 2015-04-03 | 2 | -6/+57 |
| [SPARK-6667] [PySpark] remove setReuseAddress | Davies Liu | 2015-04-02 | 1 | -0/+1 |
| [SPARK-6660][MLLIB] pythonToJava doesn't recognize object arrays | Xiangrui Meng | 2015-04-01 | 1 | -0/+8 |
| [SPARK-6553] [pyspark] Support functools.partial as UDF | ksonj | 2015-04-01 | 2 | -1/+33 |
| [SPARK-6576] [MLlib] [PySpark] DenseMatrix in PySpark should support indexing | MechCoder | 2015-04-01 | 2 | -0/+17 |
| [SPARK-6642][MLLIB] use 1.2 lambda scaling and remove addImplicit from Normal... | Xiangrui Meng | 2015-04-01 | 1 | -3/+3 |
| [SPARK-6657] [Python] [Docs] fixed python doc build warnings | Joseph K. Bradley | 2015-04-01 | 1 | -16/+10 |
| [SPARK-6651][MLLIB] delegate dense vector arithmetics to the underlying numpy... | Xiangrui Meng | 2015-04-01 | 1 | -1/+37 |
| [Doc] Improve Python DataFrame documentation | Reynold Xin | 2015-03-31 | 5 | -390/+250 |
| [SPARK-6255] [MLLIB] Support multiclass classification in Python API | Yanbo Liang | 2015-03-31 | 2 | -28/+116 |
| [SPARK-6598][MLLIB] Python API for IDFModel | lewuathe | 2015-03-31 | 2 | -0/+20 |
| [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python. | Reynold Xin | 2015-03-31 | 2 | -6/+45 |
| [SPARK-6119][SQL] DataFrame support for missing data handling | Reynold Xin | 2015-03-30 | 2 | -0/+182 |