index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-19002][BUILD][PYTHON] Check pep8 against all Python scripts
hyukjinkwon
2017-01-02
1
-0
/
+3
*
[SPARK-17645][MLLIB][ML] add feature selector method based on: False Discover...
Peng
2016-12-28
2
-15
/
+109
*
[SPARK-18949][SQL] Add recoverPartitions API to Catalog
gatorsmile
2016-12-20
1
-0
/
+5
*
[SPARK-18576][PYTHON] Add basic TaskContext information to PySpark
Holden Karau
2016-12-20
4
-1
/
+165
*
[SPARK-18281] [SQL] [PYSPARK] Remove timeout for reading data through socket ...
Liang-Chi Hsieh
2016-12-20
2
-6
/
+17
*
[SPARK-18888] partitionBy in DataStreamWriter in Python throws _to_seq not de...
Burak Yavuz
2016-12-15
2
-3
/
+5
*
[SPARK-18852][SS] StreamingQuery.lastProgress should be null when recentProgr...
Shixiong Zhu
2016-12-14
2
-3
/
+24
*
[SPARK-18628][ML] Update Scala param and Python param to have quotes
krishnakalyan3
2016-12-11
1
-2
/
+2
*
[SPARK-18766][SQL] Push Down Filter Through BatchEvalPython (Python UDF)
gatorsmile
2016-12-10
1
-0
/
+9
*
[SPARK-16589] [PYTHON] Chained cartesian produces incorrect number of records
Andrew Ray
2016-12-08
2
-23
/
+53
*
[SPARK-18667][PYSPARK][SQL] Change the way to group row in BatchEvalPythonExe...
Liang-Chi Hsieh
2016-12-08
1
-0
/
+8
*
[SPARK-18754][SS] Rename recentProgresses to recentProgress
Michael Armbrust
2016-12-07
2
-5
/
+5
*
[SPARK-18652][PYTHON] Include the example data and third-party licenses in py...
Shuai Lin
2016-12-07
2
-1
/
+21
*
[SPARK-18657][SPARK-18668] Make StreamingQuery.id persists across restart and...
Tathagata Das
2016-12-05
1
-2
/
+17
*
[SPARK-18634][PYSPARK][SQL] Corruption and Correctness issues with exploding ...
Liang-Chi Hsieh
2016-12-05
1
-0
/
+20
*
[SPARK-18694][SS] Add StreamingQuery.explain and exception to Python and fix ...
Shixiong Zhu
2016-12-05
2
-0
/
+69
*
[SPARK-18690][PYTHON][SQL] Backward compatibility of unbounded frames
zero323
2016-12-02
2
-14
/
+51
*
[SPARK-18274][ML][PYSPARK] Memory leak in PySpark JavaWrapper
Sandeep Singh
2016-12-01
2
-18
/
+41
*
[SPARK-18366][PYSPARK][ML] Add handleInvalid to Pyspark for QuantileDiscretiz...
Sandeep Singh
2016-11-30
1
-14
/
+71
*
[SPARK-18516][STRUCTURED STREAMING] Follow up PR to add StreamingQuery.status...
Tathagata Das
2016-11-29
2
-0
/
+13
*
[SPARK-15819][PYSPARK][ML] Add KMeanSummary in KMeans of PySpark
Jeff Zhang
2016-11-29
2
-0
/
+56
*
[SPARK-18319][ML][QA2.1] 2.1 QA: API: Experimental, DeveloperApi, final, seal...
Yuhao
2016-11-29
4
-32
/
+0
*
[SPARK-18516][SQL] Split state and progress in streaming
Tathagata Das
2016-11-29
2
-304
/
+44
*
[SPARK-18523][PYSPARK] Make SparkContext.stop more reliable
Alexander Shorin
2016-11-28
1
-2
/
+15
*
[SPARK-18481][ML] ML 2.1 QA: Remove deprecated methods for ML
Yanbo Liang
2016-11-26
1
-4
/
+36
*
[SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P...
hyukjinkwon
2016-11-22
20
-146
/
+157
*
[SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframe
Burak Yavuz
2016-11-21
1
-3
/
+54
*
[SPARK-18361][PYSPARK] Expose RDD localCheckpoint in PySpark
Gabriel Huang
2016-11-21
2
-1
/
+49
*
[SPARK-18282][ML][PYSPARK] Add python clustering summaries for GMM and BKM
sethah
2016-11-21
4
-13
/
+212
*
[SPARK-18445][BUILD][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that`/`...
hyukjinkwon
2016-11-19
4
-6
/
+6
*
[SPARK-18365][DOCS] Improve Sample Method Documentation
anabranch
2016-11-17
2
-0
/
+10
*
[SPARK-1267][SPARK-18129] Allow PySpark to be pip installed
Holden Karau
2016-11-16
8
-1
/
+381
*
[SPARK-18459][SPARK-18460][STRUCTUREDSTREAMING] Rename triggerId to batchId a...
Tathagata Das
2016-11-16
1
-3
/
+3
*
[MINOR][PYSPARK] Improve error message when running PySpark with different mi...
Liang-Chi Hsieh
2016-11-10
1
-1
/
+3
*
[SPARK-17829][SQL] Stable format for offset log
Tyson Condie
2016-11-09
1
-6
/
+6
*
[SPARK-18239][SPARKR] Gradient Boosted Tree for R
Felix Cheung
2016-11-08
1
-5
/
+5
*
[MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently ...
hyukjinkwon
2016-11-05
1
-15
/
+20
*
[SPARK-14393][SQL][DOC] update doc for python and R
Felix Cheung
2016-11-03
1
-1
/
+1
*
[SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2....
Sean Owen
2016-11-03
1
-0
/
+4
*
[SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassi...
Zheng RuiFeng
2016-11-03
1
-5
/
+5
*
[SPARK-18088][ML] Various ChiSqSelector cleanups
Joseph K. Bradley
2016-11-01
2
-49
/
+46
*
[SPARK-17764][SQL] Add `to_json` supporting to convert nested struct column t...
hyukjinkwon
2016-11-01
3
-2
/
+25
*
[SPARK-18110][PYTHON][ML] add missing parameter in Python for RandomForest re...
Felix Cheung
2016-10-30
2
-11
/
+12
*
[SPARK-17219][ML] enhanced NaN value handling in Bucketizer
VinceShieh
2016-10-27
1
-5
/
+0
*
[SQL][DOC] updating doc for JSON source to link to jsonlines.org
Felix Cheung
2016-10-26
2
-3
/
+5
*
[SPARK-17926][SQL][STREAMING] Added json for statuses
Tathagata Das
2016-10-21
1
-6
/
+5
*
[SPARK-17960][PYSPARK][UPGRADE TO PY4J 0.10.4]
Jagadeesan
2016-10-21
3
-1
/
+1
*
[SPARK-17817] [PYSPARK] [FOLLOWUP] PySpark RDD Repartitioning Results in High...
Liang-Chi Hsieh
2016-10-18
1
-6
/
+6
*
[SPARK-17946][PYSPARK] Python crossJoin API similar to Scala
Srinath Shankar
2016-10-14
2
-6
/
+35
*
[SPARK-11775][PYSPARK][SQL] Allow PySpark to register Java UDF
Jeff Zhang
2016-10-14
1
-1
/
+27
[next]