aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
...
* [SPARK-18766][SQL] Push Down Filter Through BatchEvalPython (Python UDF)gatorsmile2016-12-101-0/+9
* [SPARK-16589] [PYTHON] Chained cartesian produces incorrect number of recordsAndrew Ray2016-12-082-23/+53
* [SPARK-18667][PYSPARK][SQL] Change the way to group row in BatchEvalPythonExe...Liang-Chi Hsieh2016-12-081-0/+8
* [SPARK-18754][SS] Rename recentProgresses to recentProgressMichael Armbrust2016-12-072-5/+5
* [SPARK-18652][PYTHON] Include the example data and third-party licenses in py...Shuai Lin2016-12-072-1/+21
* [SPARK-18657][SPARK-18668] Make StreamingQuery.id persists across restart and...Tathagata Das2016-12-051-2/+17
* [SPARK-18634][PYSPARK][SQL] Corruption and Correctness issues with exploding ...Liang-Chi Hsieh2016-12-051-0/+20
* [SPARK-18694][SS] Add StreamingQuery.explain and exception to Python and fix ...Shixiong Zhu2016-12-052-0/+69
* [SPARK-18690][PYTHON][SQL] Backward compatibility of unbounded frameszero3232016-12-022-14/+51
* [SPARK-18274][ML][PYSPARK] Memory leak in PySpark JavaWrapperSandeep Singh2016-12-012-18/+41
* [SPARK-18366][PYSPARK][ML] Add handleInvalid to Pyspark for QuantileDiscretiz...Sandeep Singh2016-11-301-14/+71
* [SPARK-18516][STRUCTURED STREAMING] Follow up PR to add StreamingQuery.status...Tathagata Das2016-11-292-0/+13
* [SPARK-15819][PYSPARK][ML] Add KMeanSummary in KMeans of PySparkJeff Zhang2016-11-292-0/+56
* [SPARK-18319][ML][QA2.1] 2.1 QA: API: Experimental, DeveloperApi, final, seal...Yuhao2016-11-294-32/+0
* [SPARK-18516][SQL] Split state and progress in streamingTathagata Das2016-11-292-304/+44
* [SPARK-18523][PYSPARK] Make SparkContext.stop more reliableAlexander Shorin2016-11-281-2/+15
* [SPARK-18481][ML] ML 2.1 QA: Remove deprecated methods for MLYanbo Liang2016-11-261-4/+36
* [SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P...hyukjinkwon2016-11-2220-146/+157
* [SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframeBurak Yavuz2016-11-211-3/+54
* [SPARK-18361][PYSPARK] Expose RDD localCheckpoint in PySparkGabriel Huang2016-11-212-1/+49
* [SPARK-18282][ML][PYSPARK] Add python clustering summaries for GMM and BKMsethah2016-11-214-13/+212
* [SPARK-18445][BUILD][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that`/`...hyukjinkwon2016-11-194-6/+6
* [SPARK-18365][DOCS] Improve Sample Method Documentationanabranch2016-11-172-0/+10
* [SPARK-1267][SPARK-18129] Allow PySpark to be pip installedHolden Karau2016-11-168-1/+381
* [SPARK-18459][SPARK-18460][STRUCTUREDSTREAMING] Rename triggerId to batchId a...Tathagata Das2016-11-161-3/+3
* [MINOR][PYSPARK] Improve error message when running PySpark with different mi...Liang-Chi Hsieh2016-11-101-1/+3
* [SPARK-17829][SQL] Stable format for offset logTyson Condie2016-11-091-6/+6
* [SPARK-18239][SPARKR] Gradient Boosted Tree for RFelix Cheung2016-11-081-5/+5
* [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently ...hyukjinkwon2016-11-051-15/+20
* [SPARK-14393][SQL][DOC] update doc for python and RFelix Cheung2016-11-031-1/+1
* [SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2....Sean Owen2016-11-031-0/+4
* [SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassi...Zheng RuiFeng2016-11-031-5/+5
* [SPARK-18088][ML] Various ChiSqSelector cleanupsJoseph K. Bradley2016-11-012-49/+46
* [SPARK-17764][SQL] Add `to_json` supporting to convert nested struct column t...hyukjinkwon2016-11-013-2/+25
* [SPARK-18110][PYTHON][ML] add missing parameter in Python for RandomForest re...Felix Cheung2016-10-302-11/+12
* [SPARK-17219][ML] enhanced NaN value handling in BucketizerVinceShieh2016-10-271-5/+0
* [SQL][DOC] updating doc for JSON source to link to jsonlines.orgFelix Cheung2016-10-262-3/+5
* [SPARK-17926][SQL][STREAMING] Added json for statusesTathagata Das2016-10-211-6/+5
* [SPARK-17960][PYSPARK][UPGRADE TO PY4J 0.10.4]Jagadeesan2016-10-213-1/+1
* [SPARK-17817] [PYSPARK] [FOLLOWUP] PySpark RDD Repartitioning Results in High...Liang-Chi Hsieh2016-10-181-6/+6
* [SPARK-17946][PYSPARK] Python crossJoin API similar to ScalaSrinath Shankar2016-10-142-6/+35
* [SPARK-11775][PYSPARK][SQL] Allow PySpark to register Java UDFJeff Zhang2016-10-141-1/+27
* [SPARK-16063][SQL] Add storageLevel to DatasetNick Pentreath2016-10-141-6/+30
* [SPARK-17870][MLLIB][ML] Change statistic to pValue for SelectKBest and Selec...Peng2016-10-142-6/+6
* [SPARK-15402][ML][PYSPARK] PySpark ml.evaluation should support save/loadYanbo Liang2016-10-141-9/+36
* [SPARK-15957][FOLLOW-UP][ML][PYSPARK] Add Python API for RFormula forceIndexL...Yanbo Liang2016-10-132-4/+43
* [SPARK-17731][SQL][STREAMING] Metrics for structured streamingTathagata Das2016-10-131-0/+301
* [SPARK-17745][ML][PYSPARK] update NB python api - add weight col parameterWeichenXu2016-10-121-13/+13
* [SPARK-17845] [SQL] More self-evident window function frame boundary APIReynold Xin2016-10-122-30/+84
* [SPARK-14761][SQL] Reject invalid join methods when join columns are not spec...Bijay Pathak2016-10-122-16/+21