path: root/python
Commit message | Author | Date | Files | Lines
* [SPARK-18481][ML] ML 2.1 QA: Remove deprecated methods for ML | Yanbo Liang | 2016-11-26 | 1 | -4/+36
* [SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P... | hyukjinkwon | 2016-11-22 | 20 | -146/+157
* [SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframe | Burak Yavuz | 2016-11-21 | 1 | -3/+54
* [SPARK-18361][PYSPARK] Expose RDD localCheckpoint in PySpark | Gabriel Huang | 2016-11-21 | 2 | -1/+49
* [SPARK-18282][ML][PYSPARK] Add python clustering summaries for GMM and BKM | sethah | 2016-11-21 | 4 | -13/+212
* [SPARK-18445][BUILD][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that`/`... | hyukjinkwon | 2016-11-19 | 4 | -6/+6
* [SPARK-18365][DOCS] Improve Sample Method Documentation | anabranch | 2016-11-17 | 2 | -0/+10
* [SPARK-1267][SPARK-18129] Allow PySpark to be pip installed | Holden Karau | 2016-11-16 | 8 | -1/+381
* [SPARK-18459][SPARK-18460][STRUCTUREDSTREAMING] Rename triggerId to batchId a... | Tathagata Das | 2016-11-16 | 1 | -3/+3
* [MINOR][PYSPARK] Improve error message when running PySpark with different mi... | Liang-Chi Hsieh | 2016-11-10 | 1 | -1/+3
* [SPARK-17829][SQL] Stable format for offset log | Tyson Condie | 2016-11-09 | 1 | -6/+6
* [SPARK-18239][SPARKR] Gradient Boosted Tree for R | Felix Cheung | 2016-11-08 | 1 | -5/+5
* [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently ... | hyukjinkwon | 2016-11-05 | 1 | -15/+20
* [SPARK-14393][SQL][DOC] update doc for python and R | Felix Cheung | 2016-11-03 | 1 | -1/+1
* [SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2.... | Sean Owen | 2016-11-03 | 1 | -0/+4
* [SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassi... | Zheng RuiFeng | 2016-11-03 | 1 | -5/+5
* [SPARK-18088][ML] Various ChiSqSelector cleanups | Joseph K. Bradley | 2016-11-01 | 2 | -49/+46
* [SPARK-17764][SQL] Add `to_json` supporting to convert nested struct column t... | hyukjinkwon | 2016-11-01 | 3 | -2/+25
* [SPARK-18110][PYTHON][ML] add missing parameter in Python for RandomForest re... | Felix Cheung | 2016-10-30 | 2 | -11/+12
* [SPARK-17219][ML] enhanced NaN value handling in Bucketizer | VinceShieh | 2016-10-27 | 1 | -5/+0
* [SQL][DOC] updating doc for JSON source to link to jsonlines.org | Felix Cheung | 2016-10-26 | 2 | -3/+5
* [SPARK-17926][SQL][STREAMING] Added json for statuses | Tathagata Das | 2016-10-21 | 1 | -6/+5
* [SPARK-17960][PYSPARK][UPGRADE TO PY4J 0.10.4] | Jagadeesan | 2016-10-21 | 3 | -1/+1
* [SPARK-17817] [PYSPARK] [FOLLOWUP] PySpark RDD Repartitioning Results in High... | Liang-Chi Hsieh | 2016-10-18 | 1 | -6/+6
* [SPARK-17946][PYSPARK] Python crossJoin API similar to Scala | Srinath Shankar | 2016-10-14 | 2 | -6/+35
* [SPARK-11775][PYSPARK][SQL] Allow PySpark to register Java UDF | Jeff Zhang | 2016-10-14 | 1 | -1/+27
* [SPARK-16063][SQL] Add storageLevel to Dataset | Nick Pentreath | 2016-10-14 | 1 | -6/+30
* [SPARK-17870][MLLIB][ML] Change statistic to pValue for SelectKBest and Selec... | Peng | 2016-10-14 | 2 | -6/+6
* [SPARK-15402][ML][PYSPARK] PySpark ml.evaluation should support save/load | Yanbo Liang | 2016-10-14 | 1 | -9/+36
* [SPARK-15957][FOLLOW-UP][ML][PYSPARK] Add Python API for RFormula forceIndexL... | Yanbo Liang | 2016-10-13 | 2 | -4/+43
* [SPARK-17731][SQL][STREAMING] Metrics for structured streaming | Tathagata Das | 2016-10-13 | 1 | -0/+301
* [SPARK-17745][ML][PYSPARK] update NB python api - add weight col parameter | WeichenXu | 2016-10-12 | 1 | -13/+13
* [SPARK-17845] [SQL] More self-evident window function frame boundary API | Reynold Xin | 2016-10-12 | 2 | -30/+84
* [SPARK-14761][SQL] Reject invalid join methods when join columns are not spec... | Bijay Pathak | 2016-10-12 | 2 | -16/+21
* [SPARK-17720][SQL] introduce static SQL conf | Wenchen Fan | 2016-10-11 | 1 | -1/+1
* [SPARK-17387][PYSPARK] Creating SparkContext() from python without spark-subm... | Jeff Zhang | 2016-10-11 | 3 | -25/+75
* [SPARK-17817][PYSPARK] PySpark RDD Repartitioning Results in Highly Skewed Pa... | Liang-Chi Hsieh | 2016-10-11 | 2 | -3/+20
* [SPARK-17338][SQL][FOLLOW-UP] add global temp view | Wenchen Fan | 2016-10-11 | 1 | -0/+5
* [SPARK-17808][PYSPARK] Upgraded version of Pyrolite to 4.13 | Bryan Cutler | 2016-10-11 | 1 | -0/+8
* [SPARK-17844] Simplify DataFrame API for defining frame boundaries in window ... | Reynold Xin | 2016-10-10 | 2 | -0/+57
* [SPARK-17338][SQL] add global temp view | Wenchen Fan | 2016-10-10 | 3 | -4/+41
* [SPARK-16960][SQL] Deprecate approxCountDistinct, toDegrees and toRadians acc... | hyukjinkwon | 2016-10-07 | 1 | -9/+24
* [SPARK-17805][PYSPARK] Fix in sqlContext.read.text when pass in list of paths | Bryan Cutler | 2016-10-07 | 2 | -2/+8
* [SPARK-17744][ML] Parity check between the ml and mllib test suites for NB | Zheng RuiFeng | 2016-10-04 | 1 | -1/+0
* [SPARK-17587][PYTHON][MLLIB] SparseVector __getitem__ should follow __getitem... | zero323 | 2016-10-03 | 4 | -16/+36
* [SPARK-17679] [PYSPARK] remove unnecessary Py4J ListConverter patch | Jason White | 2016-10-03 | 4 | -24/+6
* [SPARK-17704][ML][MLLIB] ChiSqSelector performance improvement. | Sean Owen | 2016-10-01 | 1 | -1/+1
* [SPARK-17699] Support for parsing JSON string columns | Michael Armbrust | 2016-09-29 | 1 | -0/+23
* [MINOR][PYSPARK][DOCS] Fix examples in PySpark documentation | hyukjinkwon | 2016-09-28 | 4 | -10/+10
* [SPARK-17138][ML][MLIB] Add Python API for multinomial logistic regression | WeichenXu | 2016-09-27 | 1 | -20/+70