aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-14272][ML] Add Loglikelihood in GaussianMixtureSummaryZheng RuiFeng2017-01-191-0/+10
* [SPARK-19223][SQL][PYSPARK] Fix InputFileBlockHolder for datasources which ar...Liang-Chi Hsieh2017-01-181-0/+24
* [SPARK-19239][PYSPARK] Check parameters whether equals None when specify the ...DjvuLee2017-01-171-3/+6
* [SPARK-19019] [PYTHON] Fix hijacked `collections.namedtuple` and port cloudpi...hyukjinkwon2017-01-172-31/+87
* [SPARK-19148][SQL] do not expose the external table concept in CatalogWenchen Fan2017-01-171-3/+24
* [SPARK-18687][PYSPARK][SQL] Backward compatibility - creating a Dataframe on ...Vinayak2017-01-132-2/+7
* [SPARK-19055][SQL][PYSPARK] Fix SparkSession initialization when SparkContext...Liang-Chi Hsieh2017-01-122-6/+33
* [SPARK-19164][PYTHON][SQL] Remove unused UserDefinedFunction._broadcastzero3232017-01-121-6/+0
* [SPARK-19140][SS] Allow update mode for non-aggregation streaming queriesShixiong Zhu2017-01-101-8/+19
* [SPARK-17645][MLLIB][ML][FOLLOW-UP] document minor changePeng, Meng2017-01-102-7/+8
* [SPARK-17847][ML] Reduce shuffled data size of GaussianMixture & copy the imp...Yanbo Liang2017-01-091-18/+8
* [SPARK-19126][DOCS] Update Join Documentation Across Languagesanabranch2017-01-081-2/+3
* [SPARK-19127][DOCS] Update Rank Function Documentationanabranch2017-01-081-6/+10
* [SPARK-13748][PYSPARK][DOC] Add the description for explictly setting None fo...hyukjinkwon2017-01-071-1/+3
* [MINOR][DOCS] Remove consecutive duplicated words/typo in Spark RepoNiranjan Padmanabhan2017-01-043-4/+4
* [SPARK-17645][MLLIB][ML] add feature selector method based on: False Discover...Peng2016-12-282-15/+109
* [SPARK-18949][SQL] Add recoverPartitions API to Cataloggatorsmile2016-12-201-0/+5
* [SPARK-18576][PYTHON] Add basic TaskContext information to PySparkHolden Karau2016-12-204-1/+165
* [SPARK-18281] [SQL] [PYSPARK] Remove timeout for reading data through socket ...Liang-Chi Hsieh2016-12-202-6/+17
* [SPARK-18888] partitionBy in DataStreamWriter in Python throws _to_seq not de...Burak Yavuz2016-12-152-3/+5
* [SPARK-18852][SS] StreamingQuery.lastProgress should be null when recentProgr...Shixiong Zhu2016-12-142-3/+24
* [SPARK-18628][ML] Update Scala param and Python param to have quoteskrishnakalyan32016-12-111-2/+2
* [SPARK-18766][SQL] Push Down Filter Through BatchEvalPython (Python UDF)gatorsmile2016-12-101-0/+9
* [SPARK-16589] [PYTHON] Chained cartesian produces incorrect number of recordsAndrew Ray2016-12-082-23/+53
* [SPARK-18667][PYSPARK][SQL] Change the way to group row in BatchEvalPythonExe...Liang-Chi Hsieh2016-12-081-0/+8
* [SPARK-18754][SS] Rename recentProgresses to recentProgressMichael Armbrust2016-12-072-5/+5
* [SPARK-18657][SPARK-18668] Make StreamingQuery.id persists across restart and...Tathagata Das2016-12-051-2/+17
* [SPARK-18634][PYSPARK][SQL] Corruption and Correctness issues with exploding ...Liang-Chi Hsieh2016-12-051-0/+20
* [SPARK-18694][SS] Add StreamingQuery.explain and exception to Python and fix ...Shixiong Zhu2016-12-052-0/+69
* [SPARK-18690][PYTHON][SQL] Backward compatibility of unbounded frameszero3232016-12-022-14/+51
* [SPARK-18274][ML][PYSPARK] Memory leak in PySpark JavaWrapperSandeep Singh2016-12-012-18/+41
* [SPARK-18366][PYSPARK][ML] Add handleInvalid to Pyspark for QuantileDiscretiz...Sandeep Singh2016-11-301-14/+71
* [SPARK-18516][STRUCTURED STREAMING] Follow up PR to add StreamingQuery.status...Tathagata Das2016-11-292-0/+13
* [SPARK-15819][PYSPARK][ML] Add KMeanSummary in KMeans of PySparkJeff Zhang2016-11-292-0/+56
* [SPARK-18319][ML][QA2.1] 2.1 QA: API: Experimental, DeveloperApi, final, seal...Yuhao2016-11-294-32/+0
* [SPARK-18516][SQL] Split state and progress in streamingTathagata Das2016-11-292-304/+44
* [SPARK-18523][PYSPARK] Make SparkContext.stop more reliableAlexander Shorin2016-11-281-2/+15
* [SPARK-18481][ML] ML 2.1 QA: Remove deprecated methods for MLYanbo Liang2016-11-261-4/+36
* [SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P...hyukjinkwon2016-11-2220-146/+157
* [SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframeBurak Yavuz2016-11-211-3/+54
* [SPARK-18361][PYSPARK] Expose RDD localCheckpoint in PySparkGabriel Huang2016-11-212-1/+49
* [SPARK-18282][ML][PYSPARK] Add python clustering summaries for GMM and BKMsethah2016-11-214-13/+212
* [SPARK-18445][BUILD][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that`/`...hyukjinkwon2016-11-194-6/+6
* [SPARK-18365][DOCS] Improve Sample Method Documentationanabranch2016-11-172-0/+10
* [SPARK-1267][SPARK-18129] Allow PySpark to be pip installedHolden Karau2016-11-164-1/+96
* [SPARK-18459][SPARK-18460][STRUCTUREDSTREAMING] Rename triggerId to batchId a...Tathagata Das2016-11-161-3/+3
* [MINOR][PYSPARK] Improve error message when running PySpark with different mi...Liang-Chi Hsieh2016-11-101-1/+3
* [SPARK-17829][SQL] Stable format for offset logTyson Condie2016-11-091-6/+6
* [SPARK-18239][SPARKR] Gradient Boosted Tree for RFelix Cheung2016-11-081-5/+5
* [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently ...hyukjinkwon2016-11-051-15/+20