aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute...Xiangrui Meng2015-05-291-25/+33
* [SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModelXiangrui Meng2015-05-282-3/+32
* [MINOR] fix RegressionEvaluator docXiangrui Meng2015-05-281-1/+1
* [SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correctlinweizhong2015-05-261-4/+4
* [SPARK-7833] [ML] Add python wrapper for RegressionEvaluatorRam Sriharsha2015-05-241-2/+66
* [SPARK-7840] add insertInto() to WriterDavies Liu2015-05-232-8/+16
* [SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...Davies Liu2015-05-238-56/+365
* [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4Xiangrui Meng2015-05-214-61/+64
* [SPARK-7794] [MLLIB] update RegexTokenizer default settingsXiangrui Meng2015-05-211-21/+19
* [SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in PythonDavies Liu2015-05-211-2/+46
* [SPARK-7711] Add a startTime property to match the corresponding one in ScalaHolden Karau2015-05-212-0/+9
* [SPARK-7394][SQL] Add Pandas style cast (astype)kaka19922015-05-211-0/+2
* [SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutativeSean Owen2015-05-211-2/+10
* [SPARK-7606] [SQL] [PySpark] add version to Python SQL API docsDavies Liu2015-05-207-18/+170
* [SPARK-7762] [MLLIB] set default value for outputColXiangrui Meng2015-05-202-2/+3
* [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 ...Holden Karau2015-05-208-64/+96
* [SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlibYanbo Liang2015-05-201-0/+117
* [SPARK-7738] [SQL] [PySpark] add reader and writer API in PythonDavies Liu2015-05-195-90/+421
* [SPARK-7150] SparkContext.range() and SQLContext.range()Daoyuan Wang2015-05-184-0/+46
* [SPARK-6216] [PYSPARK] check python version of worker with driverDavies Liu2015-05-186-12/+16
* [SPARK-7380] [MLLIB] pipeline stages should be copyable in PythonXiangrui Meng2015-05-1813-254/+490
* [SPARK-6657] [PYSPARK] Fix doc warningsXiangrui Meng2015-05-184-10/+11
* [SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple filesDavies Liu2015-05-156-449/+552
* [SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in PythonDavies Liu2015-05-151-30/+46
* [SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on...FlytxtRnD2015-05-151-0/+6
* [SPARK-6258] [MLLIB] GaussianMixture Python API parity checkYanbo Liang2015-05-151-14/+53
* [SPARK-7548] [SQL] Add explode function for DataFramesMichael Armbrust2015-05-143-3/+44
* [SPARK-7619] [PYTHON] fix docstring signatureXiangrui Meng2015-05-145-55/+52
* [SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.mlXiangrui Meng2015-05-143-1/+43
* [SPARK-7278] [PySpark] DateType should find datetime.datetime acceptableksonj2015-05-141-1/+1
* [SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classificationBurak Yavuz2015-05-133-10/+501
* [SPARK-7593] [ML] Python Api for ml.feature.BucketizerBurak Yavuz2015-05-131-0/+77
* [SPARK-7321][SQL] Add Column expression for conditional statements (when/othe...Reynold Xin2015-05-123-2/+57
* [SPARK-7572] [MLLIB] do not import Param/Params under pyspark.mlXiangrui Meng2015-05-123-7/+11
* [SPARK-7487] [ML] Feature Parity in PySpark for ml.regressionBurak Yavuz2015-05-126-8/+709
* [SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pysparkDaoyuan Wang2015-05-122-0/+133
* [SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.Reynold Xin2015-05-111-1/+13
* [SPARK-7324] [SQL] DataFrame.dropDuplicatesReynold Xin2015-05-111-2/+34
* [SPARK-7462][SQL] Update documentation for retaining grouping columns in Data...Reynold Xin2015-05-111-0/+2
* [SPARK-7462] By default retain group by columns in aggregateReynold Xin2015-05-111-1/+1
* [SPARK-6092] [MLLIB] Add RankingMetrics in PySpark/MLlibYanbo Liang2015-05-111-2/+76
* [SPARK-7427] [PYSPARK] Make sharedParams match in Scala, PythonGlenn Weidner2015-05-103-21/+19
* [SPARK-7431] [ML] [PYTHON] Made CrossValidatorModel call parent init in PySparkJoseph K. Bradley2015-05-103-3/+4
* [SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlibYanbo Liang2015-05-101-0/+129
* [SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinctVinod K C2015-05-092-3/+0
* [SPARK-7488] [ML] Feature Parity in PySpark for ml.recommendationBurak Yavuz2015-05-083-0/+310
* [SPARK-5913] [MLLIB] Python API for ChiSqSelectorYanbo Liang2015-05-081-2/+57
* [SPARK-7133] [SQL] Implement struct, array, and map field accessorWenchen Fan2015-05-082-12/+19
* [SPARK-7474] [MLLIB] update ParamGridBuilder doctestXiangrui Meng2015-05-081-15/+13
* [SPARK-7383] [ML] Feature Parity in PySpark for ml.featuresBurak Yavuz2015-05-083-41/+849