aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe...Mike Dusenberry2015-06-041-3/+18
* Update documentation for [SPARK-7980] [SQL] Support SQLContext.range(end)Reynold Xin2015-06-031-0/+2
* [SPARK-7980] [SQL] Support SQLContext.range(end)animesh2015-06-032-2/+12
* [SPARK-8060] Improve DataFrame Python test coverage and documentation.Reynold Xin2015-06-0319-227/+179
* [SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robustMechCoder2015-06-021-1/+3
* [SPARK-8038] [SQL] [PYSPARK] fix Column.when() and otherwise()Davies Liu2015-06-021-3/+28
* [SPARK-7432] [MLLIB] fix flaky CrossValidator doctestXiangrui Meng2015-06-021-10/+9
* [SPARK-8021] [SQL] [PYSPARK] make Python read/write API consistent with ScalaDavies Liu2015-06-021-27/+94
* [minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem...Reynold Xin2015-06-011-0/+3
* [SPARK-7497] [PYSPARK] [STREAMING] fix streaming flaky testsDavies Liu2015-06-011-8/+8
* [SPARK-7978] [SQL] [PYSPARK] DecimalType should not be singletonDavies Liu2015-05-312-2/+25
* [MINOR] Enable PySpark SQL readerwriter and window testsJosh Rosen2015-05-311-0/+2
* [SPARK-7918] [MLLIB] MLlib Python doc parity check for evaluation and featureYanbo Liang2015-05-302-39/+36
* [SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflictMichael Nazario2015-05-296-58/+42
* [SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute...Xiangrui Meng2015-05-291-25/+33
* [SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModelXiangrui Meng2015-05-282-3/+32
* [MINOR] fix RegressionEvaluator docXiangrui Meng2015-05-281-1/+1
* [SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correctlinweizhong2015-05-261-4/+4
* [SPARK-7833] [ML] Add python wrapper for RegressionEvaluatorRam Sriharsha2015-05-241-2/+66
* [SPARK-7840] add insertInto() to WriterDavies Liu2015-05-232-8/+16
* [SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...Davies Liu2015-05-238-56/+365
* [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4Xiangrui Meng2015-05-214-61/+64
* [SPARK-7794] [MLLIB] update RegexTokenizer default settingsXiangrui Meng2015-05-211-21/+19
* [SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in PythonDavies Liu2015-05-211-2/+46
* [SPARK-7711] Add a startTime property to match the corresponding one in ScalaHolden Karau2015-05-212-0/+9
* [SPARK-7394][SQL] Add Pandas style cast (astype)kaka19922015-05-211-0/+2
* [SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutativeSean Owen2015-05-211-2/+10
* [SPARK-7606] [SQL] [PySpark] add version to Python SQL API docsDavies Liu2015-05-207-18/+170
* [SPARK-7762] [MLLIB] set default value for outputColXiangrui Meng2015-05-202-2/+3
* [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 ...Holden Karau2015-05-208-64/+96
* [SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlibYanbo Liang2015-05-201-0/+117
* [SPARK-7738] [SQL] [PySpark] add reader and writer API in PythonDavies Liu2015-05-195-90/+421
* [SPARK-7150] SparkContext.range() and SQLContext.range()Daoyuan Wang2015-05-184-0/+46
* [SPARK-6216] [PYSPARK] check python version of worker with driverDavies Liu2015-05-186-12/+16
* [SPARK-7380] [MLLIB] pipeline stages should be copyable in PythonXiangrui Meng2015-05-1813-254/+490
* [SPARK-6657] [PYSPARK] Fix doc warningsXiangrui Meng2015-05-184-10/+11
* [SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple filesDavies Liu2015-05-156-449/+552
* [SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in PythonDavies Liu2015-05-151-30/+46
* [SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on...FlytxtRnD2015-05-151-0/+6
* [SPARK-6258] [MLLIB] GaussianMixture Python API parity checkYanbo Liang2015-05-151-14/+53
* [SPARK-7548] [SQL] Add explode function for DataFramesMichael Armbrust2015-05-143-3/+44
* [SPARK-7619] [PYTHON] fix docstring signatureXiangrui Meng2015-05-145-55/+52
* [SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.mlXiangrui Meng2015-05-143-1/+43
* [SPARK-7278] [PySpark] DateType should find datetime.datetime acceptableksonj2015-05-141-1/+1
* [SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classificationBurak Yavuz2015-05-133-10/+501
* [SPARK-7593] [ML] Python Api for ml.feature.BucketizerBurak Yavuz2015-05-131-0/+77
* [SPARK-7321][SQL] Add Column expression for conditional statements (when/othe...Reynold Xin2015-05-123-2/+57
* [SPARK-7572] [MLLIB] do not import Param/Params under pyspark.mlXiangrui Meng2015-05-123-7/+11
* [SPARK-7487] [ML] Feature Parity in PySpark for ml.regressionBurak Yavuz2015-05-126-8/+709
* [SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pysparkDaoyuan Wang2015-05-122-0/+133