aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark
Commit message (Expand)AuthorAgeFilesLines
...
* [SPARK-7916] [MLLIB] MLlib Python doc parity check for classification and reg...Yanbo Liang2015-06-162-107/+247
* [SPARK-6411] [SQL] [PySpark] support date/datetime with timezone in PythonDavies Liu2015-06-112-9/+50
* [SPARK-8189] [SQL] use Long for TimestampType in SQLDavies Liu2015-06-101-0/+11
* [SPARK-7886] Add built-in expressions to FunctionRegistry.Reynold Xin2015-06-091-1/+1
* [SPARK-7990][SQL] Add methods to facilitate equi-join on multiple joining keysLiang-Chi Hsieh2015-06-081-13/+32
* [SPARK-2808] [STREAMING] [KAFKA] cleanup tests fromcody koeninger2015-06-071-5/+0
* [SPARK-8146] DataFrame Python API: Alias replace in df.naReynold Xin2015-06-072-26/+22
* [SPARK-7639] [PYSPARK] [MLLIB] Python API for KernelDensityMechCoder2015-06-062-1/+63
* [SPARK-7991] [PySpark] Adding support for passing lists to describe.amey2015-06-051-0/+12
* [SPARK-8116][PYSPARK] Allow sc.range() to take a single argument.Ted Blackman2015-06-041-2/+12
* [SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe...Mike Dusenberry2015-06-041-3/+18
* Update documentation for [SPARK-7980] [SQL] Support SQLContext.range(end)Reynold Xin2015-06-031-0/+2
* [SPARK-7980] [SQL] Support SQLContext.range(end)animesh2015-06-032-2/+12
* [SPARK-8060] Improve DataFrame Python test coverage and documentation.Reynold Xin2015-06-035-227/+176
* [SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robustMechCoder2015-06-021-1/+3
* [SPARK-8038] [SQL] [PYSPARK] fix Column.when() and otherwise()Davies Liu2015-06-021-3/+28
* [SPARK-7432] [MLLIB] fix flaky CrossValidator doctestXiangrui Meng2015-06-021-10/+9
* [SPARK-8021] [SQL] [PYSPARK] make Python read/write API consistent with ScalaDavies Liu2015-06-021-27/+94
* [minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem...Reynold Xin2015-06-011-0/+3
* [SPARK-7497] [PYSPARK] [STREAMING] fix streaming flaky testsDavies Liu2015-06-011-8/+8
* [SPARK-7978] [SQL] [PYSPARK] DecimalType should not be singletonDavies Liu2015-05-312-2/+25
* [SPARK-7918] [MLLIB] MLlib Python doc parity check for evaluation and featureYanbo Liang2015-05-302-39/+36
* [SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflictMichael Nazario2015-05-295-20/+4
* [SPARK-7912] [SPARK-7921] [MLLIB] Update OneHotEncoder to handle ML attribute...Xiangrui Meng2015-05-291-25/+33
* [SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModelXiangrui Meng2015-05-282-3/+32
* [MINOR] fix RegressionEvaluator docXiangrui Meng2015-05-281-1/+1
* [SPARK-7339] [PYSPARK] PySpark shuffle spill memory sometimes are not correctlinweizhong2015-05-261-4/+4
* [SPARK-7833] [ML] Add python wrapper for RegressionEvaluatorRam Sriharsha2015-05-241-2/+66
* [SPARK-7840] add insertInto() to WriterDavies Liu2015-05-232-8/+16
* [SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...Davies Liu2015-05-238-56/+365
* [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4Xiangrui Meng2015-05-214-61/+64
* [SPARK-7794] [MLLIB] update RegexTokenizer default settingsXiangrui Meng2015-05-211-21/+19
* [SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in PythonDavies Liu2015-05-211-2/+46
* [SPARK-7711] Add a startTime property to match the corresponding one in ScalaHolden Karau2015-05-212-0/+9
* [SPARK-7394][SQL] Add Pandas style cast (astype)kaka19922015-05-211-0/+2
* [SPARK-6416] [DOCS] RDD.fold() requires the operator to be commutativeSean Owen2015-05-211-2/+10
* [SPARK-7606] [SQL] [PySpark] add version to Python SQL API docsDavies Liu2015-05-207-18/+170
* [SPARK-7762] [MLLIB] set default value for outputColXiangrui Meng2015-05-202-2/+3
* [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 ...Holden Karau2015-05-208-64/+96
* [SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlibYanbo Liang2015-05-201-0/+117
* [SPARK-7738] [SQL] [PySpark] add reader and writer API in PythonDavies Liu2015-05-195-90/+421
* [SPARK-7150] SparkContext.range() and SQLContext.range()Daoyuan Wang2015-05-184-0/+46
* [SPARK-6216] [PYSPARK] check python version of worker with driverDavies Liu2015-05-186-12/+16
* [SPARK-7380] [MLLIB] pipeline stages should be copyable in PythonXiangrui Meng2015-05-1813-254/+490
* [SPARK-6657] [PYSPARK] Fix doc warningsXiangrui Meng2015-05-184-10/+11
* [SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple filesDavies Liu2015-05-155-449/+550
* [SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in PythonDavies Liu2015-05-151-30/+46
* [SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on...FlytxtRnD2015-05-151-0/+6
* [SPARK-6258] [MLLIB] GaussianMixture Python API parity checkYanbo Liang2015-05-151-14/+53
* [SPARK-7548] [SQL] Add explode function for DataFramesMichael Armbrust2015-05-143-3/+44