aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-7388] [SPARK-7383] wrapper for VectorAssembler in PythonBurak Yavuz2015-05-074-8/+78
* [SPARK-7295][SQL] bitwise operations for DataFrame DSLShiti2015-05-073-0/+20
* [SPARK-7432] [MLLIB] disable cv doctestXiangrui Meng2015-05-061-4/+4
* [SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline APIXiangrui Meng2015-05-063-6/+194
* [SPARK-6267] [MLLIB] Python API for IsotonicRegressionYanbo Liang2015-05-051-2/+71
* [SPARK-7358][SQL] Move DataFrame mathfunctions into functionsBurak Yavuz2015-05-053-102/+53
* [SPARK-7294][SQL] ADD BETWEEN云峤2015-05-052-0/+15
* [SPARK-7333] [MLLIB] Add BinaryClassificationEvaluator to PySparkXiangrui Meng2015-05-056-3/+176
* [SPARK-7243][SQL] Reduce size for Contingency Tables in DataFramesBurak Yavuz2015-05-051-4/+5
* [SPARK-6612] [MLLIB] [PYSPARK] Python KMeans parityHrishikesh Subramonian2015-05-052-7/+31
* [SPARK-7202] [MLLIB] [PYSPARK] Add SparseMatrixPickler to SerDeMechCoder2015-05-052-2/+5
* [SPARK-7243][SQL] Contingency Tables for DataFramesBurak Yavuz2015-05-042-0/+34
* [SPARK-7319][SQL] Improve the output from DataFrame.show()云峤2015-05-041-36/+69
* [SPARK-7241] Pearson correlation for DataFramesBurak Yavuz2015-05-032-0/+32
* [SPARK-7329] [MLLIB] simplify ParamGridBuilder implXiangrui Meng2015-05-031-19/+9
* [SPARK-7022] [PYSPARK] [ML] Add ML.Tuning.ParamGridBuilder to PySparkOmede Firouz2015-05-031-0/+94
* [SPARK-3444] Fix typo in Dataframes.py introduced in []Dean Chen2015-05-021-1/+1
* [SPARK-7242] added python api for freqItems in DataFramesBurak Yavuz2015-05-012-0/+32
* [SPARK-3444] Provide an easy way to change log levelHolden Karau2015-05-012-1/+8
* [SPARK-2808][Streaming][Kafka] update kafka to 0.8.2cody koeninger2015-05-011-3/+5
* [SPARK-7240][SQL] Single pass covariance calculation for dataframesBurak Yavuz2015-05-013-2/+43
* [SPARK-7274] [SQL] Create Column expression for array/struct creation.Reynold Xin2015-05-011-19/+61
* [SPARK-6257] [PYSPARK] [MLLIB] MLlib API missing items in RecommendationMechCoder2015-04-301-0/+39
* [SPARK-7248] implemented random number generators for DataFramesBurak Yavuz2015-04-302-1/+34
* [SPARK-7156][SQL] Addressed follow up comments for randomSplitBurak Yavuz2015-04-291-1/+6
* [SPARK-7156][SQL] support RandomSplit in DataFramesBurak Yavuz2015-04-291-1/+17
* Better error message on access to non-existing attributeksonj2015-04-291-1/+2
* [SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operationsPatrick Wendell2015-04-291-1/+2
* [SPARK-7188] added python support for math DataFrame functionsBurak Yavuz2015-04-293-1/+131
* [SPARK-7208] [ML] [PYTHON] Added Matrix, SparseMatrix to __all__ list in lina...Joseph K. Bradley2015-04-281-1/+2
* [SPARK-7135][SQL] DataFrame expression for monotonically increasing IDs.Reynold Xin2015-04-281-1/+21
* [SPARK-5946] [STREAMING] Add Python API for direct Kafka streamjerryshao2015-04-272-14/+237
* [SPARK-7152][SQL] Add a Column expression for partition ID.Reynold Xin2015-04-261-9/+21
* [SPARK-7060][SQL] Add alias function to python dataframeYin Huai2015-04-231-0/+14
* [SPARK-6827] [MLLIB] Wrap FPGrowthModel.freqItemsets and make it consistent w...Yanbo Liang2015-04-221-3/+12
* [SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.Reynold Xin2015-04-221-1/+8
* [SPARK-6953] [PySpark] speed up python testsReynold Xin2015-04-218-122/+174
* [SPARK-7036][MLLIB] ALS.train should support DataFrames in PySparkXiangrui Meng2015-04-211-10/+26
* [SPARK-6845] [MLlib] [PySpark] Add isTranposed flag to DenseMatrixMechCoder2015-04-212-16/+49
* [SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expressionDavies Liu2015-04-2110-47/+70
* [SPARK-6661] Python type errors should print type, not objectElisey Zanko2015-04-2010-21/+23
* Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort.Reynold Xin2015-04-171-3/+4
* [SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandasDavies Liu2015-04-173-39/+70
* [SPARK-6911] [SQL] improve accessor for nested typesDavies Liu2015-04-162-5/+62
* [SPARK-4897] [PySpark] Python 3 supportDavies Liu2015-04-1646-1090/+1018
* [SPARK-6893][ML] default pipeline parameter handling in pythonXiangrui Meng2015-04-159-114/+266
* [SPARK-6638] [SQL] Improve performance of StringType in SQLDavies Liu2015-04-151-5/+5
* [SPARK-6886] [PySpark] fix big closure with shuffleDavies Liu2015-04-152-14/+7
* [SPARK-6643][MLLIB] Implement StandardScalerModel missing methodslewuathe2015-04-122-0/+40
* [SPARK-6677] [SQL] [PySpark] fix cached classesDavies Liu2015-04-111-19/+20