path: root/python
Commit message | Author | Date | Files | Lines (-/+)
* [SPARK-6720][MLLIB] PySpark MultivariateStatisticalSummary unit test for norm... | lewuathe | 2015-04-07 | 1 | -0/+7
* [SPARK-6262][MLLIB]Implement missing methods for MultivariateStatisticalSummary | lewuathe | 2015-04-05 | 2 | -0/+12
* [SPARK-6615][MLLIB] Python API for Word2Vec | lewuathe | 2015-04-03 | 2 | -6/+57
* [SPARK-6667] [PySpark] remove setReuseAddress | Davies Liu | 2015-04-02 | 1 | -0/+1
* [SPARK-6660][MLLIB] pythonToJava doesn't recognize object arrays | Xiangrui Meng | 2015-04-01 | 1 | -0/+8
* [SPARK-6553] [pyspark] Support functools.partial as UDF | ksonj | 2015-04-01 | 2 | -1/+33
* [SPARK-6576] [MLlib] [PySpark] DenseMatrix in PySpark should support indexing | MechCoder | 2015-04-01 | 2 | -0/+17
* [SPARK-6642][MLLIB] use 1.2 lambda scaling and remove addImplicit from Normal... | Xiangrui Meng | 2015-04-01 | 1 | -3/+3
* [SPARK-6657] [Python] [Docs] fixed python doc build warnings | Joseph K. Bradley | 2015-04-01 | 2 | -17/+11
* [SPARK-6651][MLLIB] delegate dense vector arithmetics to the underlying numpy... | Xiangrui Meng | 2015-04-01 | 1 | -1/+37
* [Doc] Improve Python DataFrame documentation | Reynold Xin | 2015-03-31 | 5 | -390/+250
* [SPARK-6255] [MLLIB] Support multiclass classification in Python API | Yanbo Liang | 2015-03-31 | 2 | -28/+116
* [SPARK-6598][MLLIB] Python API for IDFModel | lewuathe | 2015-03-31 | 2 | -0/+20
* [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python. | Reynold Xin | 2015-03-31 | 2 | -6/+45
* [SPARK-6119][SQL] DataFrame support for missing data handling | Reynold Xin | 2015-03-30 | 2 | -0/+182
* [SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() a... | Davies Liu | 2015-03-30 | 1 | -27/+60
* [DOC] Improvements to Python docs. | Reynold Xin | 2015-03-28 | 3 | -14/+17
* [SPARK-6571][MLLIB] use wrapper in MatrixFactorizationModel.load | Xiangrui Meng | 2015-03-28 | 1 | -0/+8
* [SPARK-6117] [SQL] Improvements to DataFrame.describe() | Reynold Xin | 2015-03-26 | 1 | -0/+19
* [SPARK-6536] [PySpark] Column.inSet() in Python | Davies Liu | 2015-03-26 | 1 | -0/+17
* [SPARK-6256] [MLlib] MLlib Python API parity check for regression | Yanbo Liang | 2015-03-25 | 1 | -7/+36
* [SPARK-6421][MLLIB] _regression_train_wrapper does not test initialWeights co... | lewuathe | 2015-03-20 | 2 | -1/+9
* [Spark 6096][MLlib] Add Naive Bayes load save methods in Python | Xusen Yin | 2015-03-20 | 1 | -1/+30
* [SPARK-6095] [MLLIB] Support model save/load in Python's linear models | Yanbo Liang | 2015-03-20 | 3 | -3/+145
* [SPARK-6370][core] Documentation: Improve all 3 docs for RDD.sample | mbonaci | 2015-03-20 | 1 | -0/+6
* [SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTab... | Yin Huai | 2015-03-18 | 1 | -2/+2
* [SPARK-6226][MLLIB] add save/load in PySpark's KMeansModel | Xiangrui Meng | 2015-03-17 | 2 | -5/+27
* [SPARK-6327] [PySpark] fix launch spark-submit from python | Davies Liu | 2015-03-16 | 1 | -4/+2
* [SPARK-6210] [SQL] use prettyString as column name in agg() | Davies Liu | 2015-03-14 | 1 | -16/+16
* [mllib] [python] Add LassoModel to __all__ in regression.py | Joseph K. Bradley | 2015-03-12 | 1 | -2/+4
* [SPARK-6294] fix hang when call take() in JVM on PythonRDD | Davies Liu | 2015-03-12 | 2 | -1/+9
* [SPARK-4924] Add a library for launching Spark jobs programmatically. | Marcelo Vanzin | 2015-03-11 | 1 | -2/+1
* [SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect() | Davies Liu | 2015-03-09 | 3 | -34/+23
* [Docs] Replace references to SchemaRDD with DataFrame | Reynold Xin | 2015-03-09 | 2 | -3/+3
* [SPARK-6090][MLLIB] add a basic BinaryClassificationMetrics to PySpark/MLlib | Xiangrui Meng | 2015-03-05 | 3 | -0/+91
* [SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlib | Xiangrui Meng | 2015-03-02 | 4 | -15/+79
* [SPARK-6127][Streaming][Docs] Add Kafka to Python api docs | Tathagata Das | 2015-03-02 | 1 | -0/+7
* [SPARK-6121][SQL][MLLIB] simpleString for UDT | Xiangrui Meng | 2015-03-02 | 2 | -1/+4
* [SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter ... | Yanbo Liang | 2015-03-02 | 1 | -1/+1
* [Streaming][Minor]Fix some error docs in streaming examples | Saisai Shao | 2015-03-02 | 1 | -1/+1
* [SPARK-6053][MLLIB] support save/load in PySpark's ALS | Xiangrui Meng | 2015-03-01 | 2 | -2/+76
* [SPARK-6055] [PySpark] fix incorrect __eq__ of DataType | Davies Liu | 2015-02-27 | 4 | -137/+86
* [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtil... | Tathagata Das | 2015-02-26 | 1 | -15/+27
* [SPARK-6007][SQL] Add numRows param in DataFrame.show() | Jacky Li | 2015-02-26 | 1 | -3/+3
* [SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide with save/l... | Joseph K. Bradley | 2015-02-25 | 3 | -89/+141
* [SPARK-5944] [PySpark] fix version in Python API docs | Davies Liu | 2015-02-25 | 3 | -4/+8
* [SPARK-5994] [SQL] Python DataFrame documentation fixes | Davies Liu | 2015-02-24 | 6 | -183/+129
* [SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python. | Reynold Xin | 2015-02-24 | 2 | -3/+11
* [SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer | Davies Liu | 2015-02-24 | 2 | -1/+7
* [SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryExecution | Michael Armbrust | 2015-02-23 | 1 | -15/+15