aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey.Milan Straka2015-04-102-1/+12
* [SPARK-6781] [SQL] use sqlContext in python shellDavies Liu2015-04-087-53/+52
* [SPARK-6506] [pyspark] Do not try to retrieve SPARK_HOME when not needed...Marcelo Vanzin2015-04-081-2/+1
* [SPARK-6667] [PySpark] remove setReuseAddressDavies Liu2015-04-021-0/+1
* [SPARK-6660][MLLIB] pythonToJava doesn't recognize object arraysXiangrui Meng2015-04-011-0/+8
* [SPARK-6553] [pyspark] Support functools.partial as UDFksonj2015-04-012-1/+33
* [SPARK-6642][MLLIB] use 1.2 lambda scaling and remove addImplicit from Normal...Xiangrui Meng2015-04-011-3/+3
* [SPARK-6657] [Python] [Docs] fixed python doc build warningsJoseph K. Bradley2015-04-011-16/+10
* [SPARK-6651][MLLIB] delegate dense vector arithmetics to the underlying numpy...Xiangrui Meng2015-04-011-1/+37
* [Doc] Improve Python DataFrame documentationReynold Xin2015-03-315-390/+250
* [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.Reynold Xin2015-03-312-6/+45
* [SPARK-6119][SQL] DataFrame support for missing data handlingReynold Xin2015-03-302-0/+182
* [SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() a...Davies Liu2015-03-301-27/+60
* [SPARK-6571][MLLIB] use wrapper in MatrixFactorizationModel.loadXiangrui Meng2015-03-301-0/+8
* [DOC] Improvements to Python docs.Reynold Xin2015-03-282-14/+9
* [SPARK-6117] [SQL] Improvements to DataFrame.describe()Reynold Xin2015-03-261-0/+19
* [SPARK-6536] [PySpark] Column.inSet() in PythonDavies Liu2015-03-261-0/+17
* [SPARK-6421][MLLIB] _regression_train_wrapper does not test initialWeights co...lewuathe2015-03-202-1/+9
* [SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTab...Yin Huai2015-03-181-2/+2
* [SPARK-6210] [SQL] use prettyString as column name in agg()Davies Liu2015-03-141-16/+16
* [SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()Davies Liu2015-03-133-34/+23
* [mllib] [python] Add LassoModel to __all__ in regression.pyJoseph K. Bradley2015-03-121-2/+4
* [SPARK-6294] fix hang when call take() in JVM on PythonRDDDavies Liu2015-03-122-1/+9
* [Docs] Replace references to SchemaRDD with DataFrameReynold Xin2015-03-092-3/+3
* [SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlibXiangrui Meng2015-03-024-15/+79
* [SPARK-6121][SQL][MLLIB] simpleString for UDTXiangrui Meng2015-03-022-1/+4
* [SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter ...Yanbo Liang2015-03-021-1/+1
* [Streaming][Minor]Fix some error docs in streaming examplesSaisai Shao2015-03-021-1/+1
* [SPARK-6053][MLLIB] support save/load in PySpark's ALSXiangrui Meng2015-03-012-2/+76
* [SPARK-6055] [PySpark] fix incorrect __eq__ of DataTypeDavies Liu2015-02-274-137/+86
* [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtil...Tathagata Das2015-02-261-15/+27
* [SPARK-6007][SQL] Add numRows param in DataFrame.show()Jacky Li2015-02-261-3/+3
* [SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide with save/l...Joseph K. Bradley2015-02-253-89/+141
* [SPARK-5944] [PySpark] fix version in Python API docsDavies Liu2015-02-251-0/+4
* [SPARK-5994] [SQL] Python DataFrame documentation fixesDavies Liu2015-02-245-180/+129
* [SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.Reynold Xin2015-02-242-3/+11
* [SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializerDavies Liu2015-02-242-1/+7
* [SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryExecutionMichael Armbrust2015-02-231-15/+15
* [SPARK-5943][Streaming] Update the test to use new API to reduce the warningSaisai Shao2015-02-231-1/+1
* [SPARK-5909][SQL] Add a clearCache command to Spark SQL's cache managerYin Huai2015-02-201-0/+4
* [SPARK-5898] [SPARK-5896] [SQL] [PySpark] create DataFrame from pandas and t...Davies Liu2015-02-203-20/+20
* [SPARK-5867] [SPARK-5892] [doc] [ml] [mllib] Doc cleanups for 1.3 releaseJoseph K. Bradley2015-02-2012-26/+41
* [SPARK-5904][SQL] DataFrame API fixes.Reynold Xin2015-02-191-36/+20
* [SPARK-5722] [SQL] [PySpark] infer int as LongTypeDavies Liu2015-02-183-11/+33
* [SPARK-5878] fix DataFrame.repartition() in PythonDavies Liu2015-02-181-1/+7
* [SPARK-5811] Added documentation for maven coordinates and added Spark Packag...Burak Yavuz2015-02-171-4/+65
* [SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySparkDavies Liu2015-02-174-22/+75
* [SPARK-5872] [SQL] create a sqlCtx in pyspark shellDavies Liu2015-02-172-3/+22
* [SPARK-5871] output explain in PythonDavies Liu2015-02-171-3/+20
* [SPARK-4172] [PySpark] Progress API in PythonDavies Liu2015-02-174-7/+142