aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-5063] More helpful error messages for several invalid operationsJosh Rosen2015-01-232-0/+19
* Make sure only owner can read / write to directories created for the job.Marcelo Vanzin2015-01-211-1/+2
* [SPARK-5224] [PySpark] improve performance of parallelize list/ndarrayDavies Liu2015-01-152-1/+5
* [SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlibDavies Liu2015-01-131-4/+2
* [SPARK-5089][PYSPARK][MLLIB] Fix vector convertfreeman2015-01-052-1/+11
* [SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand docJoseph K. Bradley2014-12-173-30/+5
* [SPARK-4841] fix zip with textFile()Davies Liu2014-12-173-14/+26
* [SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + D...Joseph K. Bradley2014-12-041-3/+3
* [SPARK-4548] []SPARK-4517] improve performance of python broadcastDavies Liu2014-11-245-233/+80
* [SPARK-4578] fix asDict() with nested Row()Davies Liu2014-11-242-4/+5
* [SPARK-4562] [MLlib] speedup vectorDavies Liu2014-11-242-26/+53
* [SPARK-4531] [MLlib] cache serialized java objectDavies Liu2014-11-214-13/+8
* [SPARK-4477] [PySpark] remove numpy from RDDSamplerDavies Liu2014-11-202-69/+40
* [SPARK-4439] [MLlib] add python api for random forestDavies Liu2014-11-202-23/+221
* [SPARK-4228][SQL] SchemaRDD to JSONDan McClary2014-11-201-1/+16
* [DOC][PySpark][Streaming] Fix docstring for sphinxKen Takagiwa2014-11-191-2/+2
* [SPARK-4384] [PySpark] improve sort spillingDavies Liu2014-11-191-1/+10
* [SPARK-4327] [PySpark] Python API for RDD.randomSplit()Davies Liu2014-11-182-3/+41
* [SPARK-3721] [PySpark] broadcast objects larger than 2GDavies Liu2014-11-186-17/+239
* [SPARK-4306] [MLlib] Python API for LogisticRegressionWithLBFGSDavies Liu2014-11-181-4/+53
* [SPARK-4396] allow lookup by index in Python's RatingXiangrui Meng2014-11-181-11/+15
* [SPARK-4435] [MLlib] [PySpark] improve classificationDavies Liu2014-11-181-29/+106
* [SPARK-4415] [PySpark] JVM should exit after Python exitDavies Liu2014-11-141-1/+3
* [SPARK-4398][PySpark] specialize sc.parallelize(xrange)Xiangrui Meng2014-11-141-4/+21
* [SPARK-4372][MLLIB] Make LR and SVM's default parameters consistent in Scala ...Xiangrui Meng2014-11-132-35/+37
* [SPARK-4348] [PySpark] [MLlib] rename random.py to rand.pyDavies Liu2014-11-136-20/+38
* [SPARK-4369] [MLLib] fix TreeModel.predict() with RDDDavies Liu2014-11-121-12/+14
* [SPARK-4324] [PySpark] [MLlib] support numpy.array for all MLlib APIDavies Liu2014-11-107-32/+105
* [MLLIB] [PYTHON] SPARK-4221: Expose nonnegative ALS in the python APIMichelangelo D'Agostino2014-11-071-15/+25
* [SPARK-4304] [PySpark] Fix sort on empty RDDDavies Liu2014-11-072-0/+5
* [SPARK-4186] add binaryFiles and binaryRecords in PythonDavies Liu2014-11-062-1/+50
* [SPARK-3964] [MLlib] [PySpark] add Hypothesis test Python APIDavies Liu2014-11-043-4/+153
* [SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa...Davies Liu2014-11-038-186/+94
* [SPARK-3573][MLLIB] Make MLlib's Vector compatible with SQL's SchemaRDDXiangrui Meng2014-11-032-3/+86
* [SPARK-4192][SQL] Internal API for Python UDTXiangrui Meng2014-11-032-3/+296
* [SPARK-3594] [PySpark] [SQL] take more rows to infer schema or samplingDavies Liu2014-11-032-68/+147
* [SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sampleXiangrui Meng2014-11-033-9/+20
* [SPARK-3930] [SPARK-3933] Support fixed-precision decimal in SQL, and some op...Matei Zaharia2014-11-011-3/+32
* [SPARK-3569][SQL] Add metadata field to StructFieldXiangrui Meng2014-11-011-4/+11
* [SPARK-3870] EOL character enforcementKousuke Saruta2014-10-312-249/+249
* [SPARK-4150][PySpark] return self in rdd.setNameXiangrui Meng2014-10-311-2/+2
* [SPARK-3826][SQL]enable hive-thriftserver to support hive-0.13.1wangfei2014-10-311-27/+0
* [SPARK-4124] [MLlib] [PySpark] simplify serialization in MLlib Python APIDavies Liu2014-10-3011-347/+242
* [SPARK-4133] [SQL] [PySpark] type conversionfor python udfDavies Liu2014-10-281-3/+13
* [SPARK-3988][SQL] add public API for date typeDaoyuan Wang2014-10-281-18/+39
* [SPARK-4058] [PySpark] Log file name is hard coded even though there is a var...Kousuke Saruta2014-10-281-2/+2
* [SPARK-3961] [MLlib] [PySpark] Python API for mllib.featureDavies Liu2014-10-282-57/+354
* SPARK-4022 [CORE] [MLLIB] Replace colt dependency (LGPL) with commons-mathSean Owen2014-10-271-1/+1
* [SPARK-4088] [PySpark] Python worker should exit after socket is closed by JVMDavies Liu2014-10-251-5/+7
* [SPARK-4051] [SQL] [PySpark] Convert Row into dictionaryDavies Liu2014-10-242-0/+21