aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-4586][MLLIB] Python API for ML pipeline and parametersXiangrui Meng2015-01-2816-16/+1124
* [SPARK-4387][PySpark] Refactoring python profiling code to make it extensibleYandu Oppacher2015-01-288-71/+232
* [SPARK-5440][pyspark] Add toLocalIterator to pyspark rddMichael Nazario2015-01-281-0/+14
* SPARK-5458. Refer to aggregateByKey instead of combineByKey in docsSandy Ryza2015-01-281-2/+2
* [SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctlyWinston Chen2015-01-281-0/+19
* [SPARK-5097][SQL] DataFrameReynold Xin2015-01-273-336/+793
* [SPARK-5063] More helpful error messages for several invalid operationsJosh Rosen2015-01-232-0/+19
* [SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seednate.crosswhite2015-01-212-3/+18
* SPARK-5270 [CORE] Provide isEmpty() function in RDD APISean Owen2015-01-191-0/+12
* [SPARK-5193][SQL] Remove Spark SQL Java-specific API.Reynold Xin2015-01-161-36/+12
* [SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.Reynold Xin2015-01-151-8/+8
* [SPARK-5224] [PySpark] improve performance of parallelize list/ndarrayDavies Liu2015-01-152-1/+5
* [SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexingMechCoder2015-01-142-0/+29
* [SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlibDavies Liu2015-01-131-4/+2
* [SPARK-5138][SQL] Ensure schema can be inferred from a namedtupleGabe Mulley2015-01-121-4/+14
* [SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...RJ Nowling2015-01-081-0/+187
* [SPARK-5089][PYSPARK][MLLIB] Fix vector convertfreeman2015-01-052-1/+11
* [SPARK-3325][Streaming] Add a parameter to the method print in class DStreamYadong Qi2015-01-021-5/+7
* [SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...Brennon York2014-12-271-1/+1
* [SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`jbencook2014-12-231-0/+28
* [SPARK-4822] Use sphinx tags for Python doc annotationslewuathe2014-12-176-17/+17
* [SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand docJoseph K. Bradley2014-12-173-30/+5
* [SPARK-4866] support StructType as key in MapTypeDavies Liu2014-12-162-7/+18
* [SPARK-4855][mllib] testing the Chi-squared hypothesis testjbencook2014-12-161-1/+99
* [SPARK-4841] fix zip with textFile()Davies Liu2014-12-153-14/+26
* [SPARK-4494][mllib] IDFModel.transform() add support for single vectorYuu ISHIKAWA2014-12-151-7/+15
* [SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + D...Joseph K. Bradley2014-12-041-3/+3
* [SPARK-4548] []SPARK-4517] improve performance of python broadcastDavies Liu2014-11-245-233/+80
* [SPARK-4578] fix asDict() with nested Row()Davies Liu2014-11-242-4/+5
* [SPARK-4562] [MLlib] speedup vectorDavies Liu2014-11-242-26/+53
* [SPARK-4531] [MLlib] cache serialized java objectDavies Liu2014-11-214-13/+8
* [SPARK-4477] [PySpark] remove numpy from RDDSamplerDavies Liu2014-11-202-69/+40
* [SPARK-4439] [MLlib] add python api for random forestDavies Liu2014-11-202-23/+221
* [SPARK-4228][SQL] SchemaRDD to JSONDan McClary2014-11-201-1/+16
* [SPARK-4384] [PySpark] improve sort spillingDavies Liu2014-11-191-1/+10
* [DOC][PySpark][Streaming] Fix docstring for sphinxKen Takagiwa2014-11-191-2/+2
* [SPARK-4327] [PySpark] Python API for RDD.randomSplit()Davies Liu2014-11-182-3/+41
* [SPARK-3721] [PySpark] broadcast objects larger than 2GDavies Liu2014-11-186-17/+239
* [SPARK-4306] [MLlib] Python API for LogisticRegressionWithLBFGSDavies Liu2014-11-181-4/+53
* [SPARK-4396] allow lookup by index in Python's RatingXiangrui Meng2014-11-181-11/+15
* [SPARK-4435] [MLlib] [PySpark] improve classificationDavies Liu2014-11-181-29/+106
* [SPARK-4415] [PySpark] JVM should exit after Python exitDavies Liu2014-11-141-1/+3
* [SPARK-4398][PySpark] specialize sc.parallelize(xrange)Xiangrui Meng2014-11-141-4/+21
* [SPARK-4372][MLLIB] Make LR and SVM's default parameters consistent in Scala ...Xiangrui Meng2014-11-132-35/+37
* [SPARK-4348] [PySpark] [MLlib] rename random.py to rand.pyDavies Liu2014-11-136-20/+38
* [SPARK-4369] [MLLib] fix TreeModel.predict() with RDDDavies Liu2014-11-121-12/+14
* [SPARK-4324] [PySpark] [MLlib] support numpy.array for all MLlib APIDavies Liu2014-11-107-32/+105
* [MLLIB] [PYTHON] SPARK-4221: Expose nonnegative ALS in the python APIMichelangelo D'Agostino2014-11-071-15/+25
* [SPARK-4304] [PySpark] Fix sort on empty RDDDavies Liu2014-11-072-0/+5
* [SPARK-4186] add binaryFiles and binaryRecords in PythonDavies Liu2014-11-062-1/+50