aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
...
* [SPARK-5469] restructure pyspark.sql into multiple filesDavies Liu2015-02-0911-2755/+2961
* [SPARK-5678] Convert DataFrame to pandas.DataFrame and SeriesDavies Liu2015-02-091-0/+25
* SPARK-5633 pyspark saveAsTextFile support for compression codecVladimir Vladimirov2015-02-061-2/+20
* [SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source...Cheng Lian2015-02-051-2/+7
* [SQL][DataFrame] Minor cleanup.Reynold Xin2015-02-041-11/+0
* [SPARK-5605][SQL][DF] Allow using String to specify colum name in DSL aggrega...Reynold Xin2015-02-041-5/+8
* [SPARK-5577] Python udf for DataFrameDavies Liu2015-02-042-120/+113
* [SPARK-5588] [SQL] support select/filter by SQL expressionDavies Liu2015-02-041-10/+43
* [SPARK-5585] Flaky test in MLlib pythonDavies Liu2015-02-041-1/+1
* [SPARK-5379][Streaming] Add awaitTerminationOrTimeoutzsxwing2015-02-041-0/+9
* [SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streamingfreeman2015-02-032-1/+30
* [SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressionsReynold Xin2015-02-031-3/+2
* [SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python APIDavies Liu2015-02-034-447/+581
* [SPARK-5536] replace old ALS implementation by the new oneXiangrui Meng2015-02-021-8/+8
* [SPARK-5012][MLLib][PySpark]Python API for Gaussian Mixture ModelFlytxtRnD2015-02-024-5/+147
* [SPARK-5154] [PySpark] [Streaming] Kafka streaming support in PythonDavies Liu2015-02-023-2/+100
* [SQL] Improve DataFrame API error reportingReynold Xin2015-02-022-25/+56
* Make sure only owner can read / write to directories created for the job.Marcelo Vanzin2015-02-021-1/+2
* [SPARK-5094][MLlib] Add Python API for Gradient Boosted TreesKazuki Taniguchi2015-01-302-53/+209
* [SPARK-5464] Fix help() for Python DataFrame instancesJosh Rosen2015-01-292-3/+13
* [SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.Reynold Xin2015-01-291-2/+2
* [SPARK-5477] refactor stat.pyXiangrui Meng2015-01-294-54/+96
* [SPARK-5445][SQL] Made DataFrame dsl usable in JavaReynold Xin2015-01-281-16/+22
* [SPARK-5430] move treeReduce and treeAggregate from mllib to coreXiangrui Meng2015-01-281-1/+90
* [SPARK-4586][MLLIB] Python API for ML pipeline and parametersXiangrui Meng2015-01-2816-16/+1124
* [SPARK-4387][PySpark] Refactoring python profiling code to make it extensibleYandu Oppacher2015-01-288-71/+232
* [SPARK-5440][pyspark] Add toLocalIterator to pyspark rddMichael Nazario2015-01-281-0/+14
* SPARK-5458. Refer to aggregateByKey instead of combineByKey in docsSandy Ryza2015-01-281-2/+2
* [SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctlyWinston Chen2015-01-281-0/+19
* [SPARK-5097][SQL] DataFrameReynold Xin2015-01-273-336/+793
* [SPARK-5063] More helpful error messages for several invalid operationsJosh Rosen2015-01-232-0/+19
* [SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seednate.crosswhite2015-01-212-3/+18
* SPARK-5270 [CORE] Provide isEmpty() function in RDD APISean Owen2015-01-191-0/+12
* [SPARK-5193][SQL] Remove Spark SQL Java-specific API.Reynold Xin2015-01-161-36/+12
* [SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.Reynold Xin2015-01-151-8/+8
* [SPARK-5224] [PySpark] improve performance of parallelize list/ndarrayDavies Liu2015-01-152-1/+5
* [SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexingMechCoder2015-01-142-0/+29
* [SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlibDavies Liu2015-01-131-4/+2
* [SPARK-5138][SQL] Ensure schema can be inferred from a namedtupleGabe Mulley2015-01-121-4/+14
* [SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...RJ Nowling2015-01-081-0/+187
* [SPARK-5089][PYSPARK][MLLIB] Fix vector convertfreeman2015-01-052-1/+11
* [SPARK-3325][Streaming] Add a parameter to the method print in class DStreamYadong Qi2015-01-021-5/+7
* [SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...Brennon York2014-12-271-1/+1
* [SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`jbencook2014-12-231-0/+28
* [SPARK-4822] Use sphinx tags for Python doc annotationslewuathe2014-12-176-17/+17
* [SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand docJoseph K. Bradley2014-12-173-30/+5
* [SPARK-4866] support StructType as key in MapTypeDavies Liu2014-12-162-7/+18
* [SPARK-4855][mllib] testing the Chi-squared hypothesis testjbencook2014-12-161-1/+99
* [SPARK-4841] fix zip with textFile()Davies Liu2014-12-153-14/+26
* [SPARK-4494][mllib] IDFModel.transform() add support for single vectorYuu ISHIKAWA2014-12-151-7/+15