aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-17472] [PYSPARK] Better error message for serialization failures of la...Eric Liang2016-09-142-1/+20
* [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in...Josh Rosen2016-09-142-4/+19
* [SPARK-17525][PYTHON] Remove SparkContext.clearFiles() from the PySpark API a...Sami Jaktholm2016-09-141-8/+0
* [SPARK-17474] [SQL] fix python udf in TakeOrderedAndProjectExecDavies Liu2016-09-121-0/+8
* [SPARK-17389][FOLLOW-UP][ML] Change KMeans k-means|| default init steps from ...Yanbo Liang2016-09-112-8/+8
* [MINOR][ML] Correct weights doc of MultilayerPerceptronClassificationModel.Yanbo Liang2016-09-061-1/+1
* [SPARK-17311][MLLIB] Standardize Python-Java MLlib API to accept optional lon...Sean Owen2016-09-041-2/+2
* [SPARK-17298][SQL] Require explicit CROSS join for cartesian productsSrinath Shankar2016-09-031-1/+1
* [SPARK-17261] [PYSPARK] Using HiveContext after re-creating SparkContext in S...Jeff Zhang2016-09-021-0/+1
* [SPARK-17264][SQL] DataStreamWriter should document that it only supports Par...Sean Owen2016-08-301-1/+1
* [SPARK-17001][ML] Enable standardScaler to standardize sparse vectors when wi...Sean Owen2016-08-271-3/+2
* [SPARK-17197][ML][PYSPARK] PySpark LiR/LoR supports tree aggregation level co...Yanbo Liang2016-08-254-11/+42
* [SPARK-17215][SQL] Method `SQLContext.parseDataType(dataTypeString: String)` ...jiangxingbo2016-08-246-13/+16
* [SPARK-16216][SQL] Read/write timestamps and dates in ISO 8601 and dateFormat...hyukjinkwon2016-08-242-20/+66
* [SPARK-16781][PYSPARK] java launched by PySpark as gateway may not be the sam...Sean Owen2016-08-243-1/+1
* [SPARK-15113][PYSPARK][ML] Add missing num features num classesHolden Karau2016-08-223-11/+64
* [SPARK-15018][PYSPARK][ML] Improve handling of PySpark Pipeline when used wit...Bryan Cutler2016-08-192-8/+14
* [SPARK-16965][MLLIB][PYSPARK] Fix bound checking for SparseVector.Jeff Zhang2016-08-191-0/+15
* [SPARK-16961][CORE] Fixed off-by-one error that biased randomizeInPlaceNick Lavers2016-08-193-8/+8
* [MINOR][DOC] Fix the descriptions for `properties` argument in the documenati...mvervuurt2016-08-161-5/+6
* [SPARK-17035] [SQL] [PYSPARK] Improve Timestamp not to lose precision for all...Dongjoon Hyun2016-08-162-1/+6
* [SPARK-16700][PYSPARK][SQL] create DataFrame from dict/Row with schemaDavies Liu2016-08-154-28/+62
* [MINOR][ML] Rename TreeEnsembleModels to TreeEnsembleModel for PySparkYanbo Liang2016-08-112-6/+6
* [SPARK-16324][SQL] regexp_extract should doc that it returns empty string whe...Sean Owen2016-08-101-1/+5
* [SPARK-16950] [PYSPARK] fromOffsets parameter support in KafkaUtils.createDir...Mariusz Strzelecki2016-08-092-9/+6
* [SPARK-16409][SQL] regexp_extract with optional groups causes NPESean Owen2016-08-071-0/+3
* [SPARK-16772][PYTHON][DOCS] Fix API doc references to UDFRegistration + Updat...Nicholas Chammas2016-08-063-9/+5
* [SPARK-16831][PYTHON] Fixed bug in CrossValidator.avgMetrics=^_^=2016-08-031-1/+3
* [SPARK-16062] [SPARK-15989] [SQL] Fix two bugs of Python-only UDTsLiang-Chi Hsieh2016-08-021-0/+35
* [SPARK-16772][PYTHON][DOCS] Restore "datatype string" to Python API docstringsNicholas Chammas2016-07-292-12/+8
* [SPARK-16772] Correct API doc references to PySpark classes + formatting fixesNicholas Chammas2016-07-288-58/+75
* [SPARK-15254][DOC] Improve ML pipeline Cross Validation Scaladoc & PyDockrishnakalyan32016-07-271-2/+11
* [SPARK-16653][ML][OPTIMIZER] update ANN convergence tolerance param default t...WeichenXu2016-07-251-4/+4
* [PYSPARK] add picklable SparseMatrix in pyspark.ml.commonWeichenXu2016-07-241-0/+1
* [SPARK-16662][PYSPARK][SQL] fix HiveContext warning bugWeichenXu2016-07-231-5/+4
* [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co...Dongjoon Hyun2016-07-221-0/+2
* [SPARK-16494][ML] Upgrade breeze version to 0.12Yanbo Liang2016-07-191-1/+1
* [DOC] improve python doc for rdd.histogram and dataframe.joinMortada Mehyar2016-07-182-14/+14
* [SPARK-14817][ML][MLLIB][DOC] Made DataFrame-based API primary in MLlib guideJoseph K. Bradley2016-07-153-4/+7
* [SPARK-16546][SQL][PYSPARK] update python dataframe.dropWeichenXu2016-07-141-8/+19
* [SPARK-16503] SparkSession should provide Spark versionLiwei Lin2016-07-131-0/+6
* [SPARK-16536][SQL][PYSPARK][MINOR] Expose `sql` in PySpark ShellDongjoon Hyun2016-07-131-0/+1
* [SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit ...Joseph K. Bradley2016-07-1313-197/+11
* [SPARK-16429][SQL] Include `StringType` columns in `describe()`Dongjoon Hyun2016-07-081-4/+4
* [SPARK-13638][SQL] Add quoteAll option to CSV DataFrameWriterJurriaan Pruis2016-07-081-2/+5
* [SPARK-16052][SQL] Improve `CollapseRepartition` optimizer for Repartition/Re...Dongjoon Hyun2016-07-081-2/+2
* [MINOR][PYSPARK][DOC] Fix wrongly formatted examples in PySpark documentationhyukjinkwon2016-07-066-23/+26
* [SPARK-16348][ML][MLLIB][PYTHON] Use full classpaths for pyspark ML JVM callsJoseph K. Bradley2016-07-058-26/+28
* [SPARK-16335][SQL] Structured streaming should fail if source directory does ...Reynold Xin2016-07-011-7/+4
* [SPARK-15954][SQL] Disable loading test tables in Python testsReynold Xin2016-06-301-1/+1