path: root/python
Commit message | Author | Date | Files | Lines
* [SPARK-17583][SQL] Remove uesless rowSeparator variable and set auto-expandin... | hyukjinkwon | 2016-09-21 | 2 | -2/+2
* [SPARK-17219][ML] Add NaN value handling in Bucketizer | VinceShieh | 2016-09-21 | 1 | -0/+5
* [SPARK-17017][MLLIB][ML] add a chiSquare Selector based on False Positive Rat... | Peng, Meng | 2016-09-21 | 1 | -5/+66
* [SPARK-17585][PYSPARK][CORE] PySpark SparkContext.addFile supports adding fil... | Yanbo Liang | 2016-09-21 | 4 | -7/+21
* [SPARK-17437] Add uiWebUrl to JavaSparkContext and pyspark.SparkContext | Adrian Petrescu | 2016-09-20 | 1 | -0/+5
* [SPARK-17100] [SQL] fix Python udf in filter on top of outer join | Davies Liu | 2016-09-19 | 1 | -0/+8
* [SPARK-16462][SPARK-16460][SPARK-15144][SQL] Make CSV cast null values properly | Liwei Lin | 2016-09-18 | 2 | -2/+4
* [SPARK-17548][MLLIB] Word2VecModel.findSynonyms no longer spuriously rejects ... | William Benton | 2016-09-17 | 1 | -3/+9
* [SPARK-17472] [PYSPARK] Better error message for serialization failures of la... | Eric Liang | 2016-09-14 | 2 | -1/+20
* [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in... | Josh Rosen | 2016-09-14 | 2 | -4/+19
* [SPARK-17525][PYTHON] Remove SparkContext.clearFiles() from the PySpark API a... | Sami Jaktholm | 2016-09-14 | 1 | -8/+0
* [SPARK-17474] [SQL] fix python udf in TakeOrderedAndProjectExec | Davies Liu | 2016-09-12 | 1 | -0/+8
* [SPARK-17389][FOLLOW-UP][ML] Change KMeans k-means|| default init steps from ... | Yanbo Liang | 2016-09-11 | 2 | -8/+8
* [MINOR][ML] Correct weights doc of MultilayerPerceptronClassificationModel. | Yanbo Liang | 2016-09-06 | 1 | -1/+1
* [SPARK-17311][MLLIB] Standardize Python-Java MLlib API to accept optional lon... | Sean Owen | 2016-09-04 | 1 | -2/+2
* [SPARK-17298][SQL] Require explicit CROSS join for cartesian products | Srinath Shankar | 2016-09-03 | 1 | -1/+1
* [SPARK-17261] [PYSPARK] Using HiveContext after re-creating SparkContext in S... | Jeff Zhang | 2016-09-02 | 1 | -0/+1
* [SPARK-17264][SQL] DataStreamWriter should document that it only supports Par... | Sean Owen | 2016-08-30 | 1 | -1/+1
* [SPARK-17001][ML] Enable standardScaler to standardize sparse vectors when wi... | Sean Owen | 2016-08-27 | 1 | -3/+2
* [SPARK-17197][ML][PYSPARK] PySpark LiR/LoR supports tree aggregation level co... | Yanbo Liang | 2016-08-25 | 4 | -11/+42
* [SPARK-17215][SQL] Method `SQLContext.parseDataType(dataTypeString: String)` ... | jiangxingbo | 2016-08-24 | 6 | -13/+16
* [SPARK-16216][SQL] Read/write timestamps and dates in ISO 8601 and dateFormat... | hyukjinkwon | 2016-08-24 | 2 | -20/+66
* [SPARK-16781][PYSPARK] java launched by PySpark as gateway may not be the sam... | Sean Owen | 2016-08-24 | 3 | -1/+1
* [SPARK-15113][PYSPARK][ML] Add missing num features num classes | Holden Karau | 2016-08-22 | 3 | -11/+64
* [SPARK-15018][PYSPARK][ML] Improve handling of PySpark Pipeline when used wit... | Bryan Cutler | 2016-08-19 | 2 | -8/+14
* [SPARK-16965][MLLIB][PYSPARK] Fix bound checking for SparseVector. | Jeff Zhang | 2016-08-19 | 1 | -0/+15
* [SPARK-16961][CORE] Fixed off-by-one error that biased randomizeInPlace | Nick Lavers | 2016-08-19 | 3 | -8/+8
* [MINOR][DOC] Fix the descriptions for `properties` argument in the documenati... | mvervuurt | 2016-08-16 | 1 | -5/+6
* [SPARK-17035] [SQL] [PYSPARK] Improve Timestamp not to lose precision for all... | Dongjoon Hyun | 2016-08-16 | 2 | -1/+6
* [SPARK-16700][PYSPARK][SQL] create DataFrame from dict/Row with schema | Davies Liu | 2016-08-15 | 4 | -28/+62
* [MINOR][ML] Rename TreeEnsembleModels to TreeEnsembleModel for PySpark | Yanbo Liang | 2016-08-11 | 2 | -6/+6
* [SPARK-16324][SQL] regexp_extract should doc that it returns empty string whe... | Sean Owen | 2016-08-10 | 1 | -1/+5
* [SPARK-16950] [PYSPARK] fromOffsets parameter support in KafkaUtils.createDir... | Mariusz Strzelecki | 2016-08-09 | 2 | -9/+6
* [SPARK-16409][SQL] regexp_extract with optional groups causes NPE | Sean Owen | 2016-08-07 | 1 | -0/+3
* [SPARK-16772][PYTHON][DOCS] Fix API doc references to UDFRegistration + Updat... | Nicholas Chammas | 2016-08-06 | 3 | -9/+5
* [SPARK-16831][PYTHON] Fixed bug in CrossValidator.avgMetrics | =^_^= | 2016-08-03 | 1 | -1/+3
* [SPARK-16062] [SPARK-15989] [SQL] Fix two bugs of Python-only UDTs | Liang-Chi Hsieh | 2016-08-02 | 1 | -0/+35
* [SPARK-16772][PYTHON][DOCS] Restore "datatype string" to Python API docstrings | Nicholas Chammas | 2016-07-29 | 2 | -12/+8
* [SPARK-16772] Correct API doc references to PySpark classes + formatting fixes | Nicholas Chammas | 2016-07-28 | 8 | -58/+75
* [SPARK-15254][DOC] Improve ML pipeline Cross Validation Scaladoc & PyDoc | krishnakalyan3 | 2016-07-27 | 1 | -2/+11
* [SPARK-16653][ML][OPTIMIZER] update ANN convergence tolerance param default t... | WeichenXu | 2016-07-25 | 1 | -4/+4
* [PYSPARK] add picklable SparseMatrix in pyspark.ml.common | WeichenXu | 2016-07-24 | 1 | -0/+1
* [SPARK-16662][PYSPARK][SQL] fix HiveContext warning bug | WeichenXu | 2016-07-23 | 1 | -5/+4
* [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co... | Dongjoon Hyun | 2016-07-22 | 1 | -0/+2
* [SPARK-16494][ML] Upgrade breeze version to 0.12 | Yanbo Liang | 2016-07-19 | 1 | -1/+1
* [DOC] improve python doc for rdd.histogram and dataframe.join | Mortada Mehyar | 2016-07-18 | 2 | -14/+14
* [SPARK-14817][ML][MLLIB][DOC] Made DataFrame-based API primary in MLlib guide | Joseph K. Bradley | 2016-07-15 | 3 | -4/+7
* [SPARK-16546][SQL][PYSPARK] update python dataframe.drop | WeichenXu | 2016-07-14 | 1 | -8/+19
* [SPARK-16503] SparkSession should provide Spark version | Liwei Lin | 2016-07-13 | 1 | -0/+6
* [SPARK-16536][SQL][PYSPARK][MINOR] Expose `sql` in PySpark Shell | Dongjoon Hyun | 2016-07-13 | 1 | -0/+1