aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-20132][DOCS] Add documentation for column string functionsMichael Patterson2017-04-221-6/+64
* [SPARK-20360][PYTHON] reprs for interpretersKyle Kelley2017-04-182-0/+37
* [SPARK-20232][PYTHON] Improve combineByKey docsDavid Gingrich2017-04-131-5/+19
* [SPARK-19570][PYSPARK] Allow to disable hive in pyspark shellJeff Zhang2017-04-121-6/+16
* [MINOR][DOCS] JSON APIs related documentation fixeshyukjinkwon2017-04-122-5/+7
* [SPARK-19505][PYTHON] AttributeError on Exception.message in Python3David Gingrich2017-04-113-5/+53
* [SPARK-20285][TESTS] Increase the pyspark streaming test timeout to 30 secondsShixiong Zhu2017-04-101-1/+1
* [SPARK-20076][ML][PYSPARK] Add Python interface for ml.stats.CorrelationLiang-Chi Hsieh2017-04-071-0/+61
* [SPARK-20196][PYTHON][SQL] update doc for catalog functions for all languages...Felix Cheung2017-04-062-9/+20
* [SPARK-20064][PYSPARK] Bump the PySpark verison number to 2.2setjet2017-04-061-1/+1
* [SPARK-20214][ML] Make sure converted csc matrix has sorted indicesLiang-Chi Hsieh2017-04-053-0/+17
* [SPARK-19454][PYTHON][SQL] DataFrame.replace improvementszero3232017-04-052-25/+128
* [SPARK-20166][SQL] Use XXX for ISO 8601 timezone instead of ZZ (FastDateForma...hyukjinkwon2017-04-032-6/+6
* [SPARK-19955][PYSPARK] Jenkins Python Conda based test.Holden Karau2017-03-291-3/+3
* [SPARK-20040][ML][PYTHON] pyspark wrapper for ChiSquareTestBago Amirbekian2017-03-283-9/+123
* [SPARK-20102] Fix nightly packaging and RC packaging scripts w/ two minor bui...Josh Rosen2017-03-271-1/+0
* [MINOR][DOCS] Match several documentation changes in Scala to R/Pythonhyukjinkwon2017-03-262-4/+12
* [SPARK-19281][PYTHON][ML] spark.ml Python API for FPGrowthzero3232017-03-263-7/+270
* [SPARK-15040][ML][PYSPARK] Add Imputer to PySparkNick Pentreath2017-03-242-0/+170
* [SPARK-19876][SS][WIP] OneTime Trigger ExecutorTyson Condie2017-03-232-48/+32
* [SPARK-18579][SQL] Use ignoreLeadingWhiteSpace and ignoreTrailingWhiteSpace o...hyukjinkwon2017-03-233-16/+37
* [SPARK-19949][SQL][FOLLOW-UP] Clean up parse modes and update related commentshyukjinkwon2017-03-222-4/+4
* [SPARK-20041][DOC] Update docs for NaN handling in approxQuantileZheng RuiFeng2017-03-211-1/+2
* [SPARK-20011][ML][DOCS] Clarify documentation for ALS 'rank' parameterchristopher snow2017-03-211-2/+2
* [SPARK-19849][SQL] Support ArrayType in to_json to produce JSON arrayhyukjinkwon2017-03-191-5/+10
* [SPARK-19986][TESTS] Make pyspark.streaming.tests.CheckpointTests more stableShixiong Zhu2017-03-171-5/+6
* [SPARK-19872] [PYTHON] Use the correct deserializer for RDD construction for ...hyukjinkwon2017-03-152-1/+9
* [SPARK-19817][SS] Make it clear that `timeZone` is a general option in DataSt...Liwei Lin2017-03-142-12/+28
* [SPARK-19817][SQL] Make it clear that `timeZone` option is a general option i...Takuya UESHIN2017-03-141-18/+28
* [SPARK-12334][SQL][PYSPARK] Support read from multiple input paths for orc fi...Jeff Zhang2017-03-092-6/+13
* [SPARK-19561][SQL] add int case handling for TimestampTypeJason White2017-03-091-0/+8
* [SPARK-19806][ML][PYSPARK] PySpark GeneralizedLinearRegression supports tweed...Yanbo Liang2017-03-082-8/+73
* Revert "[SPARK-19561] [PYTHON] cast TimestampType.toInternal output to long"Wenchen Fan2017-03-072-7/+1
* [SPARK-19561] [PYTHON] cast TimestampType.toInternal output to longJason White2017-03-072-1/+7
* [SPARK-19701][SQL][PYTHON] Throws a correct exception for 'in' operator again...hyukjinkwon2017-03-052-1/+6
* [SPARK-19595][SQL] Support json array in from_jsonhyukjinkwon2017-03-051-3/+8
* [SPARK-19348][PYTHON] PySpark keyword_only decorator is not thread-safeBryan Cutler2017-03-0311-120/+161
* [SPARK-18352][DOCS] wholeFile JSON update doc and programming guideFelix Cheung2017-03-022-4/+4
* [SPARK-19734][PYTHON][ML] Correct OneHotEncoder doc string to say dropLastMark Grover2017-03-011-1/+1
* [MINOR][ML] Fix comments in LSH Examples and Python APIYun Ni2017-03-011-1/+1
* [SPARK-19610][SQL] Support parsing multiline CSV fileshyukjinkwon2017-02-284-5/+22
* [SPARK-14489][ML][PYSPARK] ALS unknown user/item prediction strategyNick Pentreath2017-02-281-5/+25
* [SPARK-19660][CORE][SQL] Replace the configuration property names that are de...Yuming Wang2017-02-281-23/+24
* [SPARK-13330][PYSPARK] PYTHONHASHSEED is not propgated to python workerJeff Zhang2017-02-242-5/+4
* [SPARK-19161][PYTHON][SQL] Improving UDF Docstringszero3232017-02-242-11/+25
* [SPARK-14772][PYTHON][ML] Fixed Params.copy method to match Scala implementationBryan Cutler2017-02-232-6/+27
* [SPARK-19706][PYSPARK] add Column.contains in pysparkWenchen Fan2017-02-232-1/+3
* [SPARK-18699][SQL] Put malformed tokens into a new field when parsing CSV dataTakeshi Yamamuro2017-02-232-16/+48
* [SPARK-19497][SS] Implement streaming deduplicationShixiong Zhu2017-02-231-0/+6
* [SPARK-19405][STREAMING] Support for cross-account Kinesis reads via STSAdam Budde2017-02-221-2/+10