index : spark
Mirror of Apache Spark
path: root/python
Commit message | Author | Date | Files | Lines (-/+)
[SPARK-20232][PYTHON] Improve combineByKey docs | David Gingrich | 2017-04-13 | 1 | -5/+19
[SPARK-19570][PYSPARK] Allow to disable hive in pyspark shell | Jeff Zhang | 2017-04-12 | 1 | -6/+16
[MINOR][DOCS] JSON APIs related documentation fixes | hyukjinkwon | 2017-04-12 | 2 | -5/+7
[SPARK-19505][PYTHON] AttributeError on Exception.message in Python3 | David Gingrich | 2017-04-11 | 3 | -5/+53
[SPARK-20285][TESTS] Increase the pyspark streaming test timeout to 30 seconds | Shixiong Zhu | 2017-04-10 | 1 | -1/+1
[SPARK-20076][ML][PYSPARK] Add Python interface for ml.stats.Correlation | Liang-Chi Hsieh | 2017-04-07 | 1 | -0/+61
[SPARK-20196][PYTHON][SQL] update doc for catalog functions for all languages... | Felix Cheung | 2017-04-06 | 2 | -9/+20
[SPARK-20064][PYSPARK] Bump the PySpark verison number to 2.2 | setjet | 2017-04-06 | 1 | -1/+1
[SPARK-20214][ML] Make sure converted csc matrix has sorted indices | Liang-Chi Hsieh | 2017-04-05 | 3 | -0/+17
[SPARK-19454][PYTHON][SQL] DataFrame.replace improvements | zero323 | 2017-04-05 | 2 | -25/+128
[SPARK-20166][SQL] Use XXX for ISO 8601 timezone instead of ZZ (FastDateForma... | hyukjinkwon | 2017-04-03 | 2 | -6/+6
[SPARK-19955][PYSPARK] Jenkins Python Conda based test. | Holden Karau | 2017-03-29 | 1 | -3/+3
[SPARK-20040][ML][PYTHON] pyspark wrapper for ChiSquareTest | Bago Amirbekian | 2017-03-28 | 3 | -9/+123
[SPARK-20102] Fix nightly packaging and RC packaging scripts w/ two minor bui... | Josh Rosen | 2017-03-27 | 1 | -1/+0
[MINOR][DOCS] Match several documentation changes in Scala to R/Python | hyukjinkwon | 2017-03-26 | 2 | -4/+12
[SPARK-19281][PYTHON][ML] spark.ml Python API for FPGrowth | zero323 | 2017-03-26 | 3 | -7/+270
[SPARK-15040][ML][PYSPARK] Add Imputer to PySpark | Nick Pentreath | 2017-03-24 | 2 | -0/+170
[SPARK-19876][SS][WIP] OneTime Trigger Executor | Tyson Condie | 2017-03-23 | 2 | -48/+32
[SPARK-18579][SQL] Use ignoreLeadingWhiteSpace and ignoreTrailingWhiteSpace o... | hyukjinkwon | 2017-03-23 | 3 | -16/+37
[SPARK-19949][SQL][FOLLOW-UP] Clean up parse modes and update related comments | hyukjinkwon | 2017-03-22 | 2 | -4/+4
[SPARK-20041][DOC] Update docs for NaN handling in approxQuantile | Zheng RuiFeng | 2017-03-21 | 1 | -1/+2
[SPARK-20011][ML][DOCS] Clarify documentation for ALS 'rank' parameter | christopher snow | 2017-03-21 | 1 | -2/+2
[SPARK-19849][SQL] Support ArrayType in to_json to produce JSON array | hyukjinkwon | 2017-03-19 | 1 | -5/+10
[SPARK-19986][TESTS] Make pyspark.streaming.tests.CheckpointTests more stable | Shixiong Zhu | 2017-03-17 | 1 | -5/+6
[SPARK-19872] [PYTHON] Use the correct deserializer for RDD construction for ... | hyukjinkwon | 2017-03-15 | 2 | -1/+9
[SPARK-19817][SS] Make it clear that `timeZone` is a general option in DataSt... | Liwei Lin | 2017-03-14 | 2 | -12/+28
[SPARK-19817][SQL] Make it clear that `timeZone` option is a general option i... | Takuya UESHIN | 2017-03-14 | 1 | -18/+28
[SPARK-12334][SQL][PYSPARK] Support read from multiple input paths for orc fi... | Jeff Zhang | 2017-03-09 | 2 | -6/+13
[SPARK-19561][SQL] add int case handling for TimestampType | Jason White | 2017-03-09 | 1 | -0/+8
[SPARK-19806][ML][PYSPARK] PySpark GeneralizedLinearRegression supports tweed... | Yanbo Liang | 2017-03-08 | 2 | -8/+73
Revert "[SPARK-19561] [PYTHON] cast TimestampType.toInternal output to long" | Wenchen Fan | 2017-03-07 | 2 | -7/+1
[SPARK-19561] [PYTHON] cast TimestampType.toInternal output to long | Jason White | 2017-03-07 | 2 | -1/+7
[SPARK-19701][SQL][PYTHON] Throws a correct exception for 'in' operator again... | hyukjinkwon | 2017-03-05 | 2 | -1/+6
[SPARK-19595][SQL] Support json array in from_json | hyukjinkwon | 2017-03-05 | 1 | -3/+8
[SPARK-19348][PYTHON] PySpark keyword_only decorator is not thread-safe | Bryan Cutler | 2017-03-03 | 11 | -120/+161
[SPARK-18352][DOCS] wholeFile JSON update doc and programming guide | Felix Cheung | 2017-03-02 | 2 | -4/+4
[SPARK-19734][PYTHON][ML] Correct OneHotEncoder doc string to say dropLast | Mark Grover | 2017-03-01 | 1 | -1/+1
[MINOR][ML] Fix comments in LSH Examples and Python API | Yun Ni | 2017-03-01 | 1 | -1/+1
[SPARK-19610][SQL] Support parsing multiline CSV files | hyukjinkwon | 2017-02-28 | 4 | -5/+22
[SPARK-14489][ML][PYSPARK] ALS unknown user/item prediction strategy | Nick Pentreath | 2017-02-28 | 1 | -5/+25
[SPARK-19660][CORE][SQL] Replace the configuration property names that are de... | Yuming Wang | 2017-02-28 | 1 | -23/+24
[SPARK-13330][PYSPARK] PYTHONHASHSEED is not propgated to python worker | Jeff Zhang | 2017-02-24 | 2 | -5/+4
[SPARK-19161][PYTHON][SQL] Improving UDF Docstrings | zero323 | 2017-02-24 | 2 | -11/+25
[SPARK-14772][PYTHON][ML] Fixed Params.copy method to match Scala implementation | Bryan Cutler | 2017-02-23 | 2 | -6/+27
[SPARK-19706][PYSPARK] add Column.contains in pyspark | Wenchen Fan | 2017-02-23 | 2 | -1/+3
[SPARK-18699][SQL] Put malformed tokens into a new field when parsing CSV data | Takeshi Yamamuro | 2017-02-23 | 2 | -16/+48
[SPARK-19497][SS] Implement streaming deduplication | Shixiong Zhu | 2017-02-23 | 1 | -0/+6
[SPARK-19405][STREAMING] Support for cross-account Kinesis reads via STS | Adam Budde | 2017-02-22 | 1 | -2/+10
[MINOR][PYTHON] Fix typo docstring: 'top' -> 'topic' | Rolando Espinoza | 2017-02-17 | 1 | -1/+1
[SPARK-18352][SQL] Support parsing multiline json files | Nathan Howell | 2017-02-16 | 4 | -10/+37