index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-13330][PYSPARK] PYTHONHASHSEED is not propgated to python worker
Jeff Zhang
2017-02-24
2
-5
/
+4
*
[SPARK-19161][PYTHON][SQL] Improving UDF Docstrings
zero323
2017-02-24
2
-11
/
+25
*
[SPARK-14772][PYTHON][ML] Fixed Params.copy method to match Scala implementation
Bryan Cutler
2017-02-23
2
-6
/
+27
*
[SPARK-19706][PYSPARK] add Column.contains in pyspark
Wenchen Fan
2017-02-23
2
-1
/
+3
*
[SPARK-18699][SQL] Put malformed tokens into a new field when parsing CSV data
Takeshi Yamamuro
2017-02-23
2
-16
/
+48
*
[SPARK-19497][SS] Implement streaming deduplication
Shixiong Zhu
2017-02-23
1
-0
/
+6
*
[SPARK-19405][STREAMING] Support for cross-account Kinesis reads via STS
Adam Budde
2017-02-22
1
-2
/
+10
*
[MINOR][PYTHON] Fix typo docstring: 'top' -> 'topic'
Rolando Espinoza
2017-02-17
1
-1
/
+1
*
[SPARK-18352][SQL] Support parsing multiline json files
Nathan Howell
2017-02-16
3
-10
/
+24
*
[SPARK-18080][ML][PYTHON] Python API & Examples for Locality Sensitive Hashing
Yun Ni
2017-02-15
1
-0
/
+291
*
[SPARK-18937][SQL] Timezone support in CSV/JSON parsing
Takuya UESHIN
2017-02-15
2
-24
/
+39
*
[SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column
Felix Cheung
2017-02-15
1
-1
/
+9
*
[SPARK-19160][PYTHON][SQL] Add udf decorator
zero323
2017-02-15
2
-7
/
+91
*
[SPARK-19590][PYSPARK][ML] Update the document for QuantileDiscretizer in pys...
VinceShieh
2017-02-15
1
-1
/
+11
*
[SPARK-18541][PYTHON] Add metadata parameter to pyspark.sql.Column.alias()
Sheamus K. Parkes
2017-02-14
2
-3
/
+33
*
[SPARK-19162][PYTHON][SQL] UserDefinedFunction should validate that func is c...
zero323
2017-02-14
2
-0
/
+12
*
[SPARK-19453][PYTHON][SQL][DOC] Correct and extend DataFrame.replace docstring
zero323
2017-02-14
1
-6
/
+12
*
[SPARK-19429][PYTHON][SQL] Support slice arguments in Column.__getitem__
zero323
2017-02-13
2
-3
/
+16
*
[SPARK-19427][PYTHON][SQL] Support data type string as a returnType argument ...
zero323
2017-02-13
2
-3
/
+20
*
[SPARK-19506][ML][PYTHON] Import warnings in pyspark.ml.util
zero323
2017-02-13
1
-0
/
+1
*
[SPARK-16609] Add to_date/to_timestamp with format functions
anabranch
2017-02-07
2
-6
/
+55
*
[SPARK-19467][ML][PYTHON] Remove cyclic imports from pyspark.ml.pipeline
zero323
2017-02-06
1
-1
/
+1
*
[SPARK-19421][ML][PYSPARK] Remove numClasses and numFeatures methods in Linea...
Zheng RuiFeng
2017-02-05
1
-16
/
+0
*
[SPARK-19389][ML][PYTHON][DOC] Minor doc fixes for ML Python Params and Linea...
Joseph K. Bradley
2017-02-02
2
-17
/
+5
*
[SPARK-14352][SQL] approxQuantile should support multi columns
Zheng RuiFeng
2017-02-01
2
-8
/
+52
*
[SPARK-19163][PYTHON][SQL] Delay _judf initialization to the __call__
zero323
2017-01-31
2
-11
/
+68
*
[SPARK-17161][PYSPARK][ML] Add PySpark-ML JavaWrapper convenience function to...
Bryan Cutler
2017-01-31
3
-3
/
+77
*
[SPARK-19403][PYTHON][SQL] Correct pyspark.sql.column.__all__ list.
zero323
2017-01-30
1
-1
/
+1
*
[SPARK-19336][ML][PYSPARK] LinearSVC Python API
wm624@hotmail.com
2017-01-27
3
-1
/
+156
*
[SPARK-18020][STREAMING][KINESIS] Checkpoint SHARD_END to finish reading clos...
Takeshi YAMAMURO
2017-01-25
1
-1
/
+1
*
[SPARK-19307][PYSPARK] Make sure user conf is propagated to SparkContext.
Marcelo Vanzin
2017-01-25
2
-0
/
+23
*
[SPARK-19229][SQL] Disallow Creating Hive Source Tables when Hive Support is ...
gatorsmile
2017-01-22
1
-4
/
+4
*
[SPARK-18589][SQL] Fix Python UDF accessing attributes from both side of join
Davies Liu
2017-01-20
1
-0
/
+9
*
[SPARK-14272][ML] Add Loglikelihood in GaussianMixtureSummary
Zheng RuiFeng
2017-01-19
1
-0
/
+10
*
[SPARK-19223][SQL][PYSPARK] Fix InputFileBlockHolder for datasources which ar...
Liang-Chi Hsieh
2017-01-18
1
-0
/
+24
*
[SPARK-19239][PYSPARK] Check parameters whether equals None when specify the ...
DjvuLee
2017-01-17
1
-3
/
+6
*
[SPARK-19019] [PYTHON] Fix hijacked `collections.namedtuple` and port cloudpi...
hyukjinkwon
2017-01-17
2
-31
/
+87
*
[SPARK-19148][SQL] do not expose the external table concept in Catalog
Wenchen Fan
2017-01-17
1
-3
/
+24
*
[SPARK-18687][PYSPARK][SQL] Backward compatibility - creating a Dataframe on ...
Vinayak
2017-01-13
2
-2
/
+7
*
[SPARK-19055][SQL][PYSPARK] Fix SparkSession initialization when SparkContext...
Liang-Chi Hsieh
2017-01-12
2
-6
/
+33
*
[SPARK-19164][PYTHON][SQL] Remove unused UserDefinedFunction._broadcast
zero323
2017-01-12
1
-6
/
+0
*
[SPARK-19140][SS] Allow update mode for non-aggregation streaming queries
Shixiong Zhu
2017-01-10
1
-8
/
+19
*
[SPARK-17645][MLLIB][ML][FOLLOW-UP] document minor change
Peng, Meng
2017-01-10
2
-7
/
+8
*
[SPARK-17847][ML] Reduce shuffled data size of GaussianMixture & copy the imp...
Yanbo Liang
2017-01-09
1
-18
/
+8
*
[SPARK-19126][DOCS] Update Join Documentation Across Languages
anabranch
2017-01-08
1
-2
/
+3
*
[SPARK-19127][DOCS] Update Rank Function Documentation
anabranch
2017-01-08
1
-6
/
+10
*
[SPARK-13748][PYSPARK][DOC] Add the description for explictly setting None fo...
hyukjinkwon
2017-01-07
1
-1
/
+3
*
[MINOR][DOCS] Remove consecutive duplicated words/typo in Spark Repo
Niranjan Padmanabhan
2017-01-04
3
-4
/
+4
*
[SPARK-17645][MLLIB][ML] add feature selector method based on: False Discover...
Peng
2016-12-28
2
-15
/
+109
*
[SPARK-18949][SQL] Add recoverPartitions API to Catalog
gatorsmile
2016-12-20
1
-0
/
+5
[next]