index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-2871] [PySpark] add approx API for RDD
Davies Liu
2014-08-23
1
-0
/
+81
*
[SPARK-2871] [PySpark] add `key` argument for max(), min() and top(n)
Davies Liu
2014-08-23
1
-17
/
+27
*
[SPARK-3140] Clarify confusing PySpark exception message
Andrew Or
2014-08-20
1
-3
/
+10
*
[SPARK-3141] [PySpark] fix sortByKey() with take()
Davies Liu
2014-08-19
1
-10
/
+8
*
[SPARK-2974] [SPARK-2975] Fix two bugs related to spark.local.dirs
Josh Rosen
2014-08-19
1
-1
/
+1
*
[SPARK-3136][MLLIB] Create Java-friendly methods in RandomRDDs
Xiangrui Meng
2014-08-19
1
-10
/
+10
*
[SPARK-2790] [PySpark] fix zip with serializers which have different batch si...
Davies Liu
2014-08-19
3
-1
/
+54
*
[SPARK-3114] [PySpark] Fix Python UDFs in Spark SQL.
Josh Rosen
2014-08-18
3
-5
/
+3
*
[SPARK-2850] [SPARK-2626] [mllib] MLlib stats examples + small fixes
Joseph K. Bradley
2014-08-18
3
-10
/
+23
*
[mllib] DecisionTree: treeAggregate + Python example bug fix
Joseph K. Bradley
2014-08-18
2
-6
/
+9
*
[SPARK-3103] [PySpark] fix saveAsTextFile() with utf-8
Davies Liu
2014-08-18
2
-1
/
+12
*
[SPARK-1065] [PySpark] improve supporting for large broadcast
Davies Liu
2014-08-16
6
-21
/
+73
*
[SPARK-3035] Wrong example with SparkContext.addFile
iAmGhost
2014-08-16
1
-1
/
+1
*
[SPARK-3081][MLLIB] rename RandomRDDGenerators to RandomRDDs
Xiangrui Meng
2014-08-16
1
-13
/
+12
*
[SQL] Using safe floating-point numbers in doctest
Cheng Lian
2014-08-16
1
-2
/
+2
*
[SQL] Python JsonRDD UTF8 Encoding Fix
Ahir Reddy
2014-08-14
1
-1
/
+3
*
[SPARK-2983] [PySpark] improve performance of sortByKey()
Davies Liu
2014-08-13
1
-23
/
+24
*
[SPARK-3013] [SQL] [PySpark] convert array into list
Davies Liu
2014-08-13
1
-7
/
+7
*
[SPARK-2993] [MLLib] colStats (wrapper around MultivariateStatisticalSummary)...
Doris Xin
2014-08-12
1
-1
/
+65
*
fix flaky tests
Davies Liu
2014-08-12
1
-1
/
+1
*
[SPARK-2844][SQL] Correctly set JVM HiveContext if it is passed into Python H...
Ahir Reddy
2014-08-11
1
-0
/
+14
*
[PySpark] [SPARK-2954] [SPARK-2948] [SPARK-2910] [SPARK-2101] Python 2.6 Fixes
Josh Rosen
2014-08-11
5
-7
/
+36
*
[SPARK-2898] [PySpark] fix bugs in deamon.py
Davies Liu
2014-08-10
1
-31
/
+47
*
[SPARK-2894] spark-shell doesn't accept flags
Kousuke Saruta
2014-08-09
1
-1
/
+1
*
[SPARK-2851] [mllib] DecisionTree Python consistency update
Joseph K. Bradley
2014-08-06
1
-35
/
+15
*
[PySpark] Add blanklines to Python docstrings so example code renders correctly
RJ Nowling
2014-08-06
1
-0
/
+9
*
[SPARK-2627] [PySpark] have the build enforce PEP 8 automatically
Nicholas Chammas
2014-08-06
27
-134
/
+251
*
[SPARK-2875] [PySpark] [SQL] handle null in schemaRDD()
Davies Liu
2014-08-06
1
-0
/
+7
*
[SPARK-2854][SQL] Finalize _acceptable_types in pyspark.sql
Yin Huai
2014-08-05
1
-9
/
+20
*
[SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pys...
Michael Giannakopoulos
2014-08-05
1
-6
/
+55
*
[SPARK-1687] [PySpark] fix unit tests related to pickable namedtuple
Davies Liu
2014-08-04
1
-1
/
+5
*
[SPARK-1687] [PySpark] pickable namedtuple
Davies Liu
2014-08-04
2
-0
/
+79
*
[SPARK-1740] [PySpark] kill the python worker
Davies Liu
2014-08-03
2
-6
/
+69
*
[SPARK-2784][SQL] Deprecate hql() method in favor of a config option, 'spark....
Michael Armbrust
2014-08-03
1
-8
/
+12
*
[SPARK-2739][SQL] Rename registerAsTable to registerTempTable
Michael Armbrust
2014-08-02
1
-4
/
+8
*
[SPARK-2797] [SQL] SchemaRDDs don't support unpersist()
Yin Huai
2014-08-02
1
-2
/
+2
*
[SPARK-2097][SQL] UDF Support
Michael Armbrust
2014-08-02
1
-1
/
+38
*
[SPARK-2478] [mllib] DecisionTree Python API
Joseph K. Bradley
2014-08-02
5
-18
/
+291
*
[SPARK-2454] Do not ship spark home to Workers
Andrew Or
2014-08-02
1
-1
/
+1
*
StatCounter on NumPy arrays [PYSPARK][SPARK-2012]
Jeremy Freeman
2014-08-01
2
-8
/
+37
*
[SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pys...
Michael Giannakopoulos
2014-08-01
1
-4
/
+28
*
[SPARK-2764] Simplify daemon.py process structure
Josh Rosen
2014-08-01
1
-108
/
+71
*
[SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDD
Davies Liu
2014-08-01
2
-347
/
+919
*
[SPARK-2786][mllib] Python correlations
Doris Xin
2014-08-01
2
-1
/
+109
*
[SPARK-2724] Python version of RandomRDDGenerators
Doris Xin
2014-07-31
4
-0
/
+197
*
SPARK-2282: Reuse Socket for sending accumulator updates to Pyspark
Aaron Davidson
2014-07-31
1
-7
/
+27
*
[SPARK-2397][SQL] Deprecate LocalHiveContext
Michael Armbrust
2014-07-31
1
-0
/
+6
*
SPARK-2341 [MLLIB] loadLibSVMFile doesn't handle regression datasets
Sean Owen
2014-07-30
1
-11
/
+12
*
[SPARK-2024] Add saveAsSequenceFile to PySpark
Kan Zhang
2014-07-30
3
-28
/
+454
*
Avoid numerical instability
Naftali Harris
2014-07-30
1
-1
/
+2
[next]