index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-5094][MLlib] Add Python API for Gradient Boosted Trees
Kazuki Taniguchi
2015-01-30
2
-53
/
+209
*
[SPARK-5464] Fix help() for Python DataFrame instances
Josh Rosen
2015-01-29
2
-3
/
+13
*
[SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.
Reynold Xin
2015-01-29
1
-2
/
+2
*
[SPARK-5477] refactor stat.py
Xiangrui Meng
2015-01-29
4
-54
/
+96
*
[SPARK-5445][SQL] Made DataFrame dsl usable in Java
Reynold Xin
2015-01-28
1
-16
/
+22
*
[SPARK-5430] move treeReduce and treeAggregate from mllib to core
Xiangrui Meng
2015-01-28
1
-1
/
+90
*
[SPARK-4586][MLLIB] Python API for ML pipeline and parameters
Xiangrui Meng
2015-01-28
16
-16
/
+1124
*
[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible
Yandu Oppacher
2015-01-28
8
-71
/
+232
*
[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd
Michael Nazario
2015-01-28
1
-0
/
+14
*
SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs
Sandy Ryza
2015-01-28
1
-2
/
+2
*
[SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly
Winston Chen
2015-01-28
1
-0
/
+19
*
[SPARK-5097][SQL] DataFrame
Reynold Xin
2015-01-27
3
-336
/
+793
*
[SPARK-5063] More helpful error messages for several invalid operations
Josh Rosen
2015-01-23
2
-0
/
+19
*
[SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed
nate.crosswhite
2015-01-21
2
-3
/
+18
*
SPARK-5270 [CORE] Provide isEmpty() function in RDD API
Sean Owen
2015-01-19
1
-0
/
+12
*
[SPARK-5193][SQL] Remove Spark SQL Java-specific API.
Reynold Xin
2015-01-16
1
-36
/
+12
*
[SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.
Reynold Xin
2015-01-15
1
-8
/
+8
*
[SPARK-5224] [PySpark] improve performance of parallelize list/ndarray
Davies Liu
2015-01-15
2
-1
/
+5
*
[SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing
MechCoder
2015-01-14
2
-0
/
+29
*
[SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib
Davies Liu
2015-01-13
1
-4
/
+2
*
[SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple
Gabe Mulley
2015-01-12
1
-4
/
+14
*
[SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...
RJ Nowling
2015-01-08
1
-0
/
+187
*
[SPARK-5089][PYSPARK][MLLIB] Fix vector convert
freeman
2015-01-05
2
-1
/
+11
*
[SPARK-3325][Streaming] Add a parameter to the method print in class DStream
Yadong Qi
2015-01-02
1
-5
/
+7
*
[SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...
Brennon York
2014-12-27
1
-1
/
+1
*
[SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`
jbencook
2014-12-23
1
-0
/
+28
*
[SPARK-4822] Use sphinx tags for Python doc annotations
lewuathe
2014-12-17
6
-17
/
+17
*
[SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc
Joseph K. Bradley
2014-12-17
3
-30
/
+5
*
[SPARK-4866] support StructType as key in MapType
Davies Liu
2014-12-16
2
-7
/
+18
*
[SPARK-4855][mllib] testing the Chi-squared hypothesis test
jbencook
2014-12-16
1
-1
/
+99
*
[SPARK-4841] fix zip with textFile()
Davies Liu
2014-12-15
3
-14
/
+26
*
[SPARK-4494][mllib] IDFModel.transform() add support for single vector
Yuu ISHIKAWA
2014-12-15
1
-7
/
+15
*
[SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + D...
Joseph K. Bradley
2014-12-04
1
-3
/
+3
*
[SPARK-4548] []SPARK-4517] improve performance of python broadcast
Davies Liu
2014-11-24
5
-233
/
+80
*
[SPARK-4578] fix asDict() with nested Row()
Davies Liu
2014-11-24
2
-4
/
+5
*
[SPARK-4562] [MLlib] speedup vector
Davies Liu
2014-11-24
2
-26
/
+53
*
[SPARK-4531] [MLlib] cache serialized java object
Davies Liu
2014-11-21
4
-13
/
+8
*
[SPARK-4477] [PySpark] remove numpy from RDDSampler
Davies Liu
2014-11-20
2
-69
/
+40
*
[SPARK-4439] [MLlib] add python api for random forest
Davies Liu
2014-11-20
2
-23
/
+221
*
[SPARK-4228][SQL] SchemaRDD to JSON
Dan McClary
2014-11-20
1
-1
/
+16
*
[SPARK-4384] [PySpark] improve sort spilling
Davies Liu
2014-11-19
1
-1
/
+10
*
[DOC][PySpark][Streaming] Fix docstring for sphinx
Ken Takagiwa
2014-11-19
1
-2
/
+2
*
[SPARK-4327] [PySpark] Python API for RDD.randomSplit()
Davies Liu
2014-11-18
2
-3
/
+41
*
[SPARK-3721] [PySpark] broadcast objects larger than 2G
Davies Liu
2014-11-18
6
-17
/
+239
*
[SPARK-4306] [MLlib] Python API for LogisticRegressionWithLBFGS
Davies Liu
2014-11-18
1
-4
/
+53
*
[SPARK-4396] allow lookup by index in Python's Rating
Xiangrui Meng
2014-11-18
1
-11
/
+15
*
[SPARK-4435] [MLlib] [PySpark] improve classification
Davies Liu
2014-11-18
1
-29
/
+106
*
[SPARK-4415] [PySpark] JVM should exit after Python exit
Davies Liu
2014-11-14
1
-1
/
+3
*
[SPARK-4398][PySpark] specialize sc.parallelize(xrange)
Xiangrui Meng
2014-11-14
1
-4
/
+21
*
[SPARK-4372][MLLIB] Make LR and SVM's default parameters consistent in Scala ...
Xiangrui Meng
2014-11-13
2
-35
/
+37
[next]