index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-5677] [SPARK-5734] [SQL] [PySpark] Python DataFrame API remaining tasks
Davies Liu
2015-02-11
4
-46
/
+144
*
[SPARK-5704] [SQL] [PySpark] createDataFrame from RDD with columns
Davies Liu
2015-02-10
2
-33
/
+80
*
[SPARK-5658][SQL] Finalize DDL and write support APIs
Yin Huai
2015-02-10
3
-6
/
+241
*
[SQL] Add toString to DataFrame/Column
Michael Armbrust
2015-02-10
1
-1
/
+1
*
[SPARK-5469] restructure pyspark.sql into multiple files
Davies Liu
2015-02-09
11
-2755
/
+2961
*
[SPARK-5678] Convert DataFrame to pandas.DataFrame and Series
Davies Liu
2015-02-09
1
-0
/
+25
*
SPARK-5633 pyspark saveAsTextFile support for compression codec
Vladimir Vladimirov
2015-02-06
1
-2
/
+20
*
[SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source...
Cheng Lian
2015-02-05
1
-2
/
+7
*
[SQL][DataFrame] Minor cleanup.
Reynold Xin
2015-02-04
1
-11
/
+0
*
[SPARK-5605][SQL][DF] Allow using String to specify colum name in DSL aggrega...
Reynold Xin
2015-02-04
1
-5
/
+8
*
[SPARK-5577] Python udf for DataFrame
Davies Liu
2015-02-04
2
-120
/
+113
*
[SPARK-5588] [SQL] support select/filter by SQL expression
Davies Liu
2015-02-04
1
-10
/
+43
*
[SPARK-5585] Flaky test in MLlib python
Davies Liu
2015-02-04
1
-1
/
+1
*
[SPARK-5379][Streaming] Add awaitTerminationOrTimeout
zsxwing
2015-02-04
1
-0
/
+9
*
[SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streaming
freeman
2015-02-03
2
-1
/
+30
*
[SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions
Reynold Xin
2015-02-03
1
-3
/
+2
*
[SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API
Davies Liu
2015-02-03
4
-447
/
+581
*
[SPARK-5536] replace old ALS implementation by the new one
Xiangrui Meng
2015-02-02
1
-8
/
+8
*
[SPARK-5012][MLLib][PySpark]Python API for Gaussian Mixture Model
FlytxtRnD
2015-02-02
4
-5
/
+147
*
[SPARK-5154] [PySpark] [Streaming] Kafka streaming support in Python
Davies Liu
2015-02-02
3
-2
/
+100
*
[SQL] Improve DataFrame API error reporting
Reynold Xin
2015-02-02
2
-25
/
+56
*
Make sure only owner can read / write to directories created for the job.
Marcelo Vanzin
2015-02-02
1
-1
/
+2
*
[SPARK-5094][MLlib] Add Python API for Gradient Boosted Trees
Kazuki Taniguchi
2015-01-30
2
-53
/
+209
*
[SPARK-5464] Fix help() for Python DataFrame instances
Josh Rosen
2015-01-29
2
-3
/
+13
*
[SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.
Reynold Xin
2015-01-29
1
-2
/
+2
*
[SPARK-5477] refactor stat.py
Xiangrui Meng
2015-01-29
4
-54
/
+96
*
[SPARK-5445][SQL] Made DataFrame dsl usable in Java
Reynold Xin
2015-01-28
1
-16
/
+22
*
[SPARK-5430] move treeReduce and treeAggregate from mllib to core
Xiangrui Meng
2015-01-28
1
-1
/
+90
*
[SPARK-4586][MLLIB] Python API for ML pipeline and parameters
Xiangrui Meng
2015-01-28
16
-16
/
+1124
*
[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible
Yandu Oppacher
2015-01-28
8
-71
/
+232
*
[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd
Michael Nazario
2015-01-28
1
-0
/
+14
*
SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs
Sandy Ryza
2015-01-28
1
-2
/
+2
*
[SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly
Winston Chen
2015-01-28
1
-0
/
+19
*
[SPARK-5097][SQL] DataFrame
Reynold Xin
2015-01-27
3
-336
/
+793
*
[SPARK-5063] More helpful error messages for several invalid operations
Josh Rosen
2015-01-23
2
-0
/
+19
*
[SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed
nate.crosswhite
2015-01-21
2
-3
/
+18
*
SPARK-5270 [CORE] Provide isEmpty() function in RDD API
Sean Owen
2015-01-19
1
-0
/
+12
*
[SPARK-5193][SQL] Remove Spark SQL Java-specific API.
Reynold Xin
2015-01-16
1
-36
/
+12
*
[SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.
Reynold Xin
2015-01-15
1
-8
/
+8
*
[SPARK-5224] [PySpark] improve performance of parallelize list/ndarray
Davies Liu
2015-01-15
2
-1
/
+5
*
[SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing
MechCoder
2015-01-14
2
-0
/
+29
*
[SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib
Davies Liu
2015-01-13
1
-4
/
+2
*
[SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple
Gabe Mulley
2015-01-12
1
-4
/
+14
*
[SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...
RJ Nowling
2015-01-08
1
-0
/
+187
*
[SPARK-5089][PYSPARK][MLLIB] Fix vector convert
freeman
2015-01-05
2
-1
/
+11
*
[SPARK-3325][Streaming] Add a parameter to the method print in class DStream
Yadong Qi
2015-01-02
1
-5
/
+7
*
[SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...
Brennon York
2014-12-27
1
-1
/
+1
*
[SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`
jbencook
2014-12-23
1
-0
/
+28
*
[SPARK-4822] Use sphinx tags for Python doc annotations
lewuathe
2014-12-17
6
-17
/
+17
*
[SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc
Joseph K. Bradley
2014-12-17
3
-30
/
+5
[next]