index : spark
Mirror of Apache Spark
path: root/python/pyspark/tests.py
Commit message | Author | Age | Files | Lines (-removed/+added)
[SPARK-8202] [PYSPARK] fix infinite loop during external sort in PySpark | Davies Liu | 2015-06-18 | 1 | -1/+4
[SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling... | zsxwing | 2015-06-17 | 1 | -0/+8
[SPARK-7711] Add a startTime property to match the corresponding one in Scala | Holden Karau | 2015-05-21 | 1 | -0/+4
[SPARK-7150] SparkContext.range() and SQLContext.range() | Daoyuan Wang | 2015-05-18 | 1 | -0/+5
[SPARK-6216] [PYSPARK] check python version of worker with driver | Davies Liu | 2015-05-18 | 1 | -3/+3
[SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct | Vinod K C | 2015-05-09 | 1 | -1/+0
[SPARK-6953] [PySpark] speed up python tests | Reynold Xin | 2015-04-21 | 1 | -35/+61
[SPARK-4897] [PySpark] Python 3 support | Davies Liu | 2015-04-16 | 1 | -152/+175
[SPARK-6886] [PySpark] fix big closure with shuffle | Davies Liu | 2015-04-15 | 1 | -4/+2
[SPARK-6216] [PySpark] check the python version in worker | Davies Liu | 2015-04-10 | 1 | -0/+16
[SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey. | Milan Straka | 2015-04-10 | 1 | -0/+11
[SPARK-3074] [PySpark] support groupByKey() with single huge key | Davies Liu | 2015-04-09 | 1 | -4/+46
[SPARK-6294] fix hang when call take() in JVM on PythonRDD | Davies Liu | 2015-03-12 | 1 | -0/+5
[SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer | Davies Liu | 2015-02-24 | 1 | -0/+6
[SPARK-5811] Added documentation for maven coordinates and added Spark Packag... | Burak Yavuz | 2015-02-17 | 1 | -4/+65
[SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark | Davies Liu | 2015-02-17 | 1 | -1/+37
[SPARK-4172] [PySpark] Progress API in Python | Davies Liu | 2015-02-17 | 1 | -0/+31
[SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API | Davies Liu | 2015-02-03 | 1 | -261/+0
[SPARK-5154] [PySpark] [Streaming] Kafka streaming support in Python | Davies Liu | 2015-02-02 | 1 | -1/+9
[SQL] Improve DataFrame API error reporting | Reynold Xin | 2015-02-02 | 1 | -2/+4
[SPARK-5464] Fix help() for Python DataFrame instances | Josh Rosen | 2015-01-29 | 1 | -0/+10
[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible | Yandu Oppacher | 2015-01-28 | 1 | -9/+31
[SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly | Winston Chen | 2015-01-28 | 1 | -0/+19
[SPARK-5097][SQL] DataFrame | Reynold Xin | 2015-01-27 | 1 | -69/+86
[SPARK-4866] support StructType as key in MapType | Davies Liu | 2014-12-16 | 1 | -0/+8
[SPARK-4841] fix zip with textFile() | Davies Liu | 2014-12-15 | 1 | -0/+9
[SPARK-4548] []SPARK-4517] improve performance of python broadcast | Davies Liu | 2014-11-24 | 1 | -14/+4
[SPARK-4578] fix asDict() with nested Row() | Davies Liu | 2014-11-24 | 1 | -3/+4
[SPARK-3721] [PySpark] broadcast objects larger than 2G | Davies Liu | 2014-11-18 | 1 | -2/+50
[SPARK-4304] [PySpark] Fix sort on empty RDD | Davies Liu | 2014-11-07 | 1 | -0/+3
[SPARK-4186] add binaryFiles and binaryRecords in Python | Davies Liu | 2014-11-06 | 1 | -0/+19
[SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa... | Davies Liu | 2014-11-03 | 1 | -54/+12
[SPARK-4192][SQL] Internal API for Python UDT | Xiangrui Meng | 2014-11-03 | 1 | -1/+92
[SPARK-3594] [PySpark] [SQL] take more rows to infer schema or sampling | Davies Liu | 2014-11-03 | 1 | -0/+19
[SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sample | Xiangrui Meng | 2014-11-03 | 1 | -0/+15
[SPARK-4133] [SQL] [PySpark] type conversionfor python udf | Davies Liu | 2014-10-28 | 1 | -3/+13
[SPARK-4051] [SQL] [PySpark] Convert Row into dictionary | Davies Liu | 2014-10-24 | 1 | -0/+9
[SPARK-3993] [PySpark] fix bug while reuse worker after take() | Davies Liu | 2014-10-23 | 1 | -1/+18
Fix for sampling error in NumPy v1.9 [SPARK-3995][PYSPARK] | freeman | 2014-10-22 | 1 | -0/+6
[SPARK-3855][SQL] Preserve the result attribute of python UDFs though transfo... | Michael Armbrust | 2014-10-17 | 1 | -0/+6
[SPARK-3867][PySpark] ./python/run-tests failed when it run with Python 2.6 a... | cocoatomo | 2014-10-11 | 1 | -1/+5
[SPARK-3786] [PySpark] speedup tests | Davies Liu | 2014-10-06 | 1 | -50/+42
[SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDD | Davies Liu | 2014-10-01 | 1 | -2/+6
[SPARK-3478] [PySpark] Profile the Python tasks | Davies Liu | 2014-09-30 | 1 | -0/+30
[SPARK-3681] [SQL] [PySpark] fix serialization of List and Map in SchemaRDD | Davies Liu | 2014-09-27 | 1 | -0/+21
Revert "[SPARK-3478] [PySpark] Profile the Python tasks" | Josh Rosen | 2014-09-26 | 1 | -30/+0
[SPARK-3478] [PySpark] Profile the Python tasks | Davies Liu | 2014-09-26 | 1 | -0/+30
[SPARK-3679] [PySpark] pickle the exact globals of functions | Davies Liu | 2014-09-24 | 1 | -0/+18
[SPARK-3634] [PySpark] User's module should take precedence over system modules | Davies Liu | 2014-09-24 | 1 | -0/+12
[PySpark] remove unnecessary use of numSlices from pyspark tests | Matthew Farrellee | 2014-09-20 | 1 | -2/+2