diff options
author | Davies Liu <davies@databricks.com> | 2014-11-07 20:53:03 -0800 |
---|---|---|
committer | Josh Rosen <joshrosen@databricks.com> | 2014-11-07 20:53:03 -0800 |
commit | 7779109796c90d789464ab0be35917f963bbe867 (patch) | |
tree | f3086d40f5f144c436b70ecc81919d83cf5a30a6 /python/pyspark/tests.py | |
parent | 5923dd986ba26d0fcc8707dd8d16863f1c1005cb (diff) | |
download | spark-7779109796c90d789464ab0be35917f963bbe867.tar.gz spark-7779109796c90d789464ab0be35917f963bbe867.tar.bz2 spark-7779109796c90d789464ab0be35917f963bbe867.zip |
[SPARK-4304] [PySpark] Fix sort on empty RDD
This PR fix sortBy()/sortByKey() on empty RDD.
This should be back ported into 1.1/1.2
Author: Davies Liu <davies@databricks.com>
Closes #3162 from davies/fix_sort and squashes the following commits:
84f64b7 [Davies Liu] add tests
52995b5 [Davies Liu] fix sortByKey() on empty RDD
Diffstat (limited to 'python/pyspark/tests.py')
-rw-r--r-- | python/pyspark/tests.py | 3 |
1 files changed, 3 insertions, 0 deletions
diff --git a/python/pyspark/tests.py b/python/pyspark/tests.py index 9f625c5c6c..491e445a21 100644 --- a/python/pyspark/tests.py +++ b/python/pyspark/tests.py @@ -649,6 +649,9 @@ class RDDTests(ReusedPySparkTestCase): self.assertEquals(result.getNumPartitions(), 5) self.assertEquals(result.count(), 3) + def test_sort_on_empty_rdd(self): + self.assertEqual([], self.sc.parallelize(zip([], [])).sortByKey().collect()) + def test_sample(self): rdd = self.sc.parallelize(range(0, 100), 4) wo = rdd.sample(False, 0.1, 2).collect() |