path: root/python/lib
author Davies Liu <davies@databricks.com> 2014-12-15 22:58:26 -0800
committer Josh Rosen <joshrosen@databricks.com> 2014-12-15 22:58:26 -0800
commit c246b95dd2f565043db429c38c6cc029a0b870c1
tree dfaabaae0428a63eff807510c918e932f0bb5df3 /python/lib
parent c7628771da9a7d4bd4d7abfdede37ce1568dcd01
[SPARK-4841] fix zip with textFile()
UTF8Deserializer cannot be used inside BatchedSerializer, so always use PickleSerializer() when changing the batch size in zip().

Also, if the two RDDs already have the same batch size, they no longer need to be re-serialized.

Author: Davies Liu <davies@databricks.com>

Closes #3706 from davies/fix_4841 and squashes the following commits:

20ce3a3 [Davies Liu] fix bug in _reserialize()
e3ebf7c [Davies Liu] add comment
379d2c8 [Davies Liu] fix zip with textFile()
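
The batch-size reasoning in the message can be illustrated with a small pure-Python model. This is only a sketch and not PySpark's code: the helper names (`batched`, `zip_batches`, `rebatch`, `zip_with_common_batch_size`) and the `common_size` parameter are hypothetical, and plain lists stand in for serialized batches. It shows why pairing batches positionally fails when the two sides use different batch sizes, and why no re-batching is needed when the sizes already match.

```python
from itertools import islice


def batched(items, batch_size):
    """Split items into lists of at most batch_size elements."""
    it = iter(items)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch


def zip_batches(left_batches, right_batches):
    """Pair elements batch by batch; fail if two paired batches differ in length."""
    pairs = []
    for lb, rb in zip(left_batches, right_batches):
        if len(lb) != len(rb):
            raise ValueError("batches have different sizes")
        pairs.extend(zip(lb, rb))
    return pairs


def rebatch(batches, batch_size):
    """Flatten the batches and regroup them with a new batch size."""
    flat = (x for batch in batches for x in batch)
    return list(batched(flat, batch_size))


def zip_with_common_batch_size(left, left_size, right, right_size, common_size=2):
    """Re-batch both sides only when their batch sizes differ (hypothetical helper)."""
    if left_size == right_size:
        # Same batch size already: pair the batches as they are.
        return zip_batches(left, right)
    return zip_batches(rebatch(left, common_size), rebatch(right, common_size))


# Text lines arrive one per batch, numbers arrive in batches of three.
lines = list(batched(["a", "b", "c", "d"], 1))
nums = list(batched([1, 2, 3, 4], 3))
result = zip_with_common_batch_size(lines, 1, nums, 3)
```

In this model, calling `zip_batches(lines, nums)` directly raises `ValueError`, while `zip_with_common_batch_size` regroups both sides into batches of two before pairing them.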
Diffstat (limited to 'python/lib')
0 files changed, 0 insertions, 0 deletions