diff options
author | Arash Parsa <arash@ip-192-168-50-106.ec2.internal> | 2016-04-21 11:29:24 +0100 |
---|---|---|
committer | Sean Owen <sowen@cloudera.com> | 2016-04-21 11:29:24 +0100 |
commit | 2b8906c43760591f2e2da99bf0e34fa1bb63bfd1 (patch) | |
tree | b6ce6a45af31ec5957810e39f093b696322ea462 /LICENSE | |
parent | 8bd05c9db2e9c1c77fd06d490e5d4136acd6821c (diff) | |
download | spark-2b8906c43760591f2e2da99bf0e34fa1bb63bfd1.tar.gz spark-2b8906c43760591f2e2da99bf0e34fa1bb63bfd1.tar.bz2 spark-2b8906c43760591f2e2da99bf0e34fa1bb63bfd1.zip |
[SPARK-14739][PYSPARK] Fix Vectors parser bugs
## What changes were proposed in this pull request?
The PySpark deserialization has a bug that shows while deserializing all zero sparse vectors. This fix filters out empty string tokens before casting, hence properly stringified SparseVectors successfully get parsed.
## How was this patch tested?
Standard unit-tests similar to other methods.
Author: Arash Parsa <arash@ip-192-168-50-106.ec2.internal>
Author: Arash Parsa <arashpa@gmail.com>
Author: Vishnu Prasad <vishnu667@gmail.com>
Author: Vishnu Prasad S <vishnu667@gmail.com>
Closes #12516 from arashpa/SPARK-14739.
Diffstat (limited to 'LICENSE')
0 files changed, 0 insertions, 0 deletions