diff options
author | Ankur Dave <ankurdave@gmail.com> | 2014-05-26 16:10:22 -0700 |
---|---|---|
committer | Reynold Xin <rxin@apache.org> | 2014-05-26 16:10:22 -0700 |
commit | 56c771cb2d00a5843c391ae6561536ee46e535d4 (patch) | |
tree | a6cf101ec4e203ec6b92f7217b1195b5d8660699 /dev/audit-release | |
parent | cb7fe5034826844f1b50fbe8b92646317b66f21c (diff) | |
download | spark-56c771cb2d00a5843c391ae6561536ee46e535d4.tar.gz spark-56c771cb2d00a5843c391ae6561536ee46e535d4.tar.bz2 spark-56c771cb2d00a5843c391ae6561536ee46e535d4.zip |
[SPARK-1931] Reconstruct routing tables in Graph.partitionBy
905173df57b90f90ebafb22e43f55164445330e6 introduced a bug in partitionBy where, after repartitioning the edges, it reuses the VertexRDD without updating the routing tables to reflect the new edge layout. Subsequent accesses of the triplets contain nulls for many vertex properties.
This commit adds a test for this bug and fixes it by introducing `VertexRDD#withEdges` and calling it in `partitionBy`.
Author: Ankur Dave <ankurdave@gmail.com>
Closes #885 from ankurdave/SPARK-1931 and squashes the following commits:
3930cdd [Ankur Dave] Note how to set up VertexRDD for efficient joins
9bdbaa4 [Ankur Dave] [SPARK-1931] Reconstruct routing tables in Graph.partitionBy
Diffstat (limited to 'dev/audit-release')
0 files changed, 0 insertions, 0 deletions