aboutsummaryrefslogtreecommitdiff
path: root/docs/graphx-programming-guide.md
diff options
context:
space:
mode:
authorBrennon York <brennon.york@capitalone.com>2015-03-26 19:08:09 -0700
committerAnkur Dave <ankurdave@gmail.com>2015-03-26 19:08:09 -0700
commit39fb57968352549f2276ac4fcd2b92988ed6fe42 (patch)
tree811c1ecc0e813d299cdfdac6f1d6521883beb4b8 /docs/graphx-programming-guide.md
parentaad00322765d6041e817a6bd3fcff2187d212057 (diff)
downloadspark-39fb57968352549f2276ac4fcd2b92988ed6fe42.tar.gz
spark-39fb57968352549f2276ac4fcd2b92988ed6fe42.tar.bz2
spark-39fb57968352549f2276ac4fcd2b92988ed6fe42.zip
[SPARK-6510][GraphX]: Add Graph#minus method to act as Set#difference
Adds a `Graph#minus` method which will return only unique `VertexId`'s from the calling `VertexRDD`. To demonstrate a basic example with pseudocode: ``` Set((0L,0),(1L,1)).minus(Set((1L,1),(2L,2))) > Set((0L,0)) ``` Author: Brennon York <brennon.york@capitalone.com> Closes #5175 from brennonyork/SPARK-6510 and squashes the following commits: 248d5c8 [Brennon York] added minus(VertexRDD[VD]) method to avoid createUsingIndex and updated the mask operations to simplify with andNot call 3fb7cce [Brennon York] updated graphx doc to reflect the addition of minus method 6575d92 [Brennon York] updated mima exclude aaa030b [Brennon York] completed graph#minus functionality 7227c0f [Brennon York] beginning work on minus functionality
Diffstat (limited to 'docs/graphx-programming-guide.md')
-rw-r--r--docs/graphx-programming-guide.md2
1 files changed, 2 insertions, 0 deletions
diff --git a/docs/graphx-programming-guide.md b/docs/graphx-programming-guide.md
index c601d793a2..3f10cb2dc3 100644
--- a/docs/graphx-programming-guide.md
+++ b/docs/graphx-programming-guide.md
@@ -899,6 +899,8 @@ class VertexRDD[VD] extends RDD[(VertexID, VD)] {
// Transform the values without changing the ids (preserves the internal index)
def mapValues[VD2](map: VD => VD2): VertexRDD[VD2]
def mapValues[VD2](map: (VertexId, VD) => VD2): VertexRDD[VD2]
+ // Show only vertices unique to this set based on their VertexId's
+ def minus(other: RDD[(VertexId, VD)])
// Remove vertices from this set that appear in the other set
def diff(other: VertexRDD[VD]): VertexRDD[VD]
// Join operators that take advantage of the internal indexing to accelerate joins (substantially)