From 43588164336771f787d0d2cdf79f0d50ac828af4 Mon Sep 17 00:00:00 2001 From: Ankur Dave Date: Sat, 31 May 2014 03:21:26 +0000 Subject: Add a GraphX page at /graphx/ --- site/graphx/index.html | 293 +++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 293 insertions(+) create mode 100644 site/graphx/index.html (limited to 'site/graphx') diff --git a/site/graphx/index.html b/site/graphx/index.html new file mode 100644 index 000000000..ba1100039 --- /dev/null +++ b/site/graphx/index.html @@ -0,0 +1,293 @@ + + + + + + + + + GraphX | Apache Spark + + + + + + + + + + + + + + + + + +
+ +
+ +

+ + + + GraphX + +

+ +
+ + + + +
+
+
+
Latest News
+ +

Archive

+
+ +
+ +
+
+ GraphX is Apache Spark's API for graphs and graph-parallel computation. +
+ +
+
+

Flexibility

+

+ Seamlessly work with both graphs and collections. +

+

+ GraphX unifies ETL, exploratory analysis, and iterative graph computation within a single system. You can view the same data as both graphs and collections, transform and join graphs with RDDs efficiently, and write custom iterative graph algorithms using the Pregel API. +

+
+
+ +
+
+ graph = Graph(vertices, edges)
+ messages = spark.textFile("hdfs://...")
+ graph2 = graph.joinVertices(messages) {
+   (id, vertex, msg) => ...
+ } +
+
Using GraphX in Scala
+
+
+
+ +
+
+

Speed

+

+ Comparable performance to the fastest specialized graph processing systems. +

+

+ GraphX competes on performance with the fastest graph systems while retaining Spark's flexibility, fault tolerance, and ease of use. +

+
+
+
+ +
End-to-end PageRank performance (20 iterations, 3.7B edges)
+
+
+
+ +
+
+

Algorithms

+

+ Choose from a growing library of graph algorithms. +

+

In addition to a highly flexible API, GraphX comes with a variety of graph algorithms, many of which were contributed by our users.

+
+
+
    +
  • PageRank
  • +
  • Connected components
  • +
  • Label propagation
  • +
  • SVD++
  • +
  • Strongly connected components
  • +
  • Triangle count
  • +
+
+
+ +
+
+

Community

+

+ GraphX is developed as part of the Apache Spark project. It thus gets + tested and updated with each Spark release. +

+

+ If you have questions about the library, ask on the + Spark mailing lists. +

+

+ GraphX is in the alpha stage and welcomes contributions. If you'd like to submit a change to GraphX, + read how to + contribute to Spark and send us a patch! +

+
+ +
+

Getting Started

+

+ To get started with GraphX: +

+
    +
  • Download Spark. GraphX is included as a module.
  • +
  • Read the GraphX guide, which includes + usage examples.
  • +
  • Learn how to deploy Spark on a cluster + if you'd like to run in distributed mode. You can also run locally on a multicore machine + without any setup. +
  • +
+
+
+ + + +
+
+ + + + + +
+ + + + + + + -- cgit v1.2.3