diff options
Diffstat (limited to 'site/faq.html')
-rw-r--r-- | site/faq.html | 160 |
1 files changed, 160 insertions, 0 deletions
diff --git a/site/faq.html b/site/faq.html new file mode 100644 index 000000000..699b33f61 --- /dev/null +++ b/site/faq.html @@ -0,0 +1,160 @@ +<!DOCTYPE html> +<!--[if IE 6]> +<html id="ie6" dir="ltr" lang="en-US"> +<![endif]--> +<!--[if IE 7]> +<html id="ie7" dir="ltr" lang="en-US"> +<![endif]--> +<!--[if IE 8]> +<html id="ie8" dir="ltr" lang="en-US"> +<![endif]--> +<!--[if !(IE 6) | !(IE 7) | !(IE 8) ]><!--> +<html dir="ltr" lang="en-US"> +<!--<![endif]--> +<head> + <link rel="shortcut icon" href="favicon.ico" /> + <meta charset="UTF-8" /> + <meta name="viewport" content="width=device-width" /> + <title>FAQ | Spark</title> + + <link rel="stylesheet" type="text/css" media="all" href="/css/style.css" /> + <link rel="stylesheet" href="/css/pygments-default.css"> + + <script type="text/javascript">//<![CDATA[ + // Google Analytics for WordPress by Yoast v4.2.8 | http://yoast.com/wordpress/google-analytics/ + var _gaq = _gaq || []; + _gaq.push(['_setAccount', 'UA-32518208-1']); + _gaq.push(['_trackPageview']); + (function () { + var ga = document.createElement('script'); + ga.type = 'text/javascript'; + ga.async = true; + ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; + var s = document.getElementsByTagName('script')[0]; + s.parentNode.insertBefore(ga, s); + })(); + //]]></script> + + <link rel='canonical' href='/index.html' /> + + <style type="text/css"> + #site-title, + #site-description { + position: absolute !important; + clip: rect(1px 1px 1px 1px); /* IE6, IE7 */ + clip: rect(1px, 1px, 1px, 1px); + } + </style> + <style type="text/css" id="custom-background-css"> + body.custom-background { background-color: #f1f1f1; } + </style> +</head> + +<!--body class="page singular"--> +<body class="page singular"> +<div id="page" class="hfeed"> + + <header id="branding" role="banner"> + <hgroup> + <h1 id="site-title"><span><a href="/" title="Spark" rel="home">Spark</a></span></h1> + <h2 id="site-description">Lightning-Fast Cluster Computing</h2> + </hgroup> + + <a href="/"> + <img src="/images/spark-project-header1.png" width="1000" height="220" alt="Spark: Lightning-Fast Cluster Computing" title="Spark: Lightning-Fast Cluster Computing" /> + </a> + + <nav id="access" role="navigation"> + <h3 class="assistive-text">Main menu</h3> + <div class="menu-main-menu-container"> + <ul id="menu-main-menu" class="menu"> + + <li class="menu-item menu-item-type-post_type menu-item-object-page "> + <a href="/index.html">Home</a> + </li> + + <li class="menu-item menu-item-type-post_type menu-item-object-page "> + <a href="/downloads.html">Downloads</a> + </li> + + <li class="menu-item menu-item-type-post_type menu-item-object-page "> + <a href="/documentation.html">Documentation</a> + </li> + + <li class="menu-item menu-item-type-post_type menu-item-object-page "> + <a href="/examples.html">Examples</a> + </li> + + <li class="menu-item menu-item-type-post_type menu-item-object-page "> + <a href="/mailing-lists.html">Mailing Lists</a> + </li> + + <li class="menu-item menu-item-type-post_type menu-item-object-page "> + <a href="/research.html">Research</a> + </li> + + <li class="menu-item menu-item-type-post_type menu-item-object-page current-menu-item"> + <a href="/faq.html">FAQ</a> + </li> + + </ul></div> + </nav><!-- #access --> +</header><!-- #branding --> + + + + <div id="main"> + <div id="primary"> + <div id="content" role="main"> + + <article class="page type-page status-publish hentry"> + <h2>Spark FAQ</h2> + +<p class="question">Is Spark a modified version of Hadoop?</p> +<p class="answer">No. Spark is a completely separate codebase optimized for low latency, although it can load data from any Hadoop input source (InputFormat).</p> + +<p class="question">Which languages does Spark support?</p> +<p class="answer">Starting in version 0.7, Spark supports Scala, Java and Python.</p> + +<p class="question">Does Spark require modified versions of Scala or Python?</p> +<p class="answer">No. Spark requires no changes to Scala or compiler plugins. The Python API uses the standard CPython implementation, and can call into existing C libraries for Python such as NumPy.</p> + +<p class="question">What happens when a cached dataset does not fit in memory?</p> +<p class="answer">Spark can either spill it to disk or recompute the partitions that don't fit in RAM each time they are requested. By default, it uses recomputation, but you can set a dataset's <a href="/docs/latest/scala-programming-guide.html#rdd-persistence">storage level</a> to <tt>MEMORY_AND_DISK</tt> to avoid this. </p> + +<p class="question">How can I run Spark on a cluster?</p> +<p class="answer">You can use either the <a href="/docs/latest/spark-standalone.html">standalone deploy mode</a>, which depends only on Java, or the <a href="/docs/latest/running-on-mesos.html">Apache Mesos</a> cluster manager.</p> +<p>Note that you can also run Spark locally (possibly on multiple cores) without any special setup by just passing <tt>local[N]</tt> as the master URL, where <tt>N</tt> is the number of parallel threads you want.</p> + +<p class="question">I don't know Scala; how hard is it to pick it up to use Spark?</p> +<p class="answer">Scala itself is pretty easy to pick up if you have Java experience. Check out <a href="http://www.artima.com/scalazine/articles/steps.html">First Steps to Scala</a> for a quick introduction, the <a href="http://www.scala-lang.org/docu/files/ScalaTutorial.pdf">Scala tutorial for Java programmers</a>, or the free online book <a href="http://www.artima.com/pins1ed/">Programming in Scala</a>.</p> +<p>Spark 0.6 also added a <a href="/docs/latest/java-programming-guide.html">Java API</a>, letting you use Spark from Java, and Spark 0.7 added a <a href="/docs/latest/python-programming-guide.html">Python API</a>.</p> + +<p class="question">What license is Spark under?</p> +<p class="answer">Spark is open source under the liberal <a href="https://github.com/mesos/spark/blob/master/LICENSE">BSD license</a>.</p> + +<p class="question">How can I contribute to Spark?</p> +<p class="answer">Contact the <a href="http://groups.google.com/group/spark-users">mailing list</a> or send us a pull request on GitHub. We're glad to hear about your experience using Spark and to accept patches </p> +<p>If you would like to report an issue, post it to the <a href="https://spark-project.atlassian.net/browse/SPARK">Spark issue tracker</a>.</p> + +<p class="question">Where can I get more help?</p> +<p class="answer">Please post on the <a href="http://groups.google.com/group/spark-users">spark-users</a> mailing list. We'll be glad to help!</p> + + </article><!-- #post --> + + </div><!-- #content --> + + <footer id="colophon" role="contentinfo"> + <div id="site-generator"> + <p>Spark is an open source project developed at the UC Berkeley <a href="https://amplab.cs.berkeley.edu">AMPLab</a>.</p> + <a class="amp-logo" style="background:url(/images/amplab-small.png)" href="https://amplab.cs.berkeley.edu/" title="Brought to you by the UC Berkeley AMPLab." rel="generator"><!--Brought to you by the UC Berkeley AMPLab--> </a> + </div> +</footer><!-- #colophon --> + + </div><!-- #primary --> + </div><!-- #main --> +</div><!-- #page --> + + +</body> +</html> |