<!DOCTYPE html>
<!--[if IE 6]>
<html id="ie6" dir="ltr" lang="en-US">
<![endif]-->
<!--[if IE 7]>
<html id="ie7" dir="ltr" lang="en-US">
<![endif]-->
<!--[if IE 8]>
<html id="ie8" dir="ltr" lang="en-US">
<![endif]-->
<!--[if !(IE 6) & !(IE 7) & !(IE 8)]><!-->
<html dir="ltr" lang="en-US">
<!--<![endif]-->
<head>
<link rel="shortcut icon" href="/favicon.ico" />
<meta charset="UTF-8" />
<meta name="viewport" content="width=device-width" />
<title>
Spark Release 0.7.2 | Apache Spark
</title>
<link rel="stylesheet" type="text/css" media="all" href="/css/style.css" />
<link rel="stylesheet" href="/css/pygments-default.css">
<link rel='canonical' href='/index.html' />
<style type="text/css">
#site-title,
#site-description {
position: absolute !important;
clip: rect(1px 1px 1px 1px); /* IE6, IE7 */
clip: rect(1px, 1px, 1px, 1px);
}
</style>
<style type="text/css" id="custom-background-css">
body.custom-background { background-color: #f1f1f1; }
</style>
</head>
<body class="singular">
<div id="page" class="hfeed">
<header id="branding" role="banner">
<hgroup>
<h1 id="site-title"><span><a href="/" title="Spark" rel="home">Spark</a></span></h1>
<h2 id="site-description">Lightning-Fast Cluster Computing</h2>
</hgroup>
<a id="main-logo" href="/">
<img style="height:175px; width:auto;" src="/images/spark-project-header1-cropped.png" alt="Spark: Lightning-Fast Cluster Computing" title="Spark: Lightning-Fast Cluster Computing" />
</a>
<nav id="access" role="navigation">
<h3 class="assistive-text">Main menu</h3>
<div class="menu-main-menu-container">
<ul id="menu-main-menu" class="menu">
<li class="menu-item menu-item-type-post_type menu-item-object-page ">
<a href="/index.html">Home</a>
</li>
<li class="menu-item menu-item-type-post_type menu-item-object-page ">
<a href="/downloads.html">Downloads</a>
</li>
<li class="menu-item menu-item-type-post_type menu-item-object-page ">
<a href="/documentation.html">Documentation</a>
</li>
<li class="menu-item menu-item-type-post_type menu-item-object-page ">
<a href="/examples.html">Examples</a>
</li>
<li class="menu-item menu-item-type-post_type menu-item-object-page ">
<a href="/mailing-lists.html">Mailing Lists</a>
</li>
<li class="menu-item menu-item-type-post_type menu-item-object-page ">
<a href="/research.html">Research</a>
</li>
<li class="menu-item menu-item-type-post_type menu-item-object-page ">
<a href="/faq.html">FAQ</a>
</li>
</ul></div>
</nav><!-- #access -->
</header><!-- #branding -->
<div id="main">
<div id="primary">
<div id="content" role="main">
<article class="page type-page status-publish hentry">
<h2>Spark Release 0.7.2</h2>
<p>Spark 0.7.2 is a maintenance release that contains multiple bug fixes and improvements. You can download it as a <a href="http://spark-project.org/download-spark-0.7.2-sources">source package</a> (4 MB tar.gz) or get prebuilt packages for <a href="http://spark-project.org/download-spark-0.7.2-prebuilt-hadoop1">Hadoop 1 / CDH3</a> or <a href="http://spark-project.org/download-spark-0.7.2-prebuilt-cdh4">CDH 4</a> (61 MB tar.gz).</p>
<p>We recommend that all users update to this maintenance release.</p>
<p>The fixes and improvements in this version include:</p>
<ul>
<li>Scala version updated to 2.9.3.</li>
<li>Several improvements to Bagel, including performance fixes and a configurable storage level.</li>
<li>New API methods: subtractByKey, foldByKey, mapWith, filterWith, foreachPartition, and others.</li>
<li>A new metrics reporting interface, SparkListener, that collects information about each computation stage, such as task lengths and bytes shuffled.</li>
<li>Several new examples using the Java API, including K-means and computing pi.</li>
<li>Support for launching multiple worker instances per host in standalone mode.</li>
<li>Various bug fixes across the board.</li>
</ul>
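<p>As a rough illustration of two of the new API methods listed above, the following plain-Python sketch models what subtractByKey and foldByKey compute on an in-memory list of key/value pairs. It does not use Spark; the helper names <code>subtract_by_key</code> and <code>fold_by_key</code> and the sample data are hypothetical and exist only for this example.</p>

```python
def subtract_by_key(left, right):
    """Keep the (key, value) pairs of `left` whose key never appears in `right`."""
    right_keys = {key for key, _ in right}
    return [(key, value) for key, value in left if key not in right_keys]


def fold_by_key(pairs, zero, func):
    """Combine the values of each key with `func`, starting from `zero`."""
    result = {}
    for key, value in pairs:
        # Start from the zero value the first time a key is seen.
        result[key] = func(result.get(key, zero), value)
    return result


# Hypothetical sample data: (key, value) pairs.
sales = [("apples", 3), ("pears", 5), ("apples", 4)]
discontinued = [("pears", None)]

remaining = subtract_by_key(sales, discontinued)
totals = fold_by_key(sales, 0, lambda acc, value: acc + value)

print(remaining)
print(totals)
```

<p>In Spark itself these operations run on distributed datasets rather than local lists, and the zero value passed to foldByKey may be applied more than once, so it should be a neutral element for the combining function (0 for addition, as assumed here).</p>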
<p>The following people contributed to this release:</p>
<ul>
<li>Jey Kottalam (Maven build, bug fixes, EC2 scripts, packaging the release)</li>
<li>Andrew Ash (bug fixes, docs)</li>
<li>Andrey Kouznetsov (bug fixes)</li>
<li>Andy Konwinski (docs)</li>
<li>Charles Reiss (bug fixes)</li>
<li>Christoph Grothaus (bug fixes)</li>
<li>Erik van Oosten (bug fixes)</li>
<li>Giovanni Delussu (bug fixes)</li>
<li>Hiral Patel (bug fixes)</li>
<li>Holden Karau (error reporting, EC2 scripts)</li>
<li>Imran Rashid (metrics reporting system)</li>
<li>Josh Rosen (EC2 scripts)</li>
<li>Mark Hamstra (new API methods, tests)</li>
<li>Mikhail Bautin (build)</li>
<li>Mosharaf Chowdhury (bug fixes)</li>
<li>Nick Pentreath (Bagel, examples)</li>
<li>Patrick Wendell (bug fixes)</li>
<li>Reynold Xin (bug fixes)</li>
<li>Stephen Haberman (bug fixes, tests, subtractByKey)</li>
<li>Kalpit Shah (build, multiple workers per host)</li>
<li>Mike Potts (run scripts)</li>
<li>Matei Zaharia (Bagel, bug fixes, build)</li>
</ul>
<p>We thank everyone who helped with this release, and hope to see more contributions from you in the future!</p>
</article><!-- #post -->
</div><!-- #content -->
<footer id="colophon" role="contentinfo">
<div id="site-generator">
<p style="padding-top: 0; padding-bottom: 15px;">
Apache Spark is an effort undergoing incubation at The Apache Software Foundation.
<a href="http://incubator.apache.org/" style="border: none;">
<img style="vertical-align: middle; border: none;" src="/images/incubator-logo.png" alt="Apache Incubator" title="Apache Incubator" />
</a>
</p>
</div>
</footer><!-- #colophon -->
</div><!-- #primary -->
</div><!-- #main -->
</div><!-- #page -->
</body>
</html>