<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>
Spark Release 1.0.2 | Apache Spark
</title>
<!-- Bootstrap core CSS -->
<link href="/css/cerulean.min.css" rel="stylesheet">
<link href="/css/custom.css" rel="stylesheet">
<script type="text/javascript">
<!-- Google Analytics initialization -->
var _gaq = _gaq || [];
_gaq.push(['_setAccount', 'UA-32518208-2']);
_gaq.push(['_trackPageview']);
(function() {
var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true;
ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js';
var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s);
})();
<!-- Adds slight delay to links to allow async reporting -->
function trackOutboundLink(link, category, action) {
try {
_gaq.push(['_trackEvent', category , action]);
} catch(err){}
setTimeout(function() {
document.location.href = link.href;
}, 100);
}
</script>
<!-- HTML5 shim and Respond.js IE8 support of HTML5 elements and media queries -->
<!--[if lt IE 9]>
<script src="https://oss.maxcdn.com/libs/html5shiv/3.7.0/html5shiv.js"></script>
<script src="https://oss.maxcdn.com/libs/respond.js/1.3.0/respond.min.js"></script>
<![endif]-->
</head>
<body>
<script src="https://code.jquery.com/jquery.js"></script>
<script src="//netdna.bootstrapcdn.com/bootstrap/3.0.3/js/bootstrap.min.js"></script>
<script src="/js/lang-tabs.js"></script>
<script src="/js/downloads.js"></script>
<div class="container" style="max-width: 1200px;">
<div class="masthead">
<p class="lead">
<a href="/">
<img src="/images/spark-logo.png"
style="height:100px; width:auto; vertical-align: bottom; margin-top: 20px;"></a><span class="tagline">
Lightning-fast cluster computing
</span>
</p>
</div>
<nav class="navbar navbar-default" role="navigation">
<!-- Brand and toggle get grouped for better mobile display -->
<div class="navbar-header">
<button type="button" class="navbar-toggle" data-toggle="collapse"
data-target="#navbar-collapse-1">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
</div>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse" id="navbar-collapse-1">
<ul class="nav navbar-nav">
<li><a href="/downloads.html">Download</a></li>
<li class="dropdown">
<a href="#" class="dropdown-toggle" data-toggle="dropdown">
Related Projects <b class="caret"></b>
</a>
<ul class="dropdown-menu">
<li><a href="/sql/">Spark SQL</a></li>
<li><a href="/streaming/">Spark Streaming</a></li>
<li><a href="/mllib/">MLlib (machine learning)</a></li>
<li><a href="/graphx/">GraphX (graph)</a></li>
</ul>
</li>
<li class="dropdown">
<a href="#" class="dropdown-toggle" data-toggle="dropdown">
Documentation <b class="caret"></b>
</a>
<ul class="dropdown-menu">
<li><a href="/documentation.html">Overview</a></li>
<li><a href="/docs/latest/">Latest Release (Spark 1.1.0)</a></li>
</ul>
</li>
<li class="dropdown">
<a href="#" class="dropdown-toggle" data-toggle="dropdown">
Community <b class="caret"></b>
</a>
<ul class="dropdown-menu">
<li><a href="/community.html">Mailing Lists</a></li>
<li><a href="/community.html#events">Events and Meetups</a></li>
<li><a href="/community.html#history">Project History</a></li>
<li><a href="https://cwiki.apache.org/confluence/display/SPARK/Powered+By+Spark">Powered By</a></li>
</ul>
</li>
<li><a href="/examples.html">Examples</a></li>
<li><a href="/faq.html">FAQ</a></li>
</ul>
</div>
<!-- /.navbar-collapse -->
</nav>
<div class="row">
<div class="col-md-3 col-md-push-9">
<div class="news" style="margin-bottom: 20px;">
<h5>Latest News</h5>
<ul class="list-unstyled">
<li><a href="/news/spark-wins-daytona-gray-sort-100tb-benchmark.html">Spark wins Daytona Gray Sort 100TB Benchmark</a>
<span class="small">(Nov 05, 2014)</span></li>
<li><a href="/news/proposals-open-for-spark-summit-east.html">Submissions open for Spark Summit East 2015 in New York</a>
<span class="small">(Oct 18, 2014)</span></li>
<li><a href="/news/spark-1-1-0-released.html">Spark 1.1.0 released</a>
<span class="small">(Sep 11, 2014)</span></li>
<li><a href="/news/spark-1-0-2-released.html">Spark 1.0.2 released</a>
<span class="small">(Aug 05, 2014)</span></li>
</ul>
<p class="small" style="text-align: right;"><a href="/news/index.html">Archive</a></p>
</div>
<div class="hidden-xs hidden-sm">
<a href="/downloads.html" class="btn btn-success btn-lg btn-block" style="margin-bottom: 30px;">
Download Spark
</a>
<p style="font-size: 16px; font-weight: 500; color: #555;">
Related Projects:
</p>
<ul class="list-narrow">
<li><a href="/sql/">Spark SQL</a></li>
<li><a href="/streaming/">Spark Streaming</a></li>
<li><a href="/mllib/">MLlib (machine learning)</a></li>
<li><a href="/graphx/">GraphX (graph)</a></li>
</ul>
</div>
</div>
<div class="col-md-9 col-md-pull-3">
<h2>Spark Release 1.0.2</h2>
<p>Spark 1.0.2 is a maintenance release with bug fixes. This release is based on the <a href="https://github.com/apache/spark/tree/branch-1.0">branch-1.0</a> maintenance branch of Spark. We recommend that all 1.0.x users upgrade to this stable release. Contributions to this release came from 30 developers.</p>
<p>You can download Spark 1.0.2 as either a
<a href="http://d3kbcqa49mib13.cloudfront.net/spark-1.0.2.tgz" onclick="trackOutboundLink(this, 'Release Download Links', 'cloudfront_spark-1.0.2.tgz'); return false;">source package</a>
(6 MB tgz) or a prebuilt package for
<a href="http://d3kbcqa49mib13.cloudfront.net/spark-1.0.2-bin-hadoop1.tgz" onclick="trackOutboundLink(this, 'Release Download Links', 'cloudfront_spark-1.0.2-bin-hadoop1.tgz'); return false;">Hadoop 1 / CDH3</a> (156 MB tgz),
<a href="http://d3kbcqa49mib13.cloudfront.net/spark-1.0.2-bin-cdh4.tgz" onclick="trackOutboundLink(this, 'Release Download Links', 'cloudfront_spark-1.0.2-bin-cdh4.tgz'); return false;">CDH4</a> (161 MB tgz), or
<a href="http://d3kbcqa49mib13.cloudfront.net/spark-1.0.2-bin-hadoop2.tgz" onclick="trackOutboundLink(this, 'Release Download Links', 'cloudfront_spark-1.0.2-bin-hadoop2.tgz'); return false;">Hadoop 2 / CDH5 / HDP2</a>
(168 MB tgz). Release signatures and checksums are available at the official <a href="http://www.apache.org/dist/spark/spark-1.0.2/">Apache download site</a>.</p>
<h3 id="fixes">Fixes</h3>
<p>Spark 1.0.2 contains bug fixes in several components. Some of the more important fixes are highlighted below. You can visit the <a href="http://s.apache.org/9NJ">Spark issue tracker</a> for the full list of fixes.</p>
<h4 id="spark-core">Spark Core</h4>
<ul>
<li>Avoid pulling in the entire RDD or PairRDDFunctions in various operators (<a href="https://issues.apache.org/jira/browse/SPARK-2534">SPARK-2534</a>)</li>
<li>RangePartitioner’s binary search does not use the given Ordering (<a href="https://issues.apache.org/jira/browse/SPARK-2598">SPARK-2598</a>)</li>
<li>Exception in accumulator update should not crash DAGScheduler and SparkContext (<a href="https://issues.apache.org/jira/browse/SPARK-2323">SPARK-2323</a>)</li>
</ul>
<h4 id="sql">SQL</h4>
<ul>
<li>Slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark SQL query on an HDFS CSV file (<a href="https://issues.apache.org/jira/browse/SPARK-2576">SPARK-2576</a>)</li>
<li>Concurrent initialization of various DataType objects causes exceptions (<a href="https://issues.apache.org/jira/browse/SPARK-2498">SPARK-2498</a>)</li>
<li>Multiple instances of an InMemoryRelation in a single plan result in recaching (<a href="https://issues.apache.org/jira/browse/SPARK-2405">SPARK-2405</a>)</li>
</ul>
<h4 id="pyspark">PySpark</h4>
<ul>
<li>Make hash of None consistent across machines (<a href="https://issues.apache.org/jira/browse/SPARK-2494">SPARK-2494</a>)</li>
</ul>
<h4 id="mllib">MLlib</h4>
<ul>
<li>mutable.BitSet in ALS not serializable with KryoSerializer (<a href="https://issues.apache.org/jira/browse/SPARK-1977">SPARK-1977</a>)</li>
<li>Fix bin offset in DecisionTree node aggregations (<a href="https://issues.apache.org/jira/browse/SPARK-2152">SPARK-2152</a>)</li>
</ul>
<h4 id="streaming">Streaming</h4>
<ul>
<li>Ability to limit the receiver data rate to prevent data overload and Spark crashes (<a href="https://issues.apache.org/jira/browse/SPARK-1341">SPARK-1341</a>)</li>
<li>File stream processes existing files in a directory even if newFilesOnly = true (<a href="https://issues.apache.org/jira/browse/SPARK-2362">SPARK-2362</a>)</li>
<li>QueueInputDStream with oneAtATime=false does not dequeue items (<a href="https://issues.apache.org/jira/browse/SPARK-2343">SPARK-2343</a>)</li>
</ul>
<h4 id="graphx">GraphX</h4>
<ul>
<li>VertexPartition is not serializable (<a href="https://issues.apache.org/jira/browse/SPARK-2455">SPARK-2455</a>)</li>
</ul>
<h3 id="contributors">Contributors</h3>
<p>The following developers contributed to this release:</p>
<ul>
<li>Aaron Davidson - Bug fixes in core</li>
<li>Aaron Staple - Bug fix in SQL</li>
<li>Andrew Or - Bug fix in core</li>
<li>Ankur Dave - Bug fix in GraphX</li>
<li>Artjom-Metro - Bug fix in examples</li>
<li>Basit Mustafa - Added t2 EC2 instance support</li>
<li>Cesar Arevalo - Doc fix</li>
<li>Cheng Hao - Bug fix in SQL</li>
<li>Daniel Darabos - Bug fix in core</li>
<li>Davies Liu - Bug fix in PySpark</li>
<li>Gabriele Nizzoli - Bug fix in Streaming</li>
<li>Hossein - Bug fix in core</li>
<li>Issac Buenrostro - Added support for throttling Streaming receiver</li>
<li>Manuel Laflamme - Bug fix in Streaming</li>
<li>Michael Armbrust - Bug fix and performance improvements in SQL</li>
<li>Neville Li - Bug fix in MLlib</li>
<li>Patrick Wendell - Bug fixes in core</li>
<li>Reynold Xin - Bug fixes in core and SQL</li>
<li>Sarah Gerweck - Bug fix in core</li>
<li>Takuya UESHIN - Bug fixes in SQL</li>
<li>Tathagata Das - Bug fix in Streaming</li>
<li>William Benton - Bug fix in SQL</li>
<li>Yin Huai - Bug fixes in SQL</li>
<li>Zongheng Yang - Bug fixes in SQL</li>
<li>baishuo(白硕) - Bug fix in SQL</li>
<li>johnnywalleye - Bug fixes in MLlib</li>
<li>joyyoj - Bug fix in Streaming</li>
<li>kballou - Doc fix</li>
<li>lianhuiwang - Doc fix</li>
<li>witgo - Bug fix in sbt</li>
</ul>
<p><em>Thanks to everyone who contributed!</em></p>
<p>
<br/>
<a href="/news/">Spark News Archive</a>
</p>
</div>
</div>
<footer class="small">
<hr>
Apache Spark, Spark, Apache, and the Spark logo are trademarks of
<a href="http://www.apache.org">The Apache Software Foundation</a>.
</footer>
</div>
</body>
</html>