author Reynold Xin <rxin@databricks.com> 2016-07-26 15:10:23 -0700
committer Reynold Xin <rxin@databricks.com> 2016-07-26 15:10:23 -0700
commit a9ba7a4c65bf97c1762f300cbf5ba1459e59b765
tree 217ea2931f36ffdb586d45925896a467a77ee130
parent 0323eb0787282068ce34de0b953a5c3a5c24e84c
Better release notes for 2.0.0
-rw-r--r-- releases/_posts/2016-07-27-spark-release-2-0-0.md 33
-rw-r--r-- site/releases/spark-release-2-0-0.html 41
2 files changed, 48 insertions, 26 deletions
diff --git a/releases/_posts/2016-07-27-spark-release-2-0-0.md b/releases/_posts/2016-07-27-spark-release-2-0-0.md
index 8b26c0453..eb267cf59 100644
--- a/releases/_posts/2016-07-27-spark-release-2-0-0.md
+++ b/releases/_posts/2016-07-27-spark-release-2-0-0.md
@@ -11,14 +11,19 @@ meta:
_wpas_done_all: '1'
---
-Apache Spark 2.0.0 is the first release on the 2.x line. This release includes over 2500 patches from over 300 contributors. Some breaking changes have been made with respect to the 1.x line. To download Apache Spark 2.0.0, visit the [downloads](http://spark.apache.org/downloads.html) page.
+Apache Spark 2.0.0 is the first release on the 2.x line. This release includes over 2500 patches from over 300 contributors. Spark 2.0.0 builds on what the community has learned in the past two years, with major updates in API usability, SQL 2003 support, performance improvements, structured streaming, R UDF support, as well as operational improvements.
-You can consult JIRA for the [detailed changes](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315420&version=12329449). We have curated a list of high level changes here, grouped by major modules.
+To download Apache Spark 2.0.0, visit the [downloads](http://spark.apache.org/downloads.html) page. You can consult JIRA for the [detailed changes](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315420&version=12329449). We have curated a list of high level changes here, grouped by major modules.
* This will become a table of contents (this text will be scraped).
{:toc}
+### API Stability
+
+Apache Spark 2.0.0 is the first release in the 2.X major line. Spark is guaranteeing stability of its non-experimental APIs for all 2.X releases. Although the APIs have stayed largely similar to 1.X, Spark 2.0.0 does have API breaking changes. They are documented at the end of this release notes.
+
+
### Core and Spark SQL
#### Programming APIs
@@ -94,7 +99,12 @@ Spark 2.0 ships the initial experimental release for Structured Streaming, a hig
For the DStream API, the most prominent update is the new experimental support for Kafka 0.10.
-### Removals, Deprecations and Breaking Changes
+### Operational and Packaging Improvements
+
+There are a variety of improvements to Spark's operations and packaging process. The most prominent change is that Spark 2.0 no longer requires a fat assembly jar for production deployment.
+
+
+### Removals, Behavior Changes and Deprecations
#### Removals
The following features have been removed in Spark 2.0:
@@ -113,15 +123,9 @@ The following features have been removed in Spark 2.0:
- Hash-based shuffle manager
- History serving functionality from standalone Master
- For Java and Scala, DataFrame no longer exists as a class. As a result, data sources would need to be updated.
+- Spark EC2 script has been fully moved to an [external repository hosted by the UC Berkeley AMPLab](https://github.com/amplab/spark-ec2)
-#### Deprecations
-The following features have been deprecated in Spark 2.0, and might be removed in future versions of Spark 2.x:
-
-- Fine-grained mode in Apache Mesos
-- Support for Java 7
-- Support for Python 2.6
-
-#### Breaking Changes
+#### Behavior Changes
The following changes might require updating existing applications that depend on the old behavior or API.
- The default build is now using Scala 2.11 rather than Scala 2.10.
@@ -134,6 +138,13 @@ The following changes might require updating existing applications that depend o
For a more complete list, please see [SPARK-11806](https://issues.apache.org/jira/browse/SPARK-11806) for deprecations and removals.
+#### Deprecations
+The following features have been deprecated in Spark 2.0, and might be removed in future versions of Spark 2.x:
+
+- Fine-grained mode in Apache Mesos
+- Support for Java 7
+- Support for Python 2.6
+
### Known Issues
diff --git a/site/releases/spark-release-2-0-0.html b/site/releases/spark-release-2-0-0.html
index e260a0f11..94ce3bb96 100644
--- a/site/releases/spark-release-2-0-0.html
+++ b/site/releases/spark-release-2-0-0.html
@@ -186,11 +186,12 @@
<h2>Spark Release 2.0.0</h2>
-<p>Apache Spark 2.0.0 is the first release on the 2.x line. This release includes over 2500 patches from over 300 contributors. Some breaking changes have been made with respect to the 1.x line. To download Apache Spark 2.0.0, visit the <a href="http://spark.apache.org/downloads.html">downloads</a> page.</p>
+<p>Apache Spark 2.0.0 is the first release on the 2.x line. This release includes over 2500 patches from over 300 contributors. Spark 2.0.0 builds on what the community has learned in the past two years, with major updates in API usability, SQL 2003 support, performance improvements, structured streaming, R UDF support, as well as operational improvements.</p>
-<p>You can consult JIRA for the <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315420&amp;version=12329449">detailed changes</a>. We have curated a list of high level changes here, grouped by major modules.</p>
+<p>To download Apache Spark 2.0.0, visit the <a href="http://spark.apache.org/downloads.html">downloads</a> page. You can consult JIRA for the <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315420&amp;version=12329449">detailed changes</a>. We have curated a list of high level changes here, grouped by major modules.</p>
<ul id="markdown-toc">
+ <li><a href="#api-stability">API Stability</a></li>
<li><a href="#core-and-spark-sql">Core and Spark SQL</a> <ul>
<li><a href="#programming-apis">Programming APIs</a></li>
<li><a href="#sql">SQL</a></li>
@@ -205,16 +206,21 @@
</li>
<li><a href="#sparkr">SparkR</a></li>
<li><a href="#streaming">Streaming</a></li>
- <li><a href="#removals-deprecations-and-breaking-changes">Removals, Deprecations and Breaking Changes</a> <ul>
+ <li><a href="#operational-and-packaging-improvements">Operational and Packaging Improvements</a></li>
+ <li><a href="#removals-behavior-changes-and-deprecations">Removals, Behavior Changes and Deprecations</a> <ul>
<li><a href="#removals">Removals</a></li>
+ <li><a href="#behavior-changes">Behavior Changes</a></li>
<li><a href="#deprecations">Deprecations</a></li>
- <li><a href="#breaking-changes">Breaking Changes</a></li>
</ul>
</li>
<li><a href="#known-issues">Known Issues</a></li>
<li><a href="#credits">Credits</a></li>
</ul>
+<h3 id="api-stability">API Stability</h3>
+
+<p>Apache Spark 2.0.0 is the first release in the 2.X major line. Spark is guaranteeing stability of its non-experimental APIs for all 2.X releases. Although the APIs have stayed largely similar to 1.X, Spark 2.0.0 does have API breaking changes. They are documented at the end of this release notes.</p>
+
<h3 id="core-and-spark-sql">Core and Spark SQL</h3>
<h4 id="programming-apis">Programming APIs</h4>
@@ -294,7 +300,11 @@ See the MLlib migration guide for a full list of API changes.</p>
<p>For the DStream API, the most prominent update is the new experimental support for Kafka 0.10.</p>
-<h3 id="removals-deprecations-and-breaking-changes">Removals, Deprecations and Breaking Changes</h3>
+<h3 id="operational-and-packaging-improvements">Operational and Packaging Improvements</h3>
+
+<p>There are a variety of improvements to Spark&#8217;s operations and packaging process. The most prominent change is that Spark 2.0 no longer requires a fat assembly jar for production deployment.</p>
+
+<h3 id="removals-behavior-changes-and-deprecations">Removals, Behavior Changes and Deprecations</h3>
<h4 id="removals">Removals</h4>
<p>The following features have been removed in Spark 2.0:</p>
@@ -314,18 +324,10 @@ See the MLlib migration guide for a full list of API changes.</p>
<li>Hash-based shuffle manager</li>
<li>History serving functionality from standalone Master</li>
<li>For Java and Scala, DataFrame no longer exists as a class. As a result, data sources would need to be updated.</li>
+ <li>Spark EC2 script has been fully moved to an <a href="https://github.com/amplab/spark-ec2">external repository hosted by the UC Berkeley AMPLab</a></li>
</ul>
-<h4 id="deprecations">Deprecations</h4>
-<p>The following features have been deprecated in Spark 2.0, and might be removed in future versions of Spark 2.x:</p>
-
-<ul>
- <li>Fine-grained mode in Apache Mesos</li>
- <li>Support for Java 7</li>
- <li>Support for Python 2.6</li>
-</ul>
-
-<h4 id="breaking-changes">Breaking Changes</h4>
+<h4 id="behavior-changes">Behavior Changes</h4>
<p>The following changes might require updating existing applications that depend on the old behavior or API.</p>
<ul>
@@ -339,6 +341,15 @@ See the MLlib migration guide for a full list of API changes.</p>
<p>For a more complete list, please see <a href="https://issues.apache.org/jira/browse/SPARK-11806">SPARK-11806</a> for deprecations and removals.</p>
+<h4 id="deprecations">Deprecations</h4>
+<p>The following features have been deprecated in Spark 2.0, and might be removed in future versions of Spark 2.x:</p>
+
+<ul>
+ <li>Fine-grained mode in Apache Mesos</li>
+ <li>Support for Java 7</li>
+ <li>Support for Python 2.6</li>
+</ul>
+
<h3 id="known-issues">Known Issues</h3>
<ul>