diff options
Diffstat (limited to 'site/releases/spark-release-0-8-0.html')
-rw-r--r-- | site/releases/spark-release-0-8-0.html | 147 |
1 files changed, 74 insertions, 73 deletions
diff --git a/site/releases/spark-release-0-8-0.html b/site/releases/spark-release-0-8-0.html index 1d981a2b3..1005fe589 100644 --- a/site/releases/spark-release-0-8-0.html +++ b/site/releases/spark-release-0-8-0.html @@ -48,6 +48,11 @@ <body> +<script src="https://code.jquery.com/jquery.js"></script> +<script src="//netdna.bootstrapcdn.com/bootstrap/3.0.3/js/bootstrap.min.js"></script> +<script src="/js/lang-tabs.js"></script> +<script src="/js/downloads.js"></script> + <div class="container" style="max-width: 1200px;"> <div class="masthead"> @@ -204,13 +209,13 @@ <li>The examples build has been isolated from the core build, substantially reducing the potential for dependency conflicts.</li> <li>The Spark Streaming Twitter API has been updated to use OAuth authentication instead of the deprecated username/password authentication in Spark 0.7.0.</li> <li>Several new example jobs have been added, including PageRank implementations in Java, Scala and Python, examples for accessing HBase and Cassandra, and MLlib examples.</li> - <li>Support for running on Mesos has been improved – now you can deploy a Spark assembly JAR as part of the Mesos job, instead of having Spark pre-installed on each machine. The default Mesos version has also been updated to 0.13.</li> + <li>Support for running on Mesos has been improved – now you can deploy a Spark assembly JAR as part of the Mesos job, instead of having Spark pre-installed on each machine. The default Mesos version has also been updated to 0.13.</li> <li>This release includes various optimizations to PySpark and to the job scheduler.</li> </ul> <h3 id="compatibility">Compatibility</h3> <ul> - <li><strong>This release changes Spark’s package name to ‘org.apache.spark’</strong>, so those upgrading from Spark 0.7 will need to adjust their imports accordingly. In addition, we’ve moved the <code>RDD</code> class to the org.apache.spark.rdd package (it was previously in the top-level package). The Spark artifacts published through Maven have also changed to the new package name.</li> + <li><strong>This release changes Spark’s package name to ‘org.apache.spark’</strong>, so those upgrading from Spark 0.7 will need to adjust their imports accordingly. In addition, we’ve moved the <code>RDD</code> class to the org.apache.spark.rdd package (it was previously in the top-level package). The Spark artifacts published through Maven have also changed to the new package name.</li> <li>In the Java API, use of Scala’s <code>Option</code> class has been replaced with <code>Optional</code> from the Guava library.</li> <li>Linking against Spark for arbitrary Hadoop versions is now possible by specifying a dependency on <code>hadoop-client</code>, instead of rebuilding <code>spark-core</code> against your version of Hadoop. See the documentation <a href="http://spark.incubator.apache.org/docs/0.8.0/scala-programming-guide.html#linking-with-spark">here</a> for details.</li> <li>If you are building Spark, you’ll now need to run <code>sbt/sbt assembly</code> instead of <code>package</code>.</li> @@ -220,73 +225,73 @@ <p>Spark 0.8.0 was the result of the largest team of contributors yet. The following developers contributed to this release:</p> <ul> - <li>Andrew Ash – documentation, code cleanup and logging improvements</li> - <li>Mikhail Bautin – bug fix</li> - <li>Konstantin Boudnik – Maven build, bug fixes, and documentation</li> - <li>Ian Buss – sbt configuration improvement</li> - <li>Evan Chan – API improvement, bug fix, and documentation</li> - <li>Lian Cheng – bug fix</li> - <li>Tathagata Das – performance improvement in streaming receiver and streaming bug fix</li> - <li>Aaron Davidson – Python improvements, bug fix, and unit tests</li> - <li>Giovanni Delussu – coalesced RDD feature</li> - <li>Joseph E. Gonzalez – improvement to zipPartitions</li> - <li>Karen Feng – several improvements to web UI</li> - <li>Andy Feng – HDFS metrics</li> - <li>Ali Ghodsi – configuration improvements and locality-aware coalesce</li> - <li>Christoph Grothaus – bug fix</li> - <li>Thomas Graves – support for secure YARN cluster and various YARN-related improvements</li> - <li>Stephen Haberman – bug fix, documentation, and code cleanup</li> - <li>Mark Hamstra – bug fixes and Maven build</li> - <li>Benjamin Hindman – Mesos compatibility and documentation</li> - <li>Liang-Chi Hsieh – bug fixes in build and in YARN mode</li> - <li>Shane Huang – shuffle improvements, bug fix</li> - <li>Ethan Jewett – Spark/HBase example</li> - <li>Holden Karau – bug fix and EC2 improvement</li> - <li>Kody Koeniger – JDBV RDD implementation</li> - <li>Andy Konwinski – documentation</li> - <li>Jey Kottalam – PySpark optimizations, Hadoop agnostic build (lead), and bug fixes</li> - <li>Andrey Kouznetsov – Bug fix</li> - <li>S. Kumar – Spark Streaming example</li> - <li>Ryan LeCompte – topK method optimization and serialization improvements</li> - <li>Gavin Li – compression codecs and pipe support</li> - <li>Harold Lim – fair scheduler</li> - <li>Dmitriy Lyubimov – bug fix</li> - <li>Chris Mattmann – Apache mentor</li> - <li>David McCauley – JSON API improvement</li> - <li>Sean McNamara – added <code>takeOrdered</code> function, bug fixes, and a build fix</li> - <li>Mridul Muralidharan – YARN integration (lead) and scheduler improvements</li> - <li>Marc Mercer – improvements to UI json output</li> - <li>Christopher Nguyen – bug fixes</li> - <li>Erik van Oosten – example fix</li> - <li>Kay Ousterhout – fix for scheduler regression and bug fixes</li> - <li>Xinghao Pan – MLLib contributions</li> - <li>Hiral Patel – bug fix</li> - <li>James Phillpotts – updated Twitter API for Spark streaming</li> - <li>Nick Pentreath – scala pageRank example, bagel improvement, and several Java examples</li> - <li>Alexander Pivovarov – logging improvement and Maven build</li> - <li>Mike Potts – configuration improvement</li> - <li>Rohit Rai – Spark/Cassandra example</li> - <li>Imran Rashid – bug fixes and UI improvement</li> - <li>Charles Reiss – bug fixes, code cleanup, performance improvements</li> - <li>Josh Rosen – Python API improvements, Java API improvements, EC2 scripts and bug fixes</li> - <li>Henry Saputra – Apache mentor</li> - <li>Jerry Shao – bug fixes, metrics system</li> - <li>Prashant Sharma – documentation</li> - <li>Mingfei Shi – joblogger and bug fix</li> - <li>Andre Schumacher – several PySpark features</li> - <li>Ginger Smith – MLLib contribution</li> - <li>Evan Sparks – contributions to MLLib</li> - <li>Ram Sriharsha – bug fix and RDD removal feature</li> - <li>Ameet Talwalkar – MLlib contributions</li> - <li>Roman Tkalenko – code refactoring and cleanup</li> - <li>Chu Tong – Java PageRank algorithm and bug fix in bash scripts</li> - <li>Shivaram Venkataraman – bug fixes, contributions to MLLib, netty shuffle fixes, and Java API additions</li> - <li>Patrick Wendell – release manager, bug fixes, documentation, metrics system, and web UI</li> - <li>Andrew Xia – fair scheduler (lead), metrics system, and ui improvements</li> - <li>Reynold Xin – shuffle improvements, bug fixes, code refactoring, usability improvements, MLLib contributions</li> - <li>Matei Zaharia – MLLib contributions, documentation, examples, UI improvements, PySpark improvements, and bug fixes</li> - <li>Wu Zeming – bug fix in scheduler</li> - <li>Bill Zhao – log message improvement</li> + <li>Andrew Ash – documentation, code cleanup and logging improvements</li> + <li>Mikhail Bautin – bug fix</li> + <li>Konstantin Boudnik – Maven build, bug fixes, and documentation</li> + <li>Ian Buss – sbt configuration improvement</li> + <li>Evan Chan – API improvement, bug fix, and documentation</li> + <li>Lian Cheng – bug fix</li> + <li>Tathagata Das – performance improvement in streaming receiver and streaming bug fix</li> + <li>Aaron Davidson – Python improvements, bug fix, and unit tests</li> + <li>Giovanni Delussu – coalesced RDD feature</li> + <li>Joseph E. Gonzalez – improvement to zipPartitions</li> + <li>Karen Feng – several improvements to web UI</li> + <li>Andy Feng – HDFS metrics</li> + <li>Ali Ghodsi – configuration improvements and locality-aware coalesce</li> + <li>Christoph Grothaus – bug fix</li> + <li>Thomas Graves – support for secure YARN cluster and various YARN-related improvements</li> + <li>Stephen Haberman – bug fix, documentation, and code cleanup</li> + <li>Mark Hamstra – bug fixes and Maven build</li> + <li>Benjamin Hindman – Mesos compatibility and documentation</li> + <li>Liang-Chi Hsieh – bug fixes in build and in YARN mode</li> + <li>Shane Huang – shuffle improvements, bug fix</li> + <li>Ethan Jewett – Spark/HBase example</li> + <li>Holden Karau – bug fix and EC2 improvement</li> + <li>Kody Koeniger – JDBV RDD implementation</li> + <li>Andy Konwinski – documentation</li> + <li>Jey Kottalam – PySpark optimizations, Hadoop agnostic build (lead), and bug fixes</li> + <li>Andrey Kouznetsov – Bug fix</li> + <li>S. Kumar – Spark Streaming example</li> + <li>Ryan LeCompte – topK method optimization and serialization improvements</li> + <li>Gavin Li – compression codecs and pipe support</li> + <li>Harold Lim – fair scheduler</li> + <li>Dmitriy Lyubimov – bug fix</li> + <li>Chris Mattmann – Apache mentor</li> + <li>David McCauley – JSON API improvement</li> + <li>Sean McNamara – added <code>takeOrdered</code> function, bug fixes, and a build fix</li> + <li>Mridul Muralidharan – YARN integration (lead) and scheduler improvements</li> + <li>Marc Mercer – improvements to UI json output</li> + <li>Christopher Nguyen – bug fixes</li> + <li>Erik van Oosten – example fix</li> + <li>Kay Ousterhout – fix for scheduler regression and bug fixes</li> + <li>Xinghao Pan – MLLib contributions</li> + <li>Hiral Patel – bug fix</li> + <li>James Phillpotts – updated Twitter API for Spark streaming</li> + <li>Nick Pentreath – scala pageRank example, bagel improvement, and several Java examples</li> + <li>Alexander Pivovarov – logging improvement and Maven build</li> + <li>Mike Potts – configuration improvement</li> + <li>Rohit Rai – Spark/Cassandra example</li> + <li>Imran Rashid – bug fixes and UI improvement</li> + <li>Charles Reiss – bug fixes, code cleanup, performance improvements</li> + <li>Josh Rosen – Python API improvements, Java API improvements, EC2 scripts and bug fixes</li> + <li>Henry Saputra – Apache mentor</li> + <li>Jerry Shao – bug fixes, metrics system</li> + <li>Prashant Sharma – documentation</li> + <li>Mingfei Shi – joblogger and bug fix</li> + <li>Andre Schumacher – several PySpark features</li> + <li>Ginger Smith – MLLib contribution</li> + <li>Evan Sparks – contributions to MLLib</li> + <li>Ram Sriharsha – bug fix and RDD removal feature</li> + <li>Ameet Talwalkar – MLlib contributions</li> + <li>Roman Tkalenko – code refactoring and cleanup</li> + <li>Chu Tong – Java PageRank algorithm and bug fix in bash scripts</li> + <li>Shivaram Venkataraman – bug fixes, contributions to MLLib, netty shuffle fixes, and Java API additions</li> + <li>Patrick Wendell – release manager, bug fixes, documentation, metrics system, and web UI</li> + <li>Andrew Xia – fair scheduler (lead), metrics system, and ui improvements</li> + <li>Reynold Xin – shuffle improvements, bug fixes, code refactoring, usability improvements, MLLib contributions</li> + <li>Matei Zaharia – MLLib contributions, documentation, examples, UI improvements, PySpark improvements, and bug fixes</li> + <li>Wu Zeming – bug fix in scheduler</li> + <li>Bill Zhao – log message improvement</li> </ul> <p>Thanks to everyone who contributed! @@ -311,9 +316,5 @@ We’d especially like to thank Patrick Wendell for acting as the release manage </div> -<script src="https://code.jquery.com/jquery.js"></script> -<script src="//netdna.bootstrapcdn.com/bootstrap/3.0.3/js/bootstrap.min.js"></script> -<script src="/js/lang-tabs.js"></script> - </body> </html> |