---
layout: post
title: Spark Release 0.9.2
categories: []
tags: []
status: publish
type: post
published: true
meta:
_edit_last: '4'
_wpas_done_all: '1'
---
Spark 0.9.2 is a maintenance release with bug fixes. This release is based on the [branch-0.9](https://github.com/apache/spark/tree/branch-0.9) maintenance branch of Spark. We recommend all 0.9.x users to upgrade to this stable release. Contributions to this release came from 28 developers.
You can download Spark 0.9.2 as either a
source package
(6 MB tgz) or a prebuilt package for
Hadoop 1 / CDH3 (156 MB tgz),
CDH4 (161 MB tgz), or
Hadoop 2 / CDH5 / HDP2
(168 MB tgz). Release signatures and checksums are available at the official [Apache download site](http://www.apache.org/dist/spark/spark-0.9.2/).
### Fixes
Spark 0.9.2 contains bug fixes in several components. Some of the more important fixes are highlighted below. You can visit the [Spark issue tracker](http://s.apache.org/d0t) for the full list of fixes.
#### Spark Core
- ExternalAppendOnlyMap doesn't always find matching keys. ([SPARK-2043](https://issues.apache.org/jira/browse/SPARK-2043))
- Jobs hang due to akka frame size settings. ([SPARK-1112](https://issues.apache.org/jira/browse/SPARK-1112), [SPARK-2156](https://issues.apache.org/jira/browse/SPARK-2156))
- HDFS FileSystems continually pile up in the FS cache. ([SPARK-1676](https://issues.apache.org/jira/browse/SPARK-1676))
- Unneeded lock in ShuffleMapTask.deserializeInfo. ([SPARK-1775](https://issues.apache.org/jira/browse/SPARK-1775))
- Secondary jars are not added to executor classpath for YARN. ([SPARK-1870](https://issues.apache.org/jira/browse/SPARK-1870))
#### PySpark
- IPython won't run standalone Python script. ([SPARK-1134](https://issues.apache.org/jira/browse/SPARK-1134))
- The hash method used by partitionBy doesn't deal with None correctly. ([SPARK-1468](https://issues.apache.org/jira/browse/SPARK-1468))
- PySpark crashes if too many tasks complete quickly. ([SPARK-2282](https://issues.apache.org/jira/browse/SPARK-2282))
#### MLlib
- Make MLlib work on Python 2.6. ([SPARK-1421](https://issues.apache.org/jira/browse/SPARK-1421))
- Fix PySpark's Naive Bayes implementation. ([SPARK-2433](https://issues.apache.org/jira/browse/SPARK-2433))
#### Streaming
- SparkFlumeEvent with body bigger than 1020 bytes are not read properly. ([SPARK-1916](https://issues.apache.org/jira/browse/SPARK-1916))
#### GraphX
- GraphX triplets not working properly. ([SPARK-1188](https://issues.apache.org/jira/browse/SPARK-1188))
### Contributors
The following developers contributed to this release:
* Aaron Davidson - bug fix and optimization
* Anant Daksh Asthana - improvement
* Daniel Darabos - bug fix
* David Lemieux - bug fix
* Davis Shepherd - bug fix
* DB Tsai - bug fix
* Diana Carroll - bug fix
* Erik Selin - bug fix
* Gabriele Nizzoli - bug fix
* Guoqiang Li - bug fix
* John Zhao - improvement
* Mark Hamstra - bug fix
* Matei Zaharia - bug fix and improvement
* Nan Zhu - bug fix
* Nick Lanham - bug fix
* Ori Kremer - bug fix
* Patrick Wendell - bug fixes
* Prashant Sharma - new feature
* Sam Sun - bug fix
* Sandeep Singh - bug fix
* Shuo Bai - improvement
* Sujeet Varakhedi - improvement
* Tathagata Das - bug fixes and documentation fix
* Thomas Graves - bug fixes
* Uri Laserson - bug fix
* Wenchen Fan - bug fix
* Xiangrui Meng - bug fixes and release manager
* Yin Huai - bug fix
_Thanks to everyone who contributed!_