--- layout: post title: Spark Release 1.0.2 categories: [] tags: [] status: publish type: post published: true meta: _edit_last: '4' _wpas_done_all: '1' --- Spark 1.0.2 is a maintenance release with bug fixes. This release is based on the [branch-1.0](https://github.com/apache/spark/tree/branch-1.0) maintenance branch of Spark. We recommend all 1.0.x users to upgrade to this stable release. Contributions to this release came from 30 developers. You can download Spark 1.0.2 as either a source package (6 MB tgz) or a prebuilt package for Hadoop 1 / CDH3 (156 MB tgz), CDH4 (161 MB tgz), or Hadoop 2 / CDH5 / HDP2 (168 MB tgz). Release signatures and checksums are available at the official [Apache download site](http://www.apache.org/dist/spark/spark-1.0.2/). ### Fixes Spark 1.0.2 contains bug fixes in several components. Some of the more important fixes are highlighted below. You can visit the [Spark issue tracker](http://s.apache.org/9NJ) for the full list of fixes. #### Spark Core - Avoid pulling in the entire RDD or PairRDDFunctions in various operators ([SPARK-2534](https://issues.apache.org/jira/browse/SPARK-2534)) - RangePartitioner's binary search does not use the given Ordering ([SPARK-2598](https://issues.apache.org/jira/browse/SPARK-2598)) - Exception in accumulator update should not crash DAGScheduler and SparkContext ([SPARK-2323](https://issues.apache.org/jira/browse/SPARK-2323)) #### SQL - Slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file ([SPARK-2576](https://issues.apache.org/jira/browse/SPARK-2576)) - Concurrent initialization of various DataType objects causes exceptions ([SPARK-2498](https://issues.apache.org/jira/browse/SPARK-2498)) - Multiple instances of an InMemoryRelation in a single plan results in recaching ([SPARK-2405](https://issues.apache.org/jira/browse/SPARK-2405)) #### PySpark - Make hash of None consistant cross machines ([SPARK-2494](https://issues.apache.org/jira/browse/SPARK-2494)) #### MLlib - mutable.BitSet in ALS not serializable with KryoSerializer ([SPARK-1977](https://issues.apache.org/jira/browse/SPARK-1977)) - fix bin offset in DecisionTree node aggregations ([SPARK-2152](https://issues.apache.org/jira/browse/SPARK-2152)) #### Streaming - Ability to limit the Receiver data rate to prevent overloading of data and Spark crash ([SPARK-1341](https://issues.apache.org/jira/browse/SPARK-1341)) - File stream processes existing files in a directory even if newFilesOnly = true ([SPARK-2362](https://issues.apache.org/jira/browse/SPARK-2362)) - QueueInputDStream with oneAtATime=false does not dequeue items ([SPARK-2343](https://issues.apache.org/jira/browse/SPARK-2343)) #### GraphX - VertexPartition is not serializable ([SPARK-2455](https://issues.apache.org/jira/browse/SPARK-2455)) ### Contributors The following developers contributed to this release: * Aaron Davidson - Bug fixes in core * Aaron Staple - Bug fix in SQL * Andrew Or - Bug fix in core * Ankur Dave - Bug fix in GraphX * Artjom-Metro - Bug fix in examples * Basit Mustafa - Added t2 EC2 instance support * Cesar Arevalo - Doc fix * Cheng Hao - Bug fix in SQL * Daniel Darabos - Bug fix in core * Davies Liu - Bug fix in PySpark * Gabriele Nizzoli - Bug fix in Streaming * Hossein - Bug fix in core * Issac Buenrostro - Added support for throttling Streaming receiver * Manuel Laflamme - Bug fix in Streaming * Michael Armbrust - Bug fix and performance improvements in SQL * Neville Li - Bug fix in MLlib * Patrick Wendell - Bug fixes in core * Reynold Xin - Bug fixes in core and SQL * Sarah Gerweck - Bug fix in core * Takuya UESHIN - Bug fixes in SQL * Tathagata Das - Bug fix in Streaming * William Benton - Bug fix in SQL * Yin Huai - Bug fixes in SQL * Zongheng Yang - Bug fixes in SQL * baishuo(白硕) - Bug fix in SQL * johnnywalleye - Bug fixes in MLlib * joyyoj - Bug fix in Streaming * kballou - Doc fix * lianhuiwang - Doc fix * witgo - Bug fix in sbt _Thanks to everyone who contributed!_