---
layout: post
title: Spark Release 1.4.1
categories: []
tags: []
status: publish
type: post
published: true
meta:
_edit_last: '4'
_wpas_done_all: '1'
---
Spark 1.4.1 is a maintenance release containing stability fixes. This release is based on the [branch-1.4](https://github.com/apache/spark/tree/branch-1.4) maintenance branch of Spark. We recommend that all 1.4.0 users upgrade to this stable release. 85 developers contributed to this release.
To download Spark 1.4.1, visit the <a href="{{site.url}}downloads.html">downloads</a> page.
### Fixes
Spark 1.4.1 contains several bug fixes in Spark SQL and assorted fixes in other components. Some of the more important fixes are highlighted below. You can visit the [Spark issue tracker](https://issues.apache.org/jira/issues/?jql=project%20%3D%20SPARK%20AND%20fixVersion%20%3D%201.4.1%20ORDER%20BY%20priority%2C%20component) for the full list of fixes.
#### MLlib
* [SPARK-8151](https://issues.apache.org/jira/browse/SPARK-8151): Pipeline components should correctly implement copy
* [SPARK-8468](https://issues.apache.org/jira/browse/SPARK-8468): Some metrics in RegressionEvaluator should have negative sign
* [SPARK-8736](https://issues.apache.org/jira/browse/SPARK-8736): GBTRegressionModel shouldn’t threshold predictions
* [SPARK-8563](https://issues.apache.org/jira/browse/SPARK-8563): IndexedRowMatrix.computeSVD() yields the U with wrong numCols
#### PySpark
* [SPARK-8202](https://issues.apache.org/jira/browse/SPARK-8202): Infinite loop during external sort
* [SPARK-8573](https://issues.apache.org/jira/browse/SPARK-8573): Trigger exceptions when invalid operators are used
* [SPARK-8766](https://issues.apache.org/jira/browse/SPARK-8766): Support non-ASCII characters in columns
#### Data Sources and DataFrames
* [SPARK-8804](https://issues.apache.org/jira/browse/SPARK-8804): Order of UTF8String is not consistent with String if it contains any non-ASCII character
* [SPARK-8406](https://issues.apache.org/jira/browse/SPARK-8406): Race condition when writing Parquet files
* [SPARK-8329](https://issues.apache.org/jira/browse/SPARK-8329): DataSource options parser no longer accepts '_'
* [SPARK-8368](https://issues.apache.org/jira/browse/SPARK-8368): ClassNotFoundException in closure for map
* [SPARK-8470](https://issues.apache.org/jira/browse/SPARK-8470): MissingRequirementError for ScalaReflection on user classes
* [SPARK-8358](https://issues.apache.org/jira/browse/SPARK-8358): DataFrame explode with alias and * fails
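
SPARK-8804 above concerns string ordering. As a rough illustration only, the sketch below shows one way a byte-wise comparison of UTF-8 data can disagree with ordinary string comparison: reading each byte as a signed value. This is an assumption about the general failure mode, not code taken from Spark. The function names are hypothetical, and the ticket remains the authority on the actual cause and fix.

```python
# Illustrative sketch only; none of these helpers exist in Spark.

def _as_signed(byte: int) -> int:
    """Reinterpret an unsigned byte value (0..255) as signed (-128..127)."""
    return byte - 256 if byte > 127 else byte


def compare_signed(a: str, b: str) -> int:
    """Compare UTF-8 encodings byte by byte, treating each byte as signed."""
    ab, bb = a.encode("utf-8"), b.encode("utf-8")
    for x, y in zip(ab, bb):
        if x != y:
            return _as_signed(x) - _as_signed(y)
    return len(ab) - len(bb)


def compare_unsigned(a: str, b: str) -> int:
    """Compare UTF-8 encodings byte by byte, treating each byte as unsigned."""
    ab, bb = a.encode("utf-8"), b.encode("utf-8")
    for x, y in zip(ab, bb):
        if x != y:
            return x - y
    return len(ab) - len(bb)


def compare_code_points(a: str, b: str) -> int:
    """Ordinary Python string comparison, which orders by code point."""
    return (a > b) - (a < b)


if __name__ == "__main__":
    # "é" encodes to the bytes 0xC3 0xA9, both above 0x7F, so a signed
    # reading makes them negative and sorts "é" before "a".
    for name, cmp in [
        ("signed bytes", compare_signed),
        ("unsigned bytes", compare_unsigned),
        ("code points", compare_code_points),
    ]:
        print(name, "a < é:", cmp("a", "é") < 0)
```

For pure ASCII input all three comparisons agree, which is why a problem of this kind would only show up once a non-ASCII character is present.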
### Contributors
The following developers contributed to this release:
* Adam Roberts -- Bug fixes in Core and tests
* Ai He -- Improvements in PySpark
* Alok Singh -- Bug fixes in Core
* Amey Chaugule -- Improvement in SQL
* Andrew Or -- Bug fixes in Core, tests, Shuffle, Spark Submit, and SQL; improvements in Web UI and tests
* Animesh Baranawal -- Improvement in SQL
* Ben Fradet -- Bug fixes in Streaming and Web UI
* Brian Lockwood -- Documentation in Core
* Burak Yavuz -- Tests in SQL; improvements in Spark Submit and SQL; bug fixes in Spark Submit and SQL
* Carson Wang -- Bug fixes in Core and Web UI
* Cheng Hao -- Bug fixes in SQL
* Cheng Lian -- Documentation in Core; bug fixes in SQL
* Cheolsoo Park -- Improvement in SQL
* Chris Freeman -- Bug fixes in R
* Christian Kadner -- Bug fixes in SQL
* Cody Koeninger -- Improvements in Streaming
* DB Tsai -- New features in MLlib
* Daniel Darabos -- Bug fixes in EC2
* Daoyuan Wang -- Bug fixes in SQL
* Davies Liu -- Bug fixes in PySpark; improvements in SQL and PySpark
* Devaraj K -- Bug fixes in YARN
* Dibyendu Bhattacharya -- Bug fixes in Streaming and Web UI
* Favio Vazquez -- Documentation in Core and MLlib
* Hari Shreedharan -- Bug fixes in Streaming and Core
* Holden Karau -- Documentation in Core; bug fixes in SparkR, Shuffle, and MLlib
* Hossein Falaki -- Improvements and bug fixes in SparkR
* Huang Zhaowei -- Bug fixes in Streaming and YARN
* Jean Lyn -- Bug fixes in SQL
* Joseph K. Bradley -- Improvements in Core and MLlib; bug fixes in MLlib
* Josh Rosen -- New features in SQL; bug fixes in scheduler and SQL; improvement in Core
* Kevin Conor -- Bug fixes in PySpark
* Kousuke Saruta -- Documentation in Core
* Lars Francke -- Documentation in Core
* Lee -- Bug fixes in MLlib
* Liang-Chi Hsieh -- Improvements in SQL; bug fixes in MLlib and SQL
* Lianhui Wang -- Bug fixes in Shuffle
* Luca Martinetti -- Documentation in Core
* Manoj Kumar -- Improvements in SQL; bug fixes in MLlib and PySpark
* Marcelo Vanzin -- Improvements in Core; bug fixes in Core and Web UI
* Mark Smith -- Bug fixes in EC2
* Michael Armbrust -- Bug fixes in SQL
* Mike Dusenberry -- Improvements in PySpark, Core, SQL, and Streaming
* Moussa Taifi -- Documentation in Core
* Nathan Howell -- Bug fixes in SQL
* Navis Ryu -- Bug fixes in SQL
* Neelesh Srinivas Salian -- Documentation in Core and YARN
* Oleksiy Dyagilev -- Bug fixes in Core and MLlib
* Paavo Parkkinen -- Bug fixes in Streaming and MLlib
* Patrick Wendell -- Improvements in Core
* Punya Biswal -- Bug fixes in Core
* Qian Huang -- New features in SparkR
* Radek Ostrowski -- Documentation in Core
* Ram Sriharsha -- New features in MLlib and PySpark
* Rekha Joshi -- Improvements in Web UI
* Reynold Xin -- Bug fixes and improvements in SQL
* Rosstin Murphy -- Documentation in Core
* Ryan Williams -- Bug fixes in Core
* Saisai Shao -- Bug fixes in Core and Streaming
* Scott Taylor -- Test in PySpark
* Sean Owen -- Improvements and bug fixes in Core
* Sephiroth Lin -- Improvements in YARN
* Shiming Fei -- Bug fixes in Core
* Shivaram Venkataraman -- Improvements in Core, EC2, and SparkR; bug fixes in SparkR and EC2
* Shixiong Zhu -- Test in Streaming, tests, and Core; improvements in Streaming and Core; bug fixes in Core, tests, PySpark, and Streaming
* Simon Hafner -- Bug fixes in EC2
* Sun Rui -- Bug fixes in Core and SparkR
* Tao Li -- Bug fixes in YARN
* Tathagata Das -- Improvements in Core and Streaming; bug fixes in Core
* Ted Blackman -- Bug fixes in PySpark
* Ted Yu -- Bug fixes in SQL
* Tim Ellison -- Bug fixes in Core
* Timothy Chen -- Bug fixes in Web UI and Mesos
* Tingjun Xu -- Improvements in Core
* Tom Graves -- Bug fixes in Core
* Vinod K C -- Improvements and bug fixes in SQL
* Vyacheslav Baranov -- Bug fixes in Core
* Wang Tao -- Bug fixes in Core and YARN
* Wenchen Fan -- Improvement in SQL
* Xiangrui Meng -- Bug fixes in MLlib
* Yanbo Liang -- Improvements in Core, MLlib, and PySpark
* Yin Huai -- Improvements in Core, tests, and SQL; bug fixes in tests and SQL
* Yu ISHIKAWA -- Improvements in PySpark and SparkR
* Yuhao Yang -- Bug fixes in MLlib
* Yuming Wang -- GitHub integration in Web UI
* Yuri Saito -- Bug fixes in PySpark
_Thanks to everyone who contributed!_