[SPARK-9593] [SQL] [HOTFIX] Makes the Hadoop shims loading fix more robust - spark

diff options

author	Cheng Lian <lian@databricks.com>	2015-08-06 09:53:53 -0700
committer	Yin Huai <yhuai@databricks.com>	2015-08-06 09:53:53 -0700
commit	9f94c85ff35df6289371f80edde51c2aa6c4bcdc (patch)
tree	9858091ca33d98022573ed721ab165ee3dbd2ba9 /python/pyspark/ml/feature.py
parent	93085c992e40dbc06714cb1a64c838e25e683a6f (diff)
download	spark-9f94c85ff35df6289371f80edde51c2aa6c4bcdc.tar.gz spark-9f94c85ff35df6289371f80edde51c2aa6c4bcdc.tar.bz2 spark-9f94c85ff35df6289371f80edde51c2aa6c4bcdc.zip

[SPARK-9593] [SQL] [HOTFIX] Makes the Hadoop shims loading fix more robust

This is a follow-up of #7929. We found that Jenkins SBT master build still fails because of the Hadoop shims loading issue. But the failure doesn't appear to be deterministic. My suspect is that Hadoop `VersionInfo` class may fail to inspect Hadoop version, and the shims loading branch is skipped. This PR tries to make the fix more robust: 1. When Hadoop version is available, we load `Hadoop20SShims` for versions <= 2.0.x as srowen suggested in PR #7929. 2. Otherwise, we use `Path.getPathWithoutSchemeAndAuthority` as a probe method, which doesn't exist in Hadoop 1.x or 2.0.x. If this method is not found, `Hadoop20SShims` is also loaded. Author: Cheng Lian <lian@databricks.com> Closes #7994 from liancheng/spark-9593/fix-hadoop-shims and squashes the following commits: e1d3d70 [Cheng Lian] Fixes typo in comments 8d971da [Cheng Lian] Makes the Hadoop shims loading fix more robust

Diffstat (limited to 'python/pyspark/ml/feature.py')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: