aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/ml/feature.py
diff options
context:
space:
mode:
authorCheng Lian <lian@databricks.com>2015-08-06 09:53:53 -0700
committerYin Huai <yhuai@databricks.com>2015-08-06 09:53:53 -0700
commit9f94c85ff35df6289371f80edde51c2aa6c4bcdc (patch)
tree9858091ca33d98022573ed721ab165ee3dbd2ba9 /python/pyspark/ml/feature.py
parent93085c992e40dbc06714cb1a64c838e25e683a6f (diff)
downloadspark-9f94c85ff35df6289371f80edde51c2aa6c4bcdc.tar.gz
spark-9f94c85ff35df6289371f80edde51c2aa6c4bcdc.tar.bz2
spark-9f94c85ff35df6289371f80edde51c2aa6c4bcdc.zip
[SPARK-9593] [SQL] [HOTFIX] Makes the Hadoop shims loading fix more robust
This is a follow-up of #7929. We found that Jenkins SBT master build still fails because of the Hadoop shims loading issue. But the failure doesn't appear to be deterministic. My suspect is that Hadoop `VersionInfo` class may fail to inspect Hadoop version, and the shims loading branch is skipped. This PR tries to make the fix more robust: 1. When Hadoop version is available, we load `Hadoop20SShims` for versions <= 2.0.x as srowen suggested in PR #7929. 2. Otherwise, we use `Path.getPathWithoutSchemeAndAuthority` as a probe method, which doesn't exist in Hadoop 1.x or 2.0.x. If this method is not found, `Hadoop20SShims` is also loaded. Author: Cheng Lian <lian@databricks.com> Closes #7994 from liancheng/spark-9593/fix-hadoop-shims and squashes the following commits: e1d3d70 [Cheng Lian] Fixes typo in comments 8d971da [Cheng Lian] Makes the Hadoop shims loading fix more robust
Diffstat (limited to 'python/pyspark/ml/feature.py')
0 files changed, 0 insertions, 0 deletions