[SPARK-11998][SQL][TEST-HADOOP2.0] When downloading Hadoop artifacts from maven, we need to try to download the version that is used by Spark - spark

diff options

author	Yin Huai <yhuai@databricks.com>	2015-11-26 16:20:08 -0800
committer	Yin Huai <yhuai@databricks.com>	2015-11-26 16:20:08 -0800
commit	ad76562390b81207f8f32491c0bd8ad0e020141f (patch)
tree	65058912455c4195ac12e389cf6e619f9581ee16 /docs/mllib-linear-methods.md
parent	bc16a67562560c732833260cbc34825f7e9dcb8f (diff)
download	spark-ad76562390b81207f8f32491c0bd8ad0e020141f.tar.gz spark-ad76562390b81207f8f32491c0bd8ad0e020141f.tar.bz2 spark-ad76562390b81207f8f32491c0bd8ad0e020141f.zip

[SPARK-11998][SQL][TEST-HADOOP2.0] When downloading Hadoop artifacts from maven, we need to try to download the version that is used by Spark

If we need to download Hive/Hadoop artifacts, try to download a Hadoop that matches the Hadoop used by Spark. If the Hadoop artifact cannot be resolved (e.g. Hadoop version is a vendor specific version like 2.0.0-cdh4.1.1), we will use Hadoop 2.4.0 (we used to hard code this version as the hadoop that we will download from maven) and we will not share Hadoop classes. I tested this match in my laptop with the following confs (these confs are used by our builds). All tests are good. ``` build/sbt -Phadoop-1 -Dhadoop.version=1.2.1 -Pkinesis-asl -Phive-thriftserver -Phive build/sbt -Phadoop-1 -Dhadoop.version=2.0.0-mr1-cdh4.1.1 -Pkinesis-asl -Phive-thriftserver -Phive build/sbt -Pyarn -Phadoop-2.2 -Pkinesis-asl -Phive-thriftserver -Phive build/sbt -Pyarn -Phadoop-2.3 -Dhadoop.version=2.3.0 -Pkinesis-asl -Phive-thriftserver -Phive ``` Author: Yin Huai <yhuai@databricks.com> Closes #9979 from yhuai/versionsSuite.

Diffstat (limited to 'docs/mllib-linear-methods.md')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: