author | lewuathe <lewuathe@me.com> | 2015-09-09 09:29:10 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2015-09-09 09:29:10 -0700 |
commit | 2ddeb63126d26149eda197e85b7b26ef16a6e97c (patch) | |
tree | 6d208f63c719af95e7392b4dcf3cd21949301e0e /mllib/src/main/resources/META-INF/services/org.apache.spark.sql.sources.DataSourceRegister | |
parent | c1bc4f439f54625c01a585691e5293cd9961eb0c (diff) | |
[SPARK-10117] [MLLIB] Implement SQL data source API for reading LIBSVM data
Implementing the data source API for the LIBSVM format gives better integration with DataFrames and the ML pipeline API.
Two options are implemented:
* `numFeatures`: Specifies the dimension of the feature vector.
* `featuresType`: Specifies the type of the output vector. `sparse` is the default.
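To illustrate what the two options control, here is a minimal, self-contained sketch of reading one LIBSVM line (`label index:value ...`, with one-based indices) and producing either sparse index/value pairs or a dense array of a given length. The class `LibSvmLineSketch`, the holder `Row`, and the methods `parseLine` and `toDense` are hypothetical names chosen for this example; this is not the code added by the commit.

```java
import java.util.Arrays;

public class LibSvmLineSketch {
    // Hypothetical holder for one parsed line; not a Spark class.
    static final class Row {
        final double label;
        final int[] indices;    // zero-based positions of the listed features
        final double[] values;  // values at those positions

        Row(double label, int[] indices, double[] values) {
            this.label = label;
            this.indices = indices;
            this.values = values;
        }

        // Expands the sparse pairs into a dense array of length numFeatures,
        // which is roughly what a "dense" output type would hold.
        double[] toDense(int numFeatures) {
            double[] dense = new double[numFeatures];
            for (int i = 0; i < indices.length; i++) {
                dense[indices[i]] = values[i];
            }
            return dense;
        }
    }

    // Parses "label index:value index:value ..." where indices are one-based.
    static Row parseLine(String line) {
        String[] tokens = line.trim().split("\\s+");
        double label = Double.parseDouble(tokens[0]);
        int n = tokens.length - 1;
        int[] indices = new int[n];
        double[] values = new double[n];
        for (int i = 0; i < n; i++) {
            String[] pair = tokens[i + 1].split(":");
            indices[i] = Integer.parseInt(pair[0]) - 1; // shift to zero-based
            values[i] = Double.parseDouble(pair[1]);
        }
        return new Row(label, indices, values);
    }

    public static void main(String[] args) {
        Row row = parseLine("1.0 1:0.5 3:2.0");
        System.out.println(row.label);                        // 1.0
        System.out.println(Arrays.toString(row.indices));     // [0, 2]
        System.out.println(Arrays.toString(row.toDense(4)));  // [0.5, 0.0, 2.0, 0.0]
    }
}
```

With Spark itself, the options would presumably be passed through the DataFrame reader, along the lines of `sqlContext.read().format("org.apache.spark.ml.source.libsvm").option("numFeatures", "4").load(path)`, where `path` is a placeholder.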
Author: lewuathe <lewuathe@me.com>
Closes #8537 from Lewuathe/SPARK-10117 and squashes the following commits:
986999d [lewuathe] Change unit test phrase
11d513f [lewuathe] Fix some reviews
21600a4 [lewuathe] Merge branch 'master' into SPARK-10117
9ce63c7 [lewuathe] Rewrite service loader file
1fdd2df [lewuathe] Merge branch 'SPARK-10117' of github.com:Lewuathe/spark into SPARK-10117
ba3657c [lewuathe] Merge branch 'master' into SPARK-10117
0ea1c1c [lewuathe] LibSVMRelation is registered into META-INF
4f40891 [lewuathe] Improve test suites
5ab62ab [lewuathe] Merge branch 'master' into SPARK-10117
8660d0e [lewuathe] Fix Java unit test
b56a948 [lewuathe] Merge branch 'master' into SPARK-10117
2c12894 [lewuathe] Remove unnecessary tag
7d693c2 [lewuathe] Resolv conflict
62010af [lewuathe] Merge branch 'master' into SPARK-10117
a97ee97 [lewuathe] Fix some points
aef9564 [lewuathe] Fix
70ee4dd [lewuathe] Add Java test
3fd8dce [lewuathe] [SPARK-10117] Implement SQL data source API for reading LIBSVM data
40d3027 [lewuathe] Add Java test
7056d4a [lewuathe] Merge branch 'master' into SPARK-10117
99accaa [lewuathe] [SPARK-10117] Implement SQL data source API for reading LIBSVM data
Diffstat (limited to 'mllib/src/main/resources/META-INF/services/org.apache.spark.sql.sources.DataSourceRegister')
-rw-r--r-- | mllib/src/main/resources/META-INF/services/org.apache.spark.sql.sources.DataSourceRegister | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/mllib/src/main/resources/META-INF/services/org.apache.spark.sql.sources.DataSourceRegister b/mllib/src/main/resources/META-INF/services/org.apache.spark.sql.sources.DataSourceRegister
new file mode 100644
index 0000000000..f632dd603c
--- /dev/null
+++ b/mllib/src/main/resources/META-INF/services/org.apache.spark.sql.sources.DataSourceRegister
@@ -0,0 +1 @@
+org.apache.spark.ml.source.libsvm.DefaultSource
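The one-line file above follows the `java.util.ServiceLoader` provider-configuration convention: the file is named after an interface and lists implementing classes, one per line. The sketch below shows that discovery mechanism in isolation. `ServiceFileSketch`, `Register`, and `DefaultSource` are hypothetical stand-ins defined here (not the Spark classes), and the `"libsvm"` short name is an assumption made for the example.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.URL;
import java.net.URLClassLoader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.ServiceLoader;

public class ServiceFileSketch {
    // Hypothetical stand-in for the interface named by the service file.
    public interface Register {
        String shortName();
    }

    // Hypothetical stand-in for the provider class listed inside the file.
    public static class DefaultSource implements Register {
        public DefaultSource() {}

        @Override
        public String shortName() {
            return "libsvm"; // assumed name, for illustration only
        }
    }

    // Writes a provider-configuration file into a temporary directory and
    // returns the short name of every provider ServiceLoader finds through it.
    static List<String> discover() {
        try {
            Path root = Files.createTempDirectory("services-sketch");
            Path services = Files.createDirectories(
                    root.resolve("META-INF").resolve("services"));
            // File name = the interface's binary name;
            // file content = the provider's binary name.
            Files.write(services.resolve(Register.class.getName()),
                    DefaultSource.class.getName().getBytes(StandardCharsets.UTF_8));

            List<String> names = new ArrayList<>();
            try (URLClassLoader loader = new URLClassLoader(
                    new URL[] {root.toUri().toURL()},
                    ServiceFileSketch.class.getClassLoader())) {
                for (Register r : ServiceLoader.load(Register.class, loader)) {
                    names.add(r.shortName());
                }
            }
            return names;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(discover());
    }
}
```

Registering the class this way means callers need no explicit reference to the implementation; placing the file on the classpath is enough for it to be found.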