diff options
author | Xiangrui Meng <meng@databricks.com> | 2015-05-28 16:32:51 -0700 |
---|---|---|
committer | Joseph K. Bradley <joseph@databricks.com> | 2015-05-28 16:32:51 -0700 |
commit | 7859ab659eecbcf2d8b9a274a4e9e4f5186a528c (patch) | |
tree | dc968849ed71a2aeb02a8e3d0b969785ef4607d3 /unsafe | |
parent | 3e312a5ed0154527c66eeeee0d2cc3bfce0a820e (diff) | |
download | spark-7859ab659eecbcf2d8b9a274a4e9e4f5186a528c.tar.gz spark-7859ab659eecbcf2d8b9a274a4e9e4f5186a528c.tar.bz2 spark-7859ab659eecbcf2d8b9a274a4e9e4f5186a528c.zip |
[SPARK-7198] [MLLIB] VectorAssembler should output ML attributes
`VectorAssembler` should carry over ML attributes. For unknown attributes, we assume numeric values. This PR handles the following cases:
1. DoubleType with ML attribute: carry over
2. DoubleType without ML attribute: numeric value
3. Scalar type: numeric value
4. VectorType with all ML attributes: carry over and update names
5. VectorType with number of ML attributes: assume all numeric
6. VectorType without ML attributes: check the first row and get the number of attributes
jkbradley
Author: Xiangrui Meng <meng@databricks.com>
Closes #6452 from mengxr/SPARK-7198 and squashes the following commits:
a9d2469 [Xiangrui Meng] add space
facdb1f [Xiangrui Meng] VectorAssembler should output ML attributes
Diffstat (limited to 'unsafe')
0 files changed, 0 insertions, 0 deletions