diff options
author | lee19 <lee19@live.co.kr> | 2015-06-30 14:08:00 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2015-06-30 14:08:00 -0700 |
commit | e72526227fdcf93b7a33375ef954746ac08753f5 (patch) | |
tree | 7d34d9b23897c088254cd83c753b7b0ed0ded3f6 /core | |
parent | 8c898964f095fcb5bb1c9212e1e484b1eb55c296 (diff) | |
download | spark-e72526227fdcf93b7a33375ef954746ac08753f5.tar.gz spark-e72526227fdcf93b7a33375ef954746ac08753f5.tar.bz2 spark-e72526227fdcf93b7a33375ef954746ac08753f5.zip |
[SPARK-8563] [MLLIB] Fixed a bug so that IndexedRowMatrix.computeSVD().U.numCols = k
I'm sorry that I made https://github.com/apache/spark/pull/6949 closed by mistake.
I pushed codes again.
And, I added a test code.
>
There is a bug that `U.numCols() = self.nCols` in `IndexedRowMatrix.computeSVD()`
It should have been `U.numCols() = k = svd.U.numCols()`
>
```
self = U * sigma * V.transpose
(m x n) = (m x n) * (k x k) * (k x n) //ASIS
-->
(m x n) = (m x k) * (k x k) * (k x n) //TOBE
```
Author: lee19 <lee19@live.co.kr>
Closes #6953 from lee19/MLlibBugfix and squashes the following commits:
c1812a0 [lee19] [SPARK-8563] [MLlib] Used nRows instead of numRows() to reduce a burden.
4b9803b [lee19] [SPARK-8563] [MLlib] Fixed a build error.
c2ccd89 [lee19] Added a unit test that validates matrix sizes of svd for [SPARK-8563][MLlib]
8373424 [lee19] [SPARK-8563][MLlib] Fixed a bug so that IndexedRowMatrix.computeSVD().U.numCols = k
Diffstat (limited to 'core')
0 files changed, 0 insertions, 0 deletions