aboutsummaryrefslogtreecommitdiff
path: root/docs/cluster-overview.md
diff options
context:
space:
mode:
authorXiangrui Meng <meng@databricks.com>2014-03-13 00:43:19 -0700
committerReynold Xin <rxin@apache.org>2014-03-13 00:43:19 -0700
commite4e8d8f395aea48f0cae00d7c381a863c48a2837 (patch)
tree283f03c2f4a7be86eb7d950dd0ee51c375b0f737 /docs/cluster-overview.md
parent4ea23db0efff2f39ac5b8f0bd1d9a6ffa3eceb0d (diff)
downloadspark-e4e8d8f395aea48f0cae00d7c381a863c48a2837.tar.gz
spark-e4e8d8f395aea48f0cae00d7c381a863c48a2837.tar.bz2
spark-e4e8d8f395aea48f0cae00d7c381a863c48a2837.zip
[SPARK-1237, 1238] Improve the computation of YtY for implicit ALS
Computing YtY can be implemented using BLAS's DSPR operations instead of generating y_i y_i^T and then combining them. The latter generates many k-by-k matrices. On the movielens data, this change improves the performance by 10-20%. The algorithm remains the same, verified by computing RMSE on the movielens data. To compare the results, I also added an option to set a random seed in ALS. JIRA: 1. https://spark-project.atlassian.net/browse/SPARK-1237 2. https://spark-project.atlassian.net/browse/SPARK-1238 Author: Xiangrui Meng <meng@databricks.com> Closes #131 from mengxr/als and squashes the following commits: ed00432 [Xiangrui Meng] minor changes d984623 [Xiangrui Meng] minor changes 2fc1641 [Xiangrui Meng] remove commented code 4c7cde2 [Xiangrui Meng] allow specifying a random seed in ALS 200bef0 [Xiangrui Meng] optimize computeYtY and updateBlock
Diffstat (limited to 'docs/cluster-overview.md')
0 files changed, 0 insertions, 0 deletions