diff options
author | MechCoder <manojkumarsivaraj334@gmail.com> | 2015-07-07 08:59:52 -0700 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2015-07-07 08:59:52 -0700 |
commit | 738c10748b49eb8a475d1fd26c6a271ca36497cf (patch) | |
tree | a0d4dd94fdccb12934a40435fbd8fa8c0716b136 /python/pyspark/rddsampler.py | |
parent | 1dbc4a155f3697a3973909806be42a1be6017d12 (diff) | |
download | spark-738c10748b49eb8a475d1fd26c6a271ca36497cf.tar.gz spark-738c10748b49eb8a475d1fd26c6a271ca36497cf.tar.bz2 spark-738c10748b49eb8a475d1fd26c6a271ca36497cf.zip |
[SPARK-8823] [MLLIB] [PYSPARK] Optimizations for SparseVector dot products
Follow up for https://github.com/apache/spark/pull/5946
Currently we iterate over indices and values in SparseVector and can be vectorized.
Author: MechCoder <manojkumarsivaraj334@gmail.com>
Closes #7222 from MechCoder/sparse_optim and squashes the following commits:
dcb51d3 [MechCoder] [SPARK-8823] [MLlib] [PySpark] Optimizations for SparseVector dot product
Diffstat (limited to 'python/pyspark/rddsampler.py')
0 files changed, 0 insertions, 0 deletions