aboutsummaryrefslogtreecommitdiff
path: root/bin
diff options
context:
space:
mode:
authorDavies Liu <davies@databricks.com>2014-11-24 16:37:14 -0800
committerXiangrui Meng <meng@databricks.com>2014-11-24 16:37:23 -0800
commit9ea67fc1ddd2aca70f6e2da38ebaf7ebc2398981 (patch)
tree1bcc6448caa79b81f1e67f7b43d39f6d474636f0 /bin
parent6fa3e415d419ee9b2f3d14106a714b627e251e7d (diff)
downloadspark-9ea67fc1ddd2aca70f6e2da38ebaf7ebc2398981.tar.gz
spark-9ea67fc1ddd2aca70f6e2da38ebaf7ebc2398981.tar.bz2
spark-9ea67fc1ddd2aca70f6e2da38ebaf7ebc2398981.zip
[SPARK-4562] [MLlib] speedup vector
This PR change the underline array of DenseVector to numpy.ndarray to avoid the conversion, because most of the users will using numpy.array. It also improve the serialization of DenseVector. Before this change: trial | trainingTime | testTime -------|--------|-------- 0 | 5.126 | 1.786 1 |2.698 |1.693 After the change: trial | trainingTime | testTime -------|--------|-------- 0 |4.692 |0.554 1 |2.307 |0.525 This could partially fix the performance regression during test. Author: Davies Liu <davies@databricks.com> Closes #3420 from davies/ser2 and squashes the following commits: 0e1e6f3 [Davies Liu] fix tests 426f5db [Davies Liu] impove toArray() 44707ec [Davies Liu] add name for ISO-8859-1 fa7d791 [Davies Liu] address comments 1cfb137 [Davies Liu] handle zero sparse vector 2548ee2 [Davies Liu] fix tests 9e6389d [Davies Liu] bugfix 470f702 [Davies Liu] speed up DenseMatrix f0d3c40 [Davies Liu] speedup SparseVector ef6ce70 [Davies Liu] speed up dense vector (cherry picked from commit b660de7a9cbdea3df4a37fbcf60c1c33c71782b8) Signed-off-by: Xiangrui Meng <meng@databricks.com>
Diffstat (limited to 'bin')
0 files changed, 0 insertions, 0 deletions