diff options
author | tkaessmann <tobias.kaessmann@s24.com> | 2014-11-24 16:40:19 -0800 |
---|---|---|
committer | Xiangrui Meng <meng@databricks.com> | 2014-11-24 16:40:19 -0800 |
commit | 2acbd2884f73c4503d753bb96e0acf75cd237536 (patch) | |
tree | 5018b90a35733f239b060e05e4b84a49d37315a2 | |
parent | 9ea67fc1ddd2aca70f6e2da38ebaf7ebc2398981 (diff) | |
download | spark-2acbd2884f73c4503d753bb96e0acf75cd237536.tar.gz spark-2acbd2884f73c4503d753bb96e0acf75cd237536.tar.bz2 spark-2acbd2884f73c4503d753bb96e0acf75cd237536.zip |
get raw vectors for further processing in Word2Vec
e.g. clustering
Author: tkaessmann <tobias.kaessmann@s24.com>
Closes #3309 from tkaessmann/branch-1.2 and squashes the following commits:
e3a3142 [tkaessmann] changes the comment for getVectors
58d3d83 [tkaessmann] removes sign from comment
a5be213 [tkaessmann] fixes getVectors to fit code guidelines
3782fa9 [tkaessmann] get raw vectors for further processing
-rw-r--r-- | mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala | 7 |
1 files changed, 7 insertions, 0 deletions
diff --git a/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala b/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala index f5f7ad613d..7960f3cab5 100644 --- a/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala +++ b/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala @@ -461,4 +461,11 @@ class Word2VecModel private[mllib] ( .tail .toArray } + + /** + * Returns a map of words to their vector representations. + */ + def getVectors: Map[String, Array[Float]] = { + model + } } |