author tkaessmann <tobias.kaessmann@s24.com> 2014-11-24 16:40:19 -0800
committer Xiangrui Meng <meng@databricks.com> 2014-11-24 16:40:19 -0800
commit 2acbd2884f73c4503d753bb96e0acf75cd237536
tree 5018b90a35733f239b060e05e4b84a49d37315a2
parent 9ea67fc1ddd2aca70f6e2da38ebaf7ebc2398981
get raw vectors for further processing in Word2Vec

e.g. clustering

Author: tkaessmann <tobias.kaessmann@s24.com>

Closes #3309 from tkaessmann/branch-1.2 and squashes the following commits:

e3a3142 [tkaessmann] changes the comment for getVectors
58d3d83 [tkaessmann] removes sign from comment
a5be213 [tkaessmann] fixes getVectors to fit code guidelines
3782fa9 [tkaessmann] get raw vectors for further processing

-rw-r--r-- mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala | 7
1 file changed, 7 insertions, 0 deletions
diff --git a/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala b/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala
index f5f7ad613d..7960f3cab5 100644
--- a/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala
+++ b/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala
@@ -461,4 +461,11 @@ class Word2VecModel private[mllib] (
       .tail
       .toArray
   }
+
+  /**
+   * Returns a map of words to their vector representations.
+   */
+  def getVectors: Map[String, Array[Float]] = {
+    model
+  }
 }
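
The commit message mentions further processing of the raw vectors. As a rough illustration of what a caller might do with a map of the type getVectors returns, the sketch below computes cosine similarity between two words. It is not part of this patch: the object name GetVectorsSketch, the helper cosine, and the toy vectors are all hypothetical, and the hand-written map stands in for the result of calling getVectors on a trained Word2VecModel so that the example runs without Spark.

```scala
object GetVectorsSketch {
  // Cosine similarity between two vectors of equal dimension.
  // Returns 0.0 if either vector has zero length (norm).
  def cosine(a: Array[Float], b: Array[Float]): Double = {
    require(a.length == b.length, "vectors must have the same dimension")
    val dot = a.zip(b).map { case (x, y) => x.toDouble * y }.sum
    val normA = math.sqrt(a.map(x => x.toDouble * x).sum)
    val normB = math.sqrt(b.map(x => x.toDouble * x).sum)
    if (normA == 0.0 || normB == 0.0) 0.0 else dot / (normA * normB)
  }

  // Hypothetical toy data with the same type as getVectors' return value.
  // With a real model this would be: val vectors = model.getVectors
  val vectors: Map[String, Array[Float]] = Map(
    "king"  -> Array(1.0f, 0.0f),
    "queen" -> Array(1.0f, 0.0f),
    "apple" -> Array(0.0f, 1.0f)
  )

  def main(args: Array[String]): Unit = {
    println(cosine(vectors("king"), vectors("queen")))
    println(cosine(vectors("king"), vectors("apple")))
  }
}
```

Because the method hands back a plain Scala Map, the vectors can be passed to any downstream code (clustering, nearest-neighbour search, export) without going through findSynonyms.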