aboutsummaryrefslogtreecommitdiff
path: root/data
diff options
context:
space:
mode:
authorXiangrui Meng <meng@databricks.com>2015-02-08 23:40:36 -0800
committerXiangrui Meng <meng@databricks.com>2015-02-08 23:40:44 -0800
commit5782ee29eb273b1f87a07fd624bbf228d2597b98 (patch)
tree211ebbc8afa0cf37ecc6f96b6502d7de828ccf91 /data
parent955f2863e39a96c0b00ad7d3eac972bb1cfcb594 (diff)
downloadspark-5782ee29eb273b1f87a07fd624bbf228d2597b98.tar.gz
spark-5782ee29eb273b1f87a07fd624bbf228d2597b98.tar.bz2
spark-5782ee29eb273b1f87a07fd624bbf228d2597b98.zip
[SPARK-5539][MLLIB] LDA guide
This is the LDA user guide from jkbradley with Java and Scala code example. Author: Xiangrui Meng <meng@databricks.com> Author: Joseph K. Bradley <joseph@databricks.com> Closes #4465 from mengxr/lda-guide and squashes the following commits: 6dcb7d1 [Xiangrui Meng] update java example in the user guide 76169ff [Xiangrui Meng] update java example 36c3ae2 [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into lda-guide c2a1efe [Joseph K. Bradley] Added LDA programming guide, plus Java example (which is in the guide and probably should be removed). (cherry picked from commit 855d12ac0a9cdade4cd2cc64c4e7209478be6690) Signed-off-by: Xiangrui Meng <meng@databricks.com>
Diffstat (limited to 'data')
-rw-r--r--data/mllib/sample_lda_data.txt12
1 files changed, 12 insertions, 0 deletions
diff --git a/data/mllib/sample_lda_data.txt b/data/mllib/sample_lda_data.txt
new file mode 100644
index 0000000000..2e76702ca9
--- /dev/null
+++ b/data/mllib/sample_lda_data.txt
@@ -0,0 +1,12 @@
+1 2 6 0 2 3 1 1 0 0 3
+1 3 0 1 3 0 0 2 0 0 1
+1 4 1 0 0 4 9 0 1 2 0
+2 1 0 3 0 0 5 0 2 3 9
+3 1 1 9 3 0 2 0 0 1 3
+4 2 0 3 4 5 1 1 1 4 0
+2 1 0 3 0 0 5 0 2 2 9
+1 1 1 9 2 1 2 0 0 1 3
+4 4 0 3 4 2 1 3 0 0 0
+2 8 2 0 3 0 2 0 2 7 2
+1 1 1 9 0 2 2 0 0 3 3
+4 1 0 0 4 5 1 3 0 1 0