[SPARK-6827] [MLLIB] Wrap FPGrowthModel.freqItemsets and make it consistent with Java API - spark

author	Yanbo Liang <ybliang8@gmail.com>	2015-04-22 17:22:26 -0700
committer	Xiangrui Meng <meng@databricks.com>	2015-04-22 17:22:26 -0700
commit	f4f39981f4f5e88c30eec7d0b107e2c3cdc268c9 (patch)
tree	d26235eae02cab27c9cdd537d53d41b4978fecfd /repl/scala-2.10/src/test
parent	baf865ddc2cff9b99d6aeab9861e030da511257f (diff)
download	spark-f4f39981f4f5e88c30eec7d0b107e2c3cdc268c9.tar.gz spark-f4f39981f4f5e88c30eec7d0b107e2c3cdc268c9.tar.bz2 spark-f4f39981f4f5e88c30eec7d0b107e2c3cdc268c9.zip

[SPARK-6827] [MLLIB] Wrap FPGrowthModel.freqItemsets and make it consistent with Java API

Make PySpark ```FPGrowthModel.freqItemsets``` consistent with the Java/Scala API, in the same way as ```MatrixFactorizationModel.userFeatures```. It returns an RDD in which each tuple is composed of an array and a long value.

I think it is difficult to implement namedtuples to wrap the output, because the items of freqItemsets can be of any type and of arbitrary length, which makes the corresponding SerDe function tedious to implement.

Author: Yanbo Liang <ybliang8@gmail.com>

Closes #5614 from yanboliang/spark-6827 and squashes the following commits:

da8c404 [Yanbo Liang] use namedtuple
5532e78 [Yanbo Liang] Wrap FPGrowthModel.freqItemsets and make it consistent with Java API
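For illustration only, here is a minimal sketch of how (array, long) pairs could be wrapped in a namedtuple on the Python side, as the "use namedtuple" commit suggests. It runs on a plain list rather than an RDD. The names `FreqItemset`, `items`, `freq`, and `wrap_freq_itemsets` are assumptions chosen for this example, not taken from the patch itself.

```python
from collections import namedtuple

# Hypothetical record type: one frequent itemset plus its occurrence count.
FreqItemset = namedtuple("FreqItemset", ["items", "freq"])


def wrap_freq_itemsets(raw_pairs):
    """Wrap raw (items, freq) pairs in FreqItemset records.

    Items may be of any type and any length, so they are kept as a plain list.
    """
    return [FreqItemset(list(items), int(freq)) for items, freq in raw_pairs]


# Stand-in for the (array, long) tuples described in the commit message.
raw = [(["a"], 4), (["a", "b"], 3), ([1, 2, 3], 2)]
wrapped = wrap_freq_itemsets(raw)

for itemset in wrapped:
    print(itemset.items, itemset.freq)
```

Because a namedtuple is still a tuple, code that unpacks each record as `items, freq = record` would keep working alongside attribute access. With an RDD, the same per-record conversion could presumably be applied through a `map` call.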

Diffstat (limited to 'repl/scala-2.10/src/test')

0 files changed, 0 insertions, 0 deletions

