diff options
author | Yuhao Yang <hhbyyh@gmail.com> | 2016-03-17 11:21:11 +0200 |
---|---|---|
committer | Nick Pentreath <nick.pentreath@gmail.com> | 2016-03-17 11:21:11 +0200 |
commit | 357d82d84d6372debd28da6ad0a2ee904957a7fe (patch) | |
tree | 1c0facd6a63b865b7ea06ff516f69bf479a26cba /CONTRIBUTING.md | |
parent | 204c9dec2c3876d20558ef5bda4dbd6edaf59643 (diff) | |
download | spark-357d82d84d6372debd28da6ad0a2ee904957a7fe.tar.gz spark-357d82d84d6372debd28da6ad0a2ee904957a7fe.tar.bz2 spark-357d82d84d6372debd28da6ad0a2ee904957a7fe.zip |
[SPARK-13629][ML] Add binary toggle Param to CountVectorizer
## What changes were proposed in this pull request?
It would be handy to add a binary toggle Param to CountVectorizer, as in the scikit-learn one: http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectorizer.html
If set, then all non-zero counts will be set to 1.
## How was this patch tested?
unit tests
Author: Yuhao Yang <hhbyyh@gmail.com>
Closes #11536 from hhbyyh/cvToggle.
Diffstat (limited to 'CONTRIBUTING.md')
0 files changed, 0 insertions, 0 deletions