aboutsummaryrefslogtreecommitdiff
path: root/sql/catalyst
diff options
context:
space:
mode:
authorYuhao Yang <hhbyyh@gmail.com>2016-03-17 11:21:11 +0200
committerNick Pentreath <nick.pentreath@gmail.com>2016-03-17 11:21:11 +0200
commit357d82d84d6372debd28da6ad0a2ee904957a7fe (patch)
tree1c0facd6a63b865b7ea06ff516f69bf479a26cba /sql/catalyst
parent204c9dec2c3876d20558ef5bda4dbd6edaf59643 (diff)
downloadspark-357d82d84d6372debd28da6ad0a2ee904957a7fe.tar.gz
spark-357d82d84d6372debd28da6ad0a2ee904957a7fe.tar.bz2
spark-357d82d84d6372debd28da6ad0a2ee904957a7fe.zip
[SPARK-13629][ML] Add binary toggle Param to CountVectorizer
## What changes were proposed in this pull request? It would be handy to add a binary toggle Param to CountVectorizer, as in the scikit-learn one: http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectorizer.html If set, then all non-zero counts will be set to 1. ## How was this patch tested? unit tests Author: Yuhao Yang <hhbyyh@gmail.com> Closes #11536 from hhbyyh/cvToggle.
Diffstat (limited to 'sql/catalyst')
0 files changed, 0 insertions, 0 deletions