path: root/python/pyspark/streaming/tests.py
author Holden Karau <holden@pigscanfly.ca> 2015-08-11 11:33:36 -0700
committer Joseph K. Bradley <joseph@databricks.com> 2015-08-11 11:33:36 -0700
commit dbd778d84d094ca142bc08c351478595b280bc2a
tree b7de552c15cca4f7317566aba5680e7538210a88
parent 8cad854ef6a2066de5adffcca6b79a205ccfd5f3
[SPARK-8764] [ML] string indexer should take option to handle unseen values
As a precursor to adding a public constructor, add an option to handle unseen values by skipping rather than throwing an exception (default remains throwing an exception).

Author: Holden Karau <holden@pigscanfly.ca>

Closes #7266 from holdenk/SPARK-8764-string-indexer-should-take-option-to-handle-unseen-values and squashes the following commits:

38a4de9 [Holden Karau] fix long line
045bf22 [Holden Karau] Add a second b entry so b gets 0 for sure
81dd312 [Holden Karau] Update the docs for handleInvalid param to be more descriptive
7f37f6e [Holden Karau] remove extra space (scala style)
414e249 [Holden Karau] And switch to using handleInvalid instead of skipInvalid
1e53f9b [Holden Karau] update the param (codegen side)
7a22215 [Holden Karau] fix typo
100a39b [Holden Karau] Merge in master
aa5b093 [Holden Karau] Since we filter we should never go down this code path if getSkipInvalid is true
75ffa69 [Holden Karau] Remove extra newline
d69ef5e [Holden Karau] Add a test
b5734be [Holden Karau] Add support for unseen labels
afecd4e [Holden Karau] Add a param to skip invalid entries.
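
The behaviour described in the commit message can be illustrated with a small plain-Python sketch. This is not the Spark implementation and does not use the Spark API: the names fit_labels, transform and handle_invalid are hypothetical stand-ins, and the alphabetical tie-break is an assumption made only so the example is deterministic. It shows the two modes the message names: raising on an unseen value (the default) and skipping it.

```python
from collections import Counter


def fit_labels(values):
    """Map each distinct string to an index, most frequent label first."""
    counts = Counter(values)
    # Sort by descending count, then alphabetically so ties are deterministic
    # (the tie-break is an assumption of this sketch, not taken from Spark).
    ordered = sorted(counts, key=lambda label: (-counts[label], label))
    return {label: index for index, label in enumerate(ordered)}


def transform(values, label_to_index, handle_invalid="error"):
    """Index values; unseen labels raise ("error") or are dropped ("skip")."""
    if handle_invalid not in ("error", "skip"):
        raise ValueError("handle_invalid must be 'error' or 'skip'")
    indexed = []
    for value in values:
        if value in label_to_index:
            indexed.append((value, label_to_index[value]))
        elif handle_invalid == "error":
            raise KeyError("Unseen label: %s" % value)
        # Otherwise handle_invalid == "skip": the row is dropped.
    return indexed


# "b" appears twice, so it is the most frequent label and gets index 0.
labels = fit_labels(["a", "b", "b"])
# "c" was never seen during fitting, so it is filtered out in skip mode.
skipped = transform(["a", "b", "c"], labels, handle_invalid="skip")
```

Calling transform(["c"], labels) without the handle_invalid argument raises KeyError in this sketch, mirroring the statement that the default remains throwing an exception.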
Diffstat (limited to 'python/pyspark/streaming/tests.py')
0 files changed, 0 insertions, 0 deletions