diff options
author | hyukjinkwon <gurwls223@gmail.com> | 2016-03-21 15:42:35 +0800 |
---|---|---|
committer | Wenchen Fan <wenchen@databricks.com> | 2016-03-21 15:42:35 +0800 |
commit | e474088144cdd2632cf2fef6b2cf10b3cd191c23 (patch) | |
tree | 66374e51a3318ea931ebef343e0a3d285c06d2c5 /mllib/src/test | |
parent | f58319a24fd5e026411538b1fb7336d9d894277b (diff) | |
download | spark-e474088144cdd2632cf2fef6b2cf10b3cd191c23.tar.gz spark-e474088144cdd2632cf2fef6b2cf10b3cd191c23.tar.bz2 spark-e474088144cdd2632cf2fef6b2cf10b3cd191c23.zip |
[SPARK-13764][SQL] Parse modes in JSON data source
## What changes were proposed in this pull request?
Currently, there is no way to control the behaviour when fails to parse corrupt records in JSON data source .
This PR adds the support for parse modes just like CSV data source. There are three modes below:
- `PERMISSIVE` : When it fails to parse, this sets `null` to to field. This is a default mode when it has been this mode.
- `DROPMALFORMED`: When it fails to parse, this drops the whole record.
- `FAILFAST`: When it fails to parse, it just throws an exception.
This PR also make JSON data source share the `ParseModes` in CSV data source.
## How was this patch tested?
Unit tests were used and `./dev/run_tests` for code style tests.
Author: hyukjinkwon <gurwls223@gmail.com>
Closes #11756 from HyukjinKwon/SPARK-13764.
Diffstat (limited to 'mllib/src/test')
0 files changed, 0 insertions, 0 deletions