[SPARK-13764][SQL] Parse modes in JSON data source - spark

diff options

author	hyukjinkwon <gurwls223@gmail.com>	2016-03-21 15:42:35 +0800
committer	Wenchen Fan <wenchen@databricks.com>	2016-03-21 15:42:35 +0800
commit	e474088144cdd2632cf2fef6b2cf10b3cd191c23 (patch)
tree	66374e51a3318ea931ebef343e0a3d285c06d2c5 /mllib/src/test
parent	f58319a24fd5e026411538b1fb7336d9d894277b (diff)
download	spark-e474088144cdd2632cf2fef6b2cf10b3cd191c23.tar.gz spark-e474088144cdd2632cf2fef6b2cf10b3cd191c23.tar.bz2 spark-e474088144cdd2632cf2fef6b2cf10b3cd191c23.zip

[SPARK-13764][SQL] Parse modes in JSON data source

## What changes were proposed in this pull request? Currently, there is no way to control the behaviour when fails to parse corrupt records in JSON data source . This PR adds the support for parse modes just like CSV data source. There are three modes below: - `PERMISSIVE` : When it fails to parse, this sets `null` to to field. This is a default mode when it has been this mode. - `DROPMALFORMED`: When it fails to parse, this drops the whole record. - `FAILFAST`: When it fails to parse, it just throws an exception. This PR also make JSON data source share the `ParseModes` in CSV data source. ## How was this patch tested? Unit tests were used and `./dev/run_tests` for code style tests. Author: hyukjinkwon <gurwls223@gmail.com> Closes #11756 from HyukjinKwon/SPARK-13764.

Diffstat (limited to 'mllib/src/test')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: