path: root/docs/mllib-data-types.md
author Cheng Hao <hao.cheng@intel.com> 2015-08-05 22:35:55 +0800
committer Cheng Lian <lian@databricks.com> 2015-08-05 22:35:55 +0800
commit 519cf6d3f764a977770266784d6902fe205a070f
tree 2bd9e34a52613ee3c06757aa3dec2a653e7f2ee1 /docs/mllib-data-types.md
parent eb8bfa3eaa0846d685e4d12f9ee2e4273b85edcf
[SPARK-9381] [SQL] Migrate JSON data source to the new partitioning data source

Support partitioning for the JSON data source.

There are still 2 open issues for the `HadoopFsRelation`:
- `refresh()` will invoke the `discoveryPartition()`, which will automatically infer the data types of the partition columns and may conflict with the given partition columns. (TODO: enable `HadoopFsRelationSuite.Partition column type casting`)
- When inserting data into a cached HadoopFsRelation-based table, we need to invalidate the cache after the insertion. (TODO: enable `InsertSuite.Caching`)

Author: Cheng Hao <hao.cheng@intel.com>

Closes #7696 from chenghao-intel/json and squashes the following commits:

d90b104 [Cheng Hao] revert the change for JacksonGenerator.apply
307111d [Cheng Hao] fix bug in the unit test
8738c8a [Cheng Hao] fix bug in unit testing
35f2cde [Cheng Hao] support partition for json format

Diffstat (limited to 'docs/mllib-data-types.md')
0 files changed, 0 insertions, 0 deletions