[SPARK-14206][SQL] buildReader() implementation for CSV - spark

diff options

author	Cheng Lian <lian@databricks.com>	2016-03-30 18:21:06 -0700
committer	Yin Huai <yhuai@databricks.com>	2016-03-30 18:21:06 -0700
commit	26445c2e472bad137fd350e4089dd0ff43a42039 (patch)
tree	7972c24c16fef4202224d9982edb6698ece7e589 /dev/deps/spark-deps-hadoop-2.3
parent	da54abfd8730ef752eca921089bcf568773bd24a (diff)
download	spark-26445c2e472bad137fd350e4089dd0ff43a42039.tar.gz spark-26445c2e472bad137fd350e4089dd0ff43a42039.tar.bz2 spark-26445c2e472bad137fd350e4089dd0ff43a42039.zip

[SPARK-14206][SQL] buildReader() implementation for CSV

## What changes were proposed in this pull request? Major changes: 1. Implement `FileFormat.buildReader()` for the CSV data source. 1. Add an extra argument to `FileFormat.buildReader()`, `physicalSchema`, which is basically the result of `FileFormat.inferSchema` or user specified schema. This argument is necessary because the CSV data source needs to know all the columns of the underlying files to read the file. ## How was this patch tested? Existing tests should do the work. Author: Cheng Lian <lian@databricks.com> Closes #12002 from liancheng/spark-14206-csv-build-reader.

Diffstat (limited to 'dev/deps/spark-deps-hadoop-2.3')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: