diff options
author | Hossein <hossein@databricks.com> | 2016-01-15 11:46:46 -0800 |
---|---|---|
committer | Reynold Xin <rxin@databricks.com> | 2016-01-15 11:46:46 -0800 |
commit | 5f83c6991c95616ecbc2878f8860c69b2826f56c (patch) | |
tree | 86dc70e45f1b27b67efec9724632a108d69f2ef0 /.rat-excludes | |
parent | c5e7076da72657ea35a0aa388f8d2e6411d39280 (diff) | |
download | spark-5f83c6991c95616ecbc2878f8860c69b2826f56c.tar.gz spark-5f83c6991c95616ecbc2878f8860c69b2826f56c.tar.bz2 spark-5f83c6991c95616ecbc2878f8860c69b2826f56c.zip |
[SPARK-12833][SQL] Initial import of spark-csv
CSV is the most common data format in the "small data" world. It is often the first format people want to try when they see Spark on a single node. Having to rely on a 3rd party component for this leads to poor user experience for new users. This PR merges the popular spark-csv data source package (https://github.com/databricks/spark-csv) with SparkSQL.
This is a first PR to bring the functionality to spark 2.0 master. We will complete items outlines in the design document (see JIRA attachment) in follow up pull requests.
Author: Hossein <hossein@databricks.com>
Author: Reynold Xin <rxin@databricks.com>
Closes #10766 from rxin/csv.
Diffstat (limited to '.rat-excludes')
-rw-r--r-- | .rat-excludes | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/.rat-excludes b/.rat-excludes index bf071eba65..a4f316a4aa 100644 --- a/.rat-excludes +++ b/.rat-excludes @@ -86,3 +86,5 @@ org.apache.spark.scheduler.SparkHistoryListenerFactory .*parquet LZ4BlockInputStream.java spark-deps-.* +.*csv +.*tsv |