[SPARK-12833][SQL] Initial import of spark-csv

CSV is the most common data format in the "small data" world. It is often the first format people want to try when they see Spark on a single node. Having to rely on a third-party component for this leads to a poor experience for new users. This PR merges the popular spark-csv data source package (https://github.com/databricks/spark-csv) into Spark SQL. This is the first PR to bring the functionality to Spark 2.0 master. We will complete the items outlined in the design document (see JIRA attachment) in follow-up pull requests.

Author: Hossein <hossein@databricks.com>
Author: Reynold Xin <rxin@databricks.com>

Closes #10766 from rxin/csv.
author: Hossein <hossein@databricks.com> 2016-01-15 11:46:46 -0800
committer: Reynold Xin <rxin@databricks.com> 2016-01-15 11:46:46 -0800
commit: 5f83c6991c95616ecbc2878f8860c69b2826f56c (patch)
tree: 86dc70e45f1b27b67efec9724632a108d69f2ef0 /.rat-excludes
parent: c5e7076da72657ea35a0aa388f8d2e6411d39280 (diff)
download: spark-5f83c6991c95616ecbc2878f8860c69b2826f56c.tar.gz
spark-5f83c6991c95616ecbc2878f8860c69b2826f56c.tar.bz2
spark-5f83c6991c95616ecbc2878f8860c69b2826f56c.zip
1 files changed, 2 insertions, 0 deletions
diff --git a/.rat-excludes b/.rat-excludes
index bf071eba65..a4f316a4aa 100644
--- a/.rat-excludes
+++ b/.rat-excludes
@@ -86,3 +86,5 @@ org.apache.spark.scheduler.SparkHistoryListenerFactory
 .*parquet
 LZ4BlockInputStream.java
 spark-deps-.*
+.*csv
+.*tsv