aboutsummaryrefslogtreecommitdiff
path: root/docs/README.md
diff options
context:
space:
mode:
authorMichael Armbrust <michael@databricks.com>2014-11-20 18:31:02 -0800
committerMichael Armbrust <michael@databricks.com>2014-11-20 18:31:02 -0800
commit02ec058efe24348cdd3691b55942e6f0ef138732 (patch)
treeaeb665a9e313f6dfe9de73329987c26762537c8f /docs/README.md
parent84d79ee9ec47465269f7b0a7971176da93c96f3f (diff)
downloadspark-02ec058efe24348cdd3691b55942e6f0ef138732.tar.gz
spark-02ec058efe24348cdd3691b55942e6f0ef138732.tar.bz2
spark-02ec058efe24348cdd3691b55942e6f0ef138732.zip
[SPARK-4413][SQL] Parquet support through datasource API
Goals: - Support for accessing parquet using SQL but not requiring Hive (thus allowing support of parquet tables with decimal columns) - Support for folder based partitioning with automatic discovery of available partitions - Caching of file metadata See scaladoc of `ParquetRelation2` for more details. Author: Michael Armbrust <michael@databricks.com> Closes #3269 from marmbrus/newParquet and squashes the following commits: 1dd75f1 [Michael Armbrust] Pass all paths for FileInputFormat at once. 645768b [Michael Armbrust] Review comments. abd8e2f [Michael Armbrust] Alternative implementation of parquet based on the datasources API. 938019e [Michael Armbrust] Add an experimental interface to data sources that exposes catalyst expressions. e9d2641 [Michael Armbrust] logging / formatting improvements.
Diffstat (limited to 'docs/README.md')
0 files changed, 0 insertions, 0 deletions