author: Wenchen Fan <wenchen@databricks.com> 2015-11-04 09:33:30 -0800
committer: Reynold Xin <rxin@databricks.com> 2015-11-04 09:33:30 -0800
commit: e0fc9c7e59848cb78f8d598898bfca004a3710d8
tree: c3174dc30e83313a608e7f0322d3313671b4e12b /docs
parent: cd1df662386c599a9d0968b9fc14f27b0883d285
[SPARK-11197][SQL] add doc for run SQL on files directly
Author: Wenchen Fan <wenchen@databricks.com>

Closes #9467 from cloud-fan/doc.
Diffstat (limited to 'docs')
-rw-r--r-- docs/sql-programming-guide.md | 38
1 file changed, 38 insertions, 0 deletions
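
The section added by this diff relies on one piece of SQL syntax: a data source name, a dot, and a backtick-quoted file path in the `FROM` clause. As a minimal sketch of that syntax, the helper below only builds the query string; `file_query` is a hypothetical name used for illustration and is not part of Spark, and rejecting paths that contain a backtick is a conservative assumption rather than documented behaviour.

```python
def file_query(source, path, columns="*"):
    """Build a SQL string that queries a file directly.

    `source` is the data source name placed before the dot (for example
    "parquet"); `path` is the file path, wrapped in backticks.
    """
    # Assumption: refuse paths containing a backtick instead of guessing
    # at how the SQL parser would want them escaped.
    if "`" in path:
        raise ValueError("path must not contain a backtick")
    return "SELECT {0} FROM {1}.`{2}`".format(columns, source, path)


query = file_query("parquet", "examples/src/main/resources/users.parquet")
print(query)
```

Given an existing `sqlContext`, as in the examples in the diff, the resulting string would be passed on with `df = sqlContext.sql(query)`.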
diff --git a/docs/sql-programming-guide.md b/docs/sql-programming-guide.md
index 510b359972..2fe5c36338 100644
--- a/docs/sql-programming-guide.md
+++ b/docs/sql-programming-guide.md
@@ -882,6 +882,44 @@ saveDF(select(df, "name", "age"), "namesAndAges.parquet", "parquet")
</div>
</div>
+### Run SQL on files directly
+
+Instead of using the read API to load a file into a DataFrame and then querying it, you can
+query the file directly with SQL.
+
+<div class="codetabs">
+<div data-lang="scala" markdown="1">
+
+{% highlight scala %}
+val df = sqlContext.sql("SELECT * FROM parquet.`examples/src/main/resources/users.parquet`")
+{% endhighlight %}
+
+</div>
+
+<div data-lang="java" markdown="1">
+
+{% highlight java %}
+DataFrame df = sqlContext.sql("SELECT * FROM parquet.`examples/src/main/resources/users.parquet`");
+{% endhighlight %}
+</div>
+
+<div data-lang="python" markdown="1">
+
+{% highlight python %}
+df = sqlContext.sql("SELECT * FROM parquet.`examples/src/main/resources/users.parquet`")
+{% endhighlight %}
+
+</div>
+
+<div data-lang="r" markdown="1">
+
+{% highlight r %}
+df <- sql(sqlContext, "SELECT * FROM parquet.`examples/src/main/resources/users.parquet`")
+{% endhighlight %}
+
+</div>
+</div>
+
### Save Modes
Save operations can optionally take a `SaveMode`, that specifies how to handle existing data if