author: Liang-Chi Hsieh <viirya@gmail.com> 2015-04-15 13:01:29 -0700
committer: Michael Armbrust <michael@databricks.com> 2015-04-15 13:02:25 -0700
commit: e3e4e9a38b25174ed8bb460ba2b375813ebf3b4b
tree: 94345d56d01146bfe44167ba29d1d6ac2f69a1a2
parent: b75b3070740803480d235b0c9a86673721344f30
[SPARK-6800][SQL] Update doc for JDBCRelation's columnPartition
JIRA https://issues.apache.org/jira/browse/SPARK-6800

Author: Liang-Chi Hsieh <viirya@gmail.com>

Closes #5488 from viirya/fix_jdbc_where and squashes the following commits:

51386c8 [Liang-Chi Hsieh] Update code comment.
1dcc929 [Liang-Chi Hsieh] Update document.
3eb74d6 [Liang-Chi Hsieh] Revert and modify doc.
df11783 [Liang-Chi Hsieh] Merge remote-tracking branch 'upstream/master' into fix_jdbc_where
3e7db15 [Liang-Chi Hsieh] Fix wrong logic to generate WHERE clause for JDBC.
-rw-r--r-- docs/sql-programming-guide.md 5
-rw-r--r-- sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala 4
2 files changed, 6 insertions, 3 deletions
diff --git a/docs/sql-programming-guide.md b/docs/sql-programming-guide.md
index 332618edf0..03500867df 100644
--- a/docs/sql-programming-guide.md
+++ b/docs/sql-programming-guide.md
@@ -1371,7 +1371,10 @@ the Data Sources API. The following options are supported:
<td>
These options must all be specified if any of them is specified. They describe how to
partition the table when reading in parallel from multiple workers.
- <code>partitionColumn</code> must be a numeric column from the table in question.
+ <code>partitionColumn</code> must be a numeric column from the table in question. Notice
+ that <code>lowerBound</code> and <code>upperBound</code> are just used to decide the
+ partition stride, not for filtering the rows in table. So all rows in the table will be
+ partitioned and returned.
</td>
</tr>
</table>
diff --git a/sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala b/sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala
index c25ef58e6f..b237fe684c 100644
--- a/sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala
+++ b/sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala
@@ -873,8 +873,8 @@ class SQLContext(@transient val sparkContext: SparkContext)
* passed to this function.
*
* @param columnName the name of a column of integral type that will be used for partitioning.
- * @param lowerBound the minimum value of `columnName` to retrieve
- * @param upperBound the maximum value of `columnName` to retrieve
+ * @param lowerBound the minimum value of `columnName` used to decide partition stride
+ * @param upperBound the maximum value of `columnName` used to decide partition stride
* @param numPartitions the number of partitions. the range `minValue`-`maxValue` will be split
* evenly into this many partitions
*
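
Note: the documentation change above says that lowerBound and upperBound only decide the partition stride and do not filter rows. A minimal standalone sketch of that idea follows. It is an illustration in plain Python, not Spark's actual columnPartition code; the function name partition_predicates and its parameter names are hypothetical, and NULL values and integer overflow are ignored. The first partition is given no lower limit and the last partition no upper limit, so rows outside the [lowerBound, upperBound] range still land in some partition.

```python
from typing import List, Optional


def partition_predicates(column: str, lower_bound: int, upper_bound: int,
                         num_partitions: int) -> List[Optional[str]]:
    """Return one WHERE predicate per partition (None means no filter).

    Hypothetical helper for illustration only; not part of Spark.
    """
    if num_partitions <= 1:
        # A single partition reads the whole table without any predicate.
        return [None]

    # The bounds are used only to compute the distance between boundaries.
    stride = upper_bound // num_partitions - lower_bound // num_partitions

    predicates: List[Optional[str]] = []
    current = lower_bound
    for i in range(num_partitions):
        # The first partition is open below, so values < lower_bound are kept.
        lower = f"{column} >= {current}" if i != 0 else None
        current += stride
        # The last partition is open above, so values >= upper_bound are kept.
        upper = f"{column} < {current}" if i != num_partitions - 1 else None
        parts = [p for p in (lower, upper) if p is not None]
        predicates.append(" AND ".join(parts) if parts else None)
    return predicates


if __name__ == "__main__":
    for predicate in partition_predicates("id", 0, 100, 4):
        print(predicate)
```

With column "id", bounds 0 and 100, and 4 partitions, the stride is 25. The first predicate has only an upper limit and the last has only a lower limit, which is why choosing bounds narrower than the data changes how evenly rows are spread but not which rows are returned.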