aboutsummaryrefslogtreecommitdiff
path: root/sql/hive
diff options
context:
space:
mode:
authorEric Liang <ekl@databricks.com>2016-11-04 15:54:28 -0700
committerYin Huai <yhuai@databricks.com>2016-11-04 15:54:28 -0700
commit4cee2ce251110218e68c0f8f30363ec2f2498bea (patch)
treeb1af6b5308cffbc0b74e0c5020d762b156977629 /sql/hive
parent550cd56e8b6addb26efe3ce16976c9c34fa0c832 (diff)
downloadspark-4cee2ce251110218e68c0f8f30363ec2f2498bea.tar.gz
spark-4cee2ce251110218e68c0f8f30363ec2f2498bea.tar.bz2
spark-4cee2ce251110218e68c0f8f30363ec2f2498bea.zip
[SPARK-18167] Re-enable the non-flaky parts of SQLQuerySuite
## What changes were proposed in this pull request? It seems the proximate cause of the test failures is that `cast(str as decimal)` in derby will raise an exception instead of returning NULL. This is a problem since Hive sometimes inserts `__HIVE_DEFAULT_PARTITION__` entries into the partition table as documented here: https://github.com/apache/hive/blob/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/MetaStoreDirectSql.java#L1034 Basically, when these special default partitions are present, partition pruning pushdown using the SQL-direct mode will fail due this cast exception. As commented on in `MetaStoreDirectSql.java` above, this is normally fine since Hive falls back to JDO pruning, however when the pruning predicate contains an unsupported operator such as `>`, that will fail as well. The only remaining question is why this behavior is nondeterministic. We know that when the test flakes, retries do not help, therefore the cause must be environmental. The current best hypothesis is that some config is different between different jenkins runs, which is why this PR prints out the Spark SQL and Hive confs for the test. The hope is that by comparing the config state for failure vs success we can isolate the root cause of the flakiness. **Update:** we could not isolate the issue. It does not seem to be due to configuration differences. As such, I'm going to enable the non-flaky parts of the test since we are fairly confident these issues only occur with Derby (which is not used in production). ## How was this patch tested? N/A Author: Eric Liang <ekl@databricks.com> Closes #15725 from ericl/print-confs-out.
Diffstat (limited to 'sql/hive')
-rw-r--r--sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala31
1 files changed, 10 insertions, 21 deletions
diff --git a/sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala b/sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala
index ad70835d06..cc09aef326 100644
--- a/sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala
+++ b/sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala
@@ -1569,27 +1569,16 @@ class SQLQuerySuite extends QueryTest with SQLTestUtils with TestHiveSingleton {
).map(i => Row(i._1, i._2, i._3, i._4)))
}
- ignore("SPARK-10562: partition by column with mixed case name") {
- def runOnce() {
- withTable("tbl10562") {
- val df = Seq(2012 -> "a").toDF("Year", "val")
- df.write.partitionBy("Year").saveAsTable("tbl10562")
- checkAnswer(sql("SELECT year FROM tbl10562"), Row(2012))
- checkAnswer(sql("SELECT Year FROM tbl10562"), Row(2012))
- checkAnswer(sql("SELECT yEAr FROM tbl10562"), Row(2012))
- checkAnswer(sql("SELECT val FROM tbl10562 WHERE Year > 2015"), Nil)
- checkAnswer(sql("SELECT val FROM tbl10562 WHERE Year == 2012"), Row("a"))
- }
- }
- try {
- runOnce()
- } catch {
- case t: Throwable =>
- // Retry to gather more test data. TODO(ekl) revert this once we deflake this test.
- runOnce()
- runOnce()
- runOnce()
- throw t
+ test("SPARK-10562: partition by column with mixed case name") {
+ withTable("tbl10562") {
+ val df = Seq(2012 -> "a").toDF("Year", "val")
+ df.write.partitionBy("Year").saveAsTable("tbl10562")
+ checkAnswer(sql("SELECT year FROM tbl10562"), Row(2012))
+ checkAnswer(sql("SELECT Year FROM tbl10562"), Row(2012))
+ checkAnswer(sql("SELECT yEAr FROM tbl10562"), Row(2012))
+// TODO(ekl) this is causing test flakes [SPARK-18167], but we think the issue is derby specific
+// checkAnswer(sql("SELECT val FROM tbl10562 WHERE Year > 2015"), Nil)
+ checkAnswer(sql("SELECT val FROM tbl10562 WHERE Year == 2012"), Row("a"))
}
}