[SPARK-14088][SQL] Some Dataset API touch-up

## What changes were proposed in this pull request?

1. Deprecated unionAll. It is confusing to have both "union" and "unionAll" when the two do the same thing in Spark but differ in SQL.
2. Renamed reduce in KeyValueGroupedDataset to reduceGroups so that it is more consistent with the rest of the functions in KeyValueGroupedDataset. This also makes it more obvious what "reduce" and "reduceGroups" mean. Previously it was confusing because "reduce" could mean reducing a whole Dataset or just reducing groups.
3. Added a "name" function, which is a more natural way to name columns than "as" for non-SQL users.
4. Removed the "subtract" function, since it is just an alias for "except".

## How was this patch tested?

All changes should be covered by existing tests. Also added a couple of test cases to cover "name".

Author: Reynold Xin <rxin@databricks.com>

Closes #11908 from rxin/SPARK-14088.
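To illustrate the distinction behind item 2, here is a minimal plain-Java sketch of "reducing a whole collection" versus "reducing each group". It does not use Spark; the class name `ReduceGroupsSketch`, the helpers `reduceAll` and `reduceGroups`, and the sample input are hypothetical stand-ins chosen for illustration, not the Spark implementation.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.BinaryOperator;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical stand-in for illustration only; this is not Spark code.
public class ReduceGroupsSketch {

    // Reduces the whole collection to a single value.
    public static <V> V reduceAll(List<V> values, BinaryOperator<V> f) {
        return values.stream().reduce(f).orElseThrow(IllegalStateException::new);
    }

    // Reduces the values of each key group separately, giving one result per key.
    public static <K extends Comparable<K>, V> Map<K, V> reduceGroups(
            List<V> values, Function<V, K> keyOf, BinaryOperator<V> f) {
        // The merge function f is applied only to values that share a key.
        return values.stream()
                .collect(Collectors.toMap(keyOf, Function.identity(), f, TreeMap::new));
    }

    public static void main(String[] args) {
        // Assumed sample input, grouped by string length below.
        List<String> data = Arrays.asList("a", "foo", "bar");

        System.out.println(reduceAll(data, String::concat));
        // prints "afoobar"

        System.out.println(reduceGroups(data, String::length, String::concat));
        // prints "{1=a, 3=foobar}"
    }
}
```

With a single method name for both operations, a reader cannot tell from the call site which of the two results to expect; a distinct name such as reduceGroups removes that ambiguity.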
author: Reynold Xin <rxin@databricks.com> 2016-03-22 23:43:09 -0700
committer: Reynold Xin <rxin@databricks.com> 2016-03-22 23:43:09 -0700
commit: 926a93e54b83f1ee596096f3301fef015705b627 (patch)
tree: 97817dcf1069bcc8f148f996873bef5bb6643126 /sql/core/src/test/java
parent: 1a22cf1e9b6447005c9a329856d734d80a496a06 (diff)
1 files changed, 2 insertions, 2 deletions
diff --git a/sql/core/src/test/java/test/org/apache/spark/sql/JavaDatasetSuite.java b/sql/core/src/test/java/test/org/apache/spark/sql/JavaDatasetSuite.java
index 3bff129ae2..18f17a85a9 100644
--- a/sql/core/src/test/java/test/org/apache/spark/sql/JavaDatasetSuite.java
+++ b/sql/core/src/test/java/test/org/apache/spark/sql/JavaDatasetSuite.java
@@ -204,7 +204,7 @@ public class JavaDatasetSuite implements Serializable {
 
     Assert.assertEquals(asSet("1a", "3foobar"), toSet(flatMapped.collectAsList()));
 
-    Dataset<Tuple2<Integer, String>> reduced = grouped.reduce(new ReduceFunction<String>() {
+    Dataset<Tuple2<Integer, String>> reduced = grouped.reduceGroups(new ReduceFunction<String>() {
       @Override
       public String call(String v1, String v2) throws Exception {
         return v1 + v2;
@@ -300,7 +300,7 @@ public class JavaDatasetSuite implements Serializable {
       Arrays.asList("abc", "abc", "xyz", "xyz", "foo", "foo", "abc", "abc", "xyz"),
       unioned.collectAsList());
 
-    Dataset<String> subtracted = ds.subtract(ds2);
+    Dataset<String> subtracted = ds.except(ds2);
     Assert.assertEquals(Arrays.asList("abc", "abc"), subtracted.collectAsList());
   }
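The expected list for `unioned` in the second hunk keeps repeated values, which fits the commit message's point that "union" and "unionAll" behave the same in Spark while SQL distinguishes UNION from UNION ALL. The plain-Java sketch below contrasts the two SQL behaviors. It does not use Spark; `UnionSketch`, `unionAll`, `sqlUnion`, and the sample lists are hypothetical names and data introduced only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;

// Hypothetical stand-in for illustration only; this is not Spark code.
public class UnionSketch {

    // UNION ALL semantics: concatenate both inputs and keep duplicates.
    public static <T> List<T> unionAll(List<T> left, List<T> right) {
        List<T> out = new ArrayList<>(left);
        out.addAll(right);
        return out;
    }

    // SQL UNION semantics: concatenate, then drop duplicate elements.
    // LinkedHashSet keeps first-seen order so the output is easy to read.
    public static <T> List<T> sqlUnion(List<T> left, List<T> right) {
        return new ArrayList<>(new LinkedHashSet<>(unionAll(left, right)));
    }

    public static void main(String[] args) {
        // Assumed sample inputs that share the value "xyz".
        List<String> a = Arrays.asList("abc", "xyz");
        List<String> b = Arrays.asList("xyz", "foo");

        System.out.println(unionAll(a, b)); // prints "[abc, xyz, xyz, foo]"
        System.out.println(sqlUnion(a, b)); // prints "[abc, xyz, foo]"
    }
}
```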