author | Cheng Lian <lian@databricks.com> | 2016-01-26 20:12:34 -0800 |
---|---|---|
committer | Reynold Xin <rxin@databricks.com> | 2016-01-26 20:12:34 -0800 |
commit | ce38a35b764397fcf561ac81de6da96579f5c13e (patch) | |
tree | 0f03dfb31f4840488fabc75d5b4edbdc7eb0d874 /sql/core/pom.xml | |
parent | e7f9199e709c46a6b5ad6b03c9ecf12cc19e3a41 (diff) | |
[SPARK-12935][SQL] DataFrame API for Count-Min Sketch
This PR integrates Count-Min Sketch from spark-sketch into the DataFrame API. This version builds the sketch with `RDD.aggregate`. A more performant UDAF-based version can be added in follow-up PRs.
Author: Cheng Lian <lian@databricks.com>
Closes #10911 from liancheng/cms-df-api.
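
To illustrate the aggregate-based approach the commit message describes, here is a minimal sketch in plain Python. It is not the spark-sketch implementation and does not use Spark: the `CountMinSketch` class, its SHA-256 hashing scheme, the `aggregate` helper, and the depth and width values are all assumptions made for illustration. It only shows the general pattern: fold each partition into its own sketch (the sequence operation), then merge the per-partition sketches (the combine operation).

```python
import hashlib
from functools import reduce


class CountMinSketch:
    """Toy Count-Min Sketch: `depth` rows of `width` counters (illustrative only)."""

    def __init__(self, depth, width):
        self.depth = depth
        self.width = width
        self.table = [[0] * width for _ in range(depth)]

    def _bucket(self, item, row):
        # Hypothetical hashing scheme: salt the item with the row index.
        digest = hashlib.sha256(f"{row}:{item}".encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % self.width

    def add(self, item):
        # Increment one counter per row.
        for row in range(self.depth):
            self.table[row][self._bucket(item, row)] += 1
        return self

    def merge(self, other):
        # Sketches with identical dimensions merge by element-wise addition.
        if (self.depth, self.width) != (other.depth, other.width):
            raise ValueError("cannot merge sketches of different dimensions")
        for row in range(self.depth):
            for col in range(self.width):
                self.table[row][col] += other.table[row][col]
        return self

    def estimate(self, item):
        # The minimum over rows never undercounts the true frequency.
        return min(self.table[row][self._bucket(item, row)]
                   for row in range(self.depth))


def aggregate(partitions, zero_factory, seq_op, comb_op):
    """Stand-in for an aggregate over partitions: fold each one, then combine."""
    partials = [reduce(seq_op, part, zero_factory()) for part in partitions]
    return reduce(comb_op, partials, zero_factory())


# Three hypothetical partitions of a single column.
partitions = [["a", "b", "a"], ["a", "c"], ["b"]]
sketch = aggregate(
    partitions,
    zero_factory=lambda: CountMinSketch(depth=4, width=64),
    seq_op=lambda s, item: s.add(item),
    comb_op=lambda left, right: left.merge(right),
)
```

Because merging is element-wise addition, the result does not depend on how items are split across partitions, which is what makes the structure a natural fit for an aggregate. Estimates may overcount when items collide in every row, but they never undercount.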
Diffstat (limited to 'sql/core/pom.xml')
-rw-r--r-- | sql/core/pom.xml | 5 |
1 files changed, 5 insertions, 0 deletions
diff --git a/sql/core/pom.xml b/sql/core/pom.xml
index 31b364f351..4bb55f6b7f 100644
--- a/sql/core/pom.xml
+++ b/sql/core/pom.xml
@@ -44,6 +44,11 @@
     </dependency>
     <dependency>
       <groupId>org.apache.spark</groupId>
+      <artifactId>spark-sketch_2.10</artifactId>
+      <version>${project.version}</version>
+    </dependency>
+    <dependency>
+      <groupId>org.apache.spark</groupId>
       <artifactId>spark-core_${scala.binary.version}</artifactId>
       <version>${project.version}</version>
     </dependency>