diff options
author | Burak Yavuz <brkyvz@gmail.com> | 2015-05-01 13:29:17 -0700 |
---|---|---|
committer | Reynold Xin <rxin@databricks.com> | 2015-05-01 13:29:17 -0700 |
commit | 4dc8d74491b101a794cf8d386d8c5ebc6019b75f (patch) | |
tree | 01733e92623635c80a0e3d7b50869b742f3f82a1 /python/pyspark/sql/__init__.py | |
parent | 7b5dd3e3c0030087eea5a8224789352c03717c1d (diff) | |
download | spark-4dc8d74491b101a794cf8d386d8c5ebc6019b75f.tar.gz spark-4dc8d74491b101a794cf8d386d8c5ebc6019b75f.tar.bz2 spark-4dc8d74491b101a794cf8d386d8c5ebc6019b75f.zip |
[SPARK-7240][SQL] Single pass covariance calculation for dataframes
Added the calculation of covariance between two columns to DataFrames.
cc mengxr rxin
Author: Burak Yavuz <brkyvz@gmail.com>
Closes #5825 from brkyvz/df-cov and squashes the following commits:
cb18046 [Burak Yavuz] changed to sample covariance
f2e862b [Burak Yavuz] fixed failed test
51e39b8 [Burak Yavuz] moved implementation
0c6a759 [Burak Yavuz] addressed math comments
8456eca [Burak Yavuz] fix pyStyle3
aa2ad29 [Burak Yavuz] fix pyStyle2
4e97a50 [Burak Yavuz] Merge branch 'master' of github.com:apache/spark into df-cov
e3b0b85 [Burak Yavuz] addressed comments v0.1
a7115f1 [Burak Yavuz] fix python style
7dc6dbc [Burak Yavuz] reorder imports
408cb77 [Burak Yavuz] initial commit
Diffstat (limited to 'python/pyspark/sql/__init__.py')
-rw-r--r-- | python/pyspark/sql/__init__.py | 4 |
1 files changed, 3 insertions, 1 deletions
diff --git a/python/pyspark/sql/__init__.py b/python/pyspark/sql/__init__.py index 6d54b9e49e..b60b991dd4 100644 --- a/python/pyspark/sql/__init__.py +++ b/python/pyspark/sql/__init__.py @@ -54,7 +54,9 @@ del modname, sys from pyspark.sql.types import Row from pyspark.sql.context import SQLContext, HiveContext from pyspark.sql.dataframe import DataFrame, GroupedData, Column, SchemaRDD, DataFrameNaFunctions +from pyspark.sql.dataframe import DataFrameStatFunctions __all__ = [ - 'SQLContext', 'HiveContext', 'DataFrame', 'GroupedData', 'Column', 'Row', 'DataFrameNaFunctions' + 'SQLContext', 'HiveContext', 'DataFrame', 'GroupedData', 'Column', 'Row', + 'DataFrameNaFunctions', 'DataFrameStatFunctions' ] |