author | Zheng RuiFeng <ruifengz@foxmail.com> | 2017-03-21 08:45:59 -0700 |
---|---|---|
committer | Xiao Li <gatorsmile@gmail.com> | 2017-03-21 08:45:59 -0700 |
commit | 63f077fbe50b4094340e9915db41d7dbdba52975 (patch) | |
tree | 3442fa7374aa58b648de5c5bb4c76a5e3a9769df /python/pyspark/sql/dataframe.py | |
parent | 14865d7ff78db5cf9a3e8626204c8e7ed059c353 (diff) | |
[SPARK-20041][DOC] Update docs for NaN handling in approxQuantile
## What changes were proposed in this pull request?
Update docs for NaN handling in approxQuantile.
## How was this patch tested?
existing tests.
Author: Zheng RuiFeng <ruifengz@foxmail.com>
Closes #17369 from zhengruifeng/doc_quantiles_nan.
Diffstat (limited to 'python/pyspark/sql/dataframe.py')
-rw-r--r-- | python/pyspark/sql/dataframe.py | 3 |
1 files changed, 2 insertions, 1 deletions

    diff --git a/python/pyspark/sql/dataframe.py b/python/pyspark/sql/dataframe.py
    index bb6df22682..a24512f53c 100644
    --- a/python/pyspark/sql/dataframe.py
    +++ b/python/pyspark/sql/dataframe.py
    @@ -1384,7 +1384,8 @@ class DataFrame(object):
             Space-efficient Online Computation of Quantile Summaries]]
             by Greenwald and Khanna.
     
    -        Note that rows containing any null values will be removed before calculation.
    +        Note that null values will be ignored in numerical columns before calculation.
    +        For columns only containing null values, an empty list is returned.
     
             :param col: str, list.
               Can be a single column name, or a list of names for multiple columns.
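
To make the documented contract concrete, here is a minimal pure-Python sketch of the behavior the new docstring describes: nulls are dropped per column before the quantiles are computed, and a column containing only nulls yields an empty list. The helper name `quantiles_ignoring_nulls` is hypothetical and not part of PySpark. It computes exact nearest-rank quantiles by sorting, rather than using the Greenwald-Khanna summary that `approxQuantile` is documented to use, so it illustrates only the null handling, not Spark's implementation.

```python
import math
from typing import List, Optional, Sequence


def quantiles_ignoring_nulls(
    values: Sequence[Optional[float]],
    probabilities: Sequence[float],
) -> List[float]:
    """Hypothetical helper (not a PySpark API) mirroring the documented
    null handling: None entries are ignored, and an all-None column
    produces an empty list.

    Assumption: exact nearest-rank quantiles on a sorted copy stand in
    for the approximate Greenwald-Khanna summary. NaN values are not
    treated specially here.
    """
    for p in probabilities:
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must be in [0, 1]")

    # Ignore nulls before calculation, as the updated docstring states.
    non_null = sorted(v for v in values if v is not None)

    # A column that contains only nulls yields an empty list.
    if not non_null:
        return []

    n = len(non_null)
    result = []
    for p in probabilities:
        # Nearest-rank: the smallest 1-based rank covering fraction p.
        rank = max(math.ceil(p * n), 1)
        result.append(non_null[rank - 1])
    return result


mixed = quantiles_ignoring_nulls([1.0, None, 3.0, 2.0, None], [0.0, 0.5, 1.0])
all_null = quantiles_ignoring_nulls([None, None], [0.5])
```

Under the previous wording, a row with a null in any column would have been dropped entirely. The sketch instead filters a single column in isolation, which matches the per-column reading of the updated note.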