author Zheng RuiFeng <ruifengz@foxmail.com> 2017-03-21 08:45:59 -0700
committer Xiao Li <gatorsmile@gmail.com> 2017-03-21 08:45:59 -0700
commit 63f077fbe50b4094340e9915db41d7dbdba52975
tree 3442fa7374aa58b648de5c5bb4c76a5e3a9769df /R
parent 14865d7ff78db5cf9a3e8626204c8e7ed059c353
[SPARK-20041][DOC] Update docs for NaN handling in approxQuantile
## What changes were proposed in this pull request?

Update docs for NaN handling in approxQuantile.

## How was this patch tested?

Existing tests.

Author: Zheng RuiFeng <ruifengz@foxmail.com>

Closes #17369 from zhengruifeng/doc_quantiles_nan.
Diffstat (limited to 'R')
-rw-r--r-- R/pkg/R/stats.R 3
1 files changed, 2 insertions, 1 deletions
diff --git a/R/pkg/R/stats.R b/R/pkg/R/stats.R
index 8d1d165052..d78a10893f 100644
--- a/R/pkg/R/stats.R
+++ b/R/pkg/R/stats.R
@@ -149,7 +149,8 @@ setMethod("freqItems", signature(x = "SparkDataFrame", cols = "character"),
#' This method implements a variation of the Greenwald-Khanna algorithm (with some speed
#' optimizations). The algorithm was first present in [[http://dx.doi.org/10.1145/375663.375670
#' Space-efficient Online Computation of Quantile Summaries]] by Greenwald and Khanna.
-#' Note that rows containing any NA values will be removed before calculation.
+#' Note that NA values will be ignored in numerical columns before calculation. For
+#' columns only containing NA values, an empty list is returned.
#'
#' @param x A SparkDataFrame.
#' @param cols A single column name, or a list of names for multiple columns.
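
For illustration only, the NA handling described in the updated doc comment can be sketched in base R without a Spark session. The helper `quantiles_ignoring_na` is hypothetical and not part of SparkR; it computes exact quantiles with `stats::quantile` rather than the Greenwald-Khanna summary, so it mirrors only the documented treatment of NA values (ignored before calculation, empty list when nothing remains), not the approximation itself.

```r
# Hypothetical helper, not part of SparkR: mimics the documented NA handling
# using exact quantiles from base R instead of an approximate summary.
quantiles_ignoring_na <- function(values, probabilities) {
  kept <- values[!is.na(values)]  # is.na() is TRUE for both NA and NaN
  if (length(kept) == 0) {
    return(list())                # the column held only NA values
  }
  # type = 1 returns an actual element of the data (inverse empirical CDF)
  as.list(unname(quantile(kept, probs = probabilities, type = 1)))
}

# NA and NaN entries are dropped; quantiles come from the remaining 1, 2, 3.
mixed <- quantiles_ignoring_na(c(1, NA, 3, 2, NaN), c(0, 0.5, 1))

# A column containing only NA values yields an empty list.
all_na <- quantiles_ignoring_na(c(NA_real_, NA_real_), c(0.5))
```

Under the same assumption, the SparkR call being documented would take the form `approxQuantile(df, "col", c(0.5), 0.0)`, where `df` and `"col"` are placeholder names.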