[SPARK-11371] Make "mean" an alias for "avg" operator - spark

diff options

author	tedyu <yuzhihong@gmail.com>	2015-11-02 13:51:53 -0800
committer	Yin Huai <yhuai@databricks.com>	2015-11-02 13:51:53 -0800
commit	db11ee5e56e5fac59895c772a9a87c5ac86888ef (patch)
tree	66994bfbff8688a17c06d2968795ba5c926d494b /docs/mllib-naive-bayes.md
parent	33ae7a35daa86c34f1f9f72f997e0c2d4cd8abec (diff)
download	spark-db11ee5e56e5fac59895c772a9a87c5ac86888ef.tar.gz spark-db11ee5e56e5fac59895c772a9a87c5ac86888ef.tar.bz2 spark-db11ee5e56e5fac59895c772a9a87c5ac86888ef.zip

[SPARK-11371] Make "mean" an alias for "avg" operator

From Reynold in the thread 'Exception when using some aggregate operators' (http://search-hadoop.com/m/q3RTt0xFr22nXB4/): I don't think these are bugs. The SQL standard for average is "avg", not "mean". Similarly, a distinct count is supposed to be written as "count(distinct col)", not "countDistinct(col)". We can, however, make "mean" an alias for "avg" to improve compatibility between DataFrame and SQL. Author: tedyu <yuzhihong@gmail.com> Closes #9332 from ted-yu/master.

Diffstat (limited to 'docs/mllib-naive-bayes.md')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: