[SPARK-16012][SPARKR] Implement gapplyCollect which will apply a R function on each group similar to gapply and collect the result back to R data.frame - spark

author	Narine Kokhlikyan <narine@slice.com>	2016-07-01 13:55:13 -0700
committer	Shivaram Venkataraman <shivaram@cs.berkeley.edu>	2016-07-01 13:55:13 -0700
commit	26afb4ce4099e7942f8db1ead3817ed8fbf71ce3 (patch)
tree	a43b0b4dfa9278f8d4f5492b40cfde3c6922c16b /python/pyspark/sql/streaming.py
parent	c55397652ad1c6d047a8b8eb7fd92a8a1dc66306 (diff)
download	spark-26afb4ce4099e7942f8db1ead3817ed8fbf71ce3.tar.gz spark-26afb4ce4099e7942f8db1ead3817ed8fbf71ce3.tar.bz2 spark-26afb4ce4099e7942f8db1ead3817ed8fbf71ce3.zip


## What changes were proposed in this pull request?

gapplyCollect() runs gapply() on a SparkDataFrame and collects the result back to R. Compared to gapply() + collect(), gapplyCollect() offers a performance optimization as well as programming convenience, since no schema needs to be provided. This is similar to dapplyCollect().

## How was this patch tested?

Added test cases for gapplyCollect, similar to those for dapplyCollect.

Author: Narine Kokhlikyan <narine@slice.com>

Closes #13760 from NarineK/gapplyCollect.
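The difference between the two call styles can be sketched as follows. This is a minimal illustration rather than code from the patch: the sample data, the column names `a`, `b`, `c`, and the output names `key_a`, `key_c`, `mean_b` are assumptions chosen for the example, and it assumes a local Spark 2.x installation with the SparkR package available.

```r
library(SparkR)
sparkR.session()  # assumes Spark 2.x with SparkR on the library path

# Illustrative data (not from the patch): two groups keyed by (a, c).
df <- createDataFrame(
  list(list(1L, 1, "1"), list(1L, 2, "1"), list(3L, 3, "3")),
  c("a", "b", "c"))

# Per-group function: receives the grouping key and that group's rows
# as a local R data.frame, and returns a local R data.frame.
group_mean <- function(key, x) {
  y <- data.frame(key, mean(x$b), stringsAsFactors = FALSE)
  colnames(y) <- c("key_a", "key_c", "mean_b")
  y
}

# gapply() + collect(): the output schema must be declared up front.
schema <- structType(
  structField("key_a", "integer"),
  structField("key_c", "string"),
  structField("mean_b", "double"))
via_gapply <- collect(gapply(df, c("a", "c"), group_mean, schema))

# gapplyCollect(): no schema argument; the result is an R data.frame
# whose column names come from what the function returns.
result <- gapplyCollect(df, c("a", "c"), group_mean)
```

Row order in the collected result is not guaranteed, so sort by a key column before comparing the two outputs.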

Diffstat (limited to 'python/pyspark/sql/streaming.py')

0 files changed, 0 insertions, 0 deletions
