diff options
author | Narine Kokhlikyan <narine@slice.com> | 2016-07-01 13:55:13 -0700 |
---|---|---|
committer | Shivaram Venkataraman <shivaram@cs.berkeley.edu> | 2016-07-01 13:55:13 -0700 |
commit | 26afb4ce4099e7942f8db1ead3817ed8fbf71ce3 (patch) | |
tree | a43b0b4dfa9278f8d4f5492b40cfde3c6922c16b /yarn | |
parent | c55397652ad1c6d047a8b8eb7fd92a8a1dc66306 (diff) | |
download | spark-26afb4ce4099e7942f8db1ead3817ed8fbf71ce3.tar.gz spark-26afb4ce4099e7942f8db1ead3817ed8fbf71ce3.tar.bz2 spark-26afb4ce4099e7942f8db1ead3817ed8fbf71ce3.zip |
[SPARK-16012][SPARKR] Implement gapplyCollect which will apply a R function on each group similar to gapply and collect the result back to R data.frame
## What changes were proposed in this pull request?
gapplyCollect() does gapply() on a SparkDataFrame and collect the result back to R. Compared to gapply() + collect(), gapplyCollect() offers performance optimization as well as programming convenience, as no schema is needed to be provided.
This is similar to dapplyCollect().
## How was this patch tested?
Added test cases for gapplyCollect similar to dapplyCollect
Author: Narine Kokhlikyan <narine@slice.com>
Closes #13760 from NarineK/gapplyCollect.
Diffstat (limited to 'yarn')
0 files changed, 0 insertions, 0 deletions