path: root/python/pyspark/sql/streaming.py
author Narine Kokhlikyan <narine@slice.com> 2016-07-01 13:55:13 -0700
committer Shivaram Venkataraman <shivaram@cs.berkeley.edu> 2016-07-01 13:55:13 -0700
commit 26afb4ce4099e7942f8db1ead3817ed8fbf71ce3
tree a43b0b4dfa9278f8d4f5492b40cfde3c6922c16b /python/pyspark/sql/streaming.py
parent c55397652ad1c6d047a8b8eb7fd92a8a1dc66306
[SPARK-16012][SPARKR] Implement gapplyCollect, which applies an R function to each group (similar to gapply) and collects the result back to an R data.frame
## What changes were proposed in this pull request?

gapplyCollect() runs gapply() on a SparkDataFrame and collects the result back to R. Compared to gapply() + collect(), gapplyCollect() offers a performance optimization as well as programming convenience, since no schema needs to be provided. This is similar to dapplyCollect().

## How was this patch tested?

Added test cases for gapplyCollect similar to dapplyCollect.

Author: Narine Kokhlikyan <narine@slice.com>

Closes #13760 from NarineK/gapplyCollect.
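
To illustrate the group-apply-collect semantics described above, here is a minimal sketch in plain Python. It is not the SparkR implementation and does not use Spark at all: `gapply_collect`, `mean_salary`, and the sample rows are hypothetical names and data chosen for illustration. It only shows the idea that a per-group function returns rows, and the caller receives the combined local result without declaring an output schema.

```python
from collections import defaultdict


def gapply_collect(rows, key_cols, func):
    """Hypothetical local model: group rows by key_cols, apply
    func(key, group_rows) to each group, and return all output rows."""
    groups = defaultdict(list)
    for row in rows:
        # The grouping key is the tuple of values in the key columns.
        key = tuple(row[c] for c in key_cols)
        groups[key].append(row)

    result = []
    for key, group in groups.items():
        # func returns a list of rows; no output schema is declared up front.
        result.extend(func(key, group))
    return result


def mean_salary(key, group):
    """Example per-group function: one output row per group."""
    total = sum(r["salary"] for r in group)
    return [{"dept": key[0], "mean_salary": total / len(group)}]


# Illustrative input data (assumed, not from the commit).
rows = [
    {"dept": "a", "salary": 10},
    {"dept": "b", "salary": 30},
    {"dept": "a", "salary": 20},
]

out = gapply_collect(rows, ["dept"], mean_salary)
```

In this model the shape of the output is whatever the per-group function returns, which mirrors the convenience the commit message attributes to gapplyCollect(): the caller does not supply a schema.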
Diffstat (limited to 'python/pyspark/sql/streaming.py')
0 files changed, 0 insertions, 0 deletions