From 37c199bbb098c68efecb4f8bd10b5cb8dfd9da3b Mon Sep 17 00:00:00 2001
From: Josh Rosen
Date: Fri, 28 Sep 2012 23:44:19 -0700
Subject: Allow controlling number of splits in distinct().

---
 docs/scala-programming-guide.md | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/docs/scala-programming-guide.md b/docs/scala-programming-guide.md
index a370bf3ddc..db761d7df1 100644
--- a/docs/scala-programming-guide.md
+++ b/docs/scala-programming-guide.md
@@ -147,6 +147,10 @@ The following tables list the transformations and actions currently supported (s
 union(otherDataset)
 Return a new dataset that contains the union of the elements in the source dataset and the argument.
+
+ distinct([numTasks]))
+ Return a new dataset that contains the distinct elements of the source dataset.
+
 groupByKey([numTasks])
 When called on a dataset of (K, V) pairs, returns a dataset of (K, Seq[V]) pairs.
-- cgit v1.2.3
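
Note on the documented behaviour: the `distinct([numTasks])` entry says only that the result contains the distinct elements of the source dataset, with an optional task count. As a rough illustration of why a task count can be applied to deduplication at all, here is a minimal sketch using plain Scala collections in place of a distributed dataset. The object `DistinctSketch` and the helper `distinctWithTasks` are hypothetical names made up for this example. They are not part of Spark, and this is not claimed to be how the patch implements `distinct`.

```scala
// Sketch only: plain Scala collections stand in for a distributed dataset.
// `distinctWithTasks` is a hypothetical helper, not part of Spark's API.
object DistinctSketch {
  def distinctWithTasks[T](data: Seq[T], numTasks: Int): Seq[T] = {
    require(numTasks > 0, "numTasks must be positive")
    // Route each element to a bucket by hash, so equal elements
    // always land in the same bucket.
    val buckets = data.groupBy(x => java.lang.Math.floorMod(x.hashCode, numTasks))
    // Deduplicate each bucket independently (one bucket per "task"),
    // then concatenate the per-bucket results.
    (0 until numTasks).flatMap(i => buckets.getOrElse(i, Seq.empty).distinct)
  }

  def main(args: Array[String]): Unit = {
    val words = Seq("a", "b", "a", "c", "b")
    val result = distinctWithTasks(words, numTasks = 2)
    println(result.sorted.mkString(", ")) // prints "a, b, c"
  }
}
```

Because equal elements hash to the same bucket, each bucket can be deduplicated without looking at any other, so the set of elements in the result does not depend on `numTasks`. Only the amount of parallelism and the order of the output change.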