diff options
author | Daniel Darabos <darabos.daniel@gmail.com> | 2015-02-11 20:24:17 +0000 |
---|---|---|
committer | Sean Owen <sowen@cloudera.com> | 2015-02-11 20:24:17 +0000 |
commit | 03bf704bf442ac7dd960795295b51957ce972491 (patch) | |
tree | 2522d32cb3b7c2e48049b592c9ee83252872a50f /docs/programming-guide.md | |
parent | b694eb9c2fefeaa33891d3e61f9bea369bc09984 (diff) | |
download | spark-03bf704bf442ac7dd960795295b51957ce972491.tar.gz spark-03bf704bf442ac7dd960795295b51957ce972491.tar.bz2 spark-03bf704bf442ac7dd960795295b51957ce972491.zip |
Remove outdated remark about take(n).
Looking at the code, I believe this remark about `take(n)` computing partitions on the driver is no longer correct. Apologies if I'm wrong.
This came up in http://stackoverflow.com/q/28436559/3318517.
Author: Daniel Darabos <darabos.daniel@gmail.com>
Closes #4533 from darabos/patch-2 and squashes the following commits:
cc80f3a [Daniel Darabos] Remove outdated remark about take(n).
Diffstat (limited to 'docs/programming-guide.md')
-rw-r--r-- | docs/programming-guide.md | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/docs/programming-guide.md b/docs/programming-guide.md index 6b365e83fb..118701549a 100644 --- a/docs/programming-guide.md +++ b/docs/programming-guide.md @@ -975,7 +975,7 @@ for details. </tr> <tr> <td> <b>take</b>(<i>n</i>) </td> - <td> Return an array with the first <i>n</i> elements of the dataset. Note that this is currently not executed in parallel. Instead, the driver program computes all the elements. </td> + <td> Return an array with the first <i>n</i> elements of the dataset. </td> </tr> <tr> <td> <b>takeSample</b>(<i>withReplacement</i>, <i>num</i>, [<i>seed</i>]) </td> |