path: root/docs/running-on-yarn.md
author Matei Zaharia <matei@eecs.berkeley.edu> 2012-09-25 21:46:58 -0700
committer Matei Zaharia <matei@eecs.berkeley.edu> 2012-09-25 21:46:58 -0700
commit 051785c7e67b7ba0f2f0b5e078753d3f4f380961
tree 5ff31cdbae7a7dd61fbf7f0a080771b3ca850d08 /docs/running-on-yarn.md
parent 56c90485fd947d75bbe7aac81593ba42cfe56821
Several fixes to sampling issues pointed out by Henry Milner:
- takeSample was biased towards earlier partitions
- There were some range errors in takeSample
- SampledRDDs with replacement didn't produce appropriate counts across partitions (we took exactly frac of each one)
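
To illustrate the last point: when sampling with replacement, drawing exactly frac of every partition fixes each partition's sample size, whereas a true sample with replacement has counts that vary from partition to partition. One standard way to get that variation is to emit each element a Poisson(frac)-distributed number of times. The sketch below is not the code from this commit; `poisson` and `sample_with_replacement` are hypothetical names, and plain Python lists stand in for partitions.

```python
import math
import random


def poisson(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def sample_with_replacement(partitions, frac, seed):
    """Sample each partition independently (hypothetical helper).

    Every element is repeated a Poisson(frac) number of times, so the
    per-partition sample size is random with mean frac * len(partition),
    instead of being pinned to exactly that value.
    """
    rng = random.Random(seed)
    sampled = []
    for part in partitions:
        out = []
        for item in part:
            out.extend([item] * poisson(frac, rng))
        sampled.append(out)
    return sampled


if __name__ == "__main__":
    parts = [list(range(0, 1000)), list(range(1000, 2000))]
    result = sample_with_replacement(parts, 0.5, seed=42)
    # Sizes hover around 500 per partition but are not forced to be equal.
    print([len(p) for p in result])
```

Because each element is handled independently, no partition needs to know the size of any other, which is why this approach fits partition-at-a-time processing.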
Diffstat (limited to 'docs/running-on-yarn.md')
0 files changed, 0 insertions, 0 deletions