diff options
author | Matei Zaharia <matei@eecs.berkeley.edu> | 2012-09-25 21:46:58 -0700 |
---|---|---|
committer | Matei Zaharia <matei@eecs.berkeley.edu> | 2012-09-25 21:46:58 -0700 |
commit | 051785c7e67b7ba0f2f0b5e078753d3f4f380961 (patch) | |
tree | 5ff31cdbae7a7dd61fbf7f0a080771b3ca850d08 /docs/_config.yml | |
parent | 56c90485fd947d75bbe7aac81593ba42cfe56821 (diff) | |
download | spark-051785c7e67b7ba0f2f0b5e078753d3f4f380961.tar.gz spark-051785c7e67b7ba0f2f0b5e078753d3f4f380961.tar.bz2 spark-051785c7e67b7ba0f2f0b5e078753d3f4f380961.zip |
Several fixes to sampling issues pointed out by Henry Milner:
- takeSample was biased towards earlier partitions
- There were some range errors in takeSample
- SampledRDDs with replacement didn't produce appropriate counts
across partitions (we took exactly frac of each one)
Diffstat (limited to 'docs/_config.yml')
0 files changed, 0 insertions, 0 deletions