author: Sital Kedia <skedia@fb.com>, 2016-05-27 11:22:39 -0700
committer: Andrew Or <andrew@databricks.com>, 2016-05-27 11:22:39 -0700
commit: ce756daa4f012ebdc5a41bf5a89ff11b6dfdab8c
tree: 964120ed4895c8eaf70b73692664d8fdefc7542a /bin
parent: 5bdbedf2201efa6c34392aa9eff709761f027e1d
[SPARK-15569] Reduce frequency of updateBytesWritten function in Disk…
## What changes were proposed in this pull request?

While profiling a Spark job that spills a large amount of intermediate data, we found that a significant portion of time is spent in the DiskObjectWriter.updateBytesWritten function. Looking at the code, we see that the function is called too frequently to update the number of bytes written to disk. We should reduce the call frequency to avoid this overhead.

## How was this patch tested?

Tested by running the job on a cluster, where this change gave a 20% CPU gain.

Author: Sital Kedia <skedia@fb.com>

Closes #13332 from sitalkedia/DiskObjectWriter.
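
The idea behind the change can be illustrated with a minimal sketch. This is not the patch itself: the class `ThrottledWriter`, the `update_every` parameter, and the helper `run` are hypothetical names chosen for illustration, and the commit message does not state the update interval the patch uses. The sketch only shows the general technique of refreshing a bytes-written metric once every N records instead of on every write, with a final refresh on close so the reported value ends up exact.

```python
import io


class ThrottledWriter:
    """Hypothetical writer that refreshes its bytes-written metric every N records."""

    def __init__(self, stream, update_every=1000):
        self._stream = stream
        self._update_every = update_every   # assumed interval, not taken from the patch
        self._records_since_update = 0
        self.bytes_written = 0              # reported metric; may lag behind the stream
        self.metric_updates = 0             # how many times the costly update ran

    def _update_bytes_written(self):
        # Stand-in for the expensive call: ask the stream for its position.
        self.bytes_written = self._stream.tell()
        self.metric_updates += 1
        self._records_since_update = 0

    def write(self, record: bytes):
        self._stream.write(record)
        self._records_since_update += 1
        # Only refresh the metric once enough records have accumulated.
        if self._records_since_update >= self._update_every:
            self._update_bytes_written()

    def close(self):
        # Final refresh so the metric is exact once writing ends.
        self._update_bytes_written()


def run(update_every, n_records=10_000):
    """Write n_records 8-byte records and return (bytes_written, metric_updates)."""
    writer = ThrottledWriter(io.BytesIO(), update_every=update_every)
    for _ in range(n_records):
        writer.write(b"x" * 8)
    writer.close()
    return writer.bytes_written, writer.metric_updates


if __name__ == "__main__":
    print(run(update_every=1))      # metric refreshed on every record
    print(run(update_every=1000))   # metric refreshed far less often
```

Both runs report the same final byte count, but the throttled run performs the update far fewer times. The trade-off is that the metric can be stale by up to `update_every` records while writing is in progress.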
Diffstat (limited to 'bin')
0 files changed, 0 insertions, 0 deletions