aboutsummaryrefslogtreecommitdiff
path: root/mllib
diff options
context:
space:
mode:
authorhyukjinkwon <gurwls223@gmail.com>2016-06-21 21:58:38 -0700
committerDavies Liu <davies.liu@gmail.com>2016-06-21 21:58:38 -0700
commit7580f3041a1a3757a0b14b9d8afeb720f261fff6 (patch)
treea4da4d014c9a95d4f57e82e9f8081e2a21eedd44 /mllib
parentd77c4e6e2eef24f4276c38b3add8c29bb885f4db (diff)
downloadspark-7580f3041a1a3757a0b14b9d8afeb720f261fff6.tar.gz
spark-7580f3041a1a3757a0b14b9d8afeb720f261fff6.tar.bz2
spark-7580f3041a1a3757a0b14b9d8afeb720f261fff6.zip
[SPARK-16104] [SQL] Do not creaate CSV writer object for every flush when writing
## What changes were proposed in this pull request? This PR let `CsvWriter` object is not created for each time but able to be reused. This way was taken after from JSON data source. Original `CsvWriter` was being created for each row but it was enhanced in https://github.com/apache/spark/pull/13229. However, it still creates `CsvWriter` object for each `flush()` in `LineCsvWriter`. It seems it does not have to close the object and re-create this for every flush. It follows the original logic as it is but `CsvWriter` is reused by reseting `CharArrayWriter`. ## How was this patch tested? Existing tests should cover this. Author: hyukjinkwon <gurwls223@gmail.com> Closes #13809 from HyukjinKwon/write-perf.
Diffstat (limited to 'mllib')
0 files changed, 0 insertions, 0 deletions