diff options
author | Davies Liu <davies@databricks.com> | 2016-02-10 23:23:01 -0800 |
---|---|---|
committer | Davies Liu <davies.liu@gmail.com> | 2016-02-10 23:23:01 -0800 |
commit | 8f744fe3d931c2380613b8e5bafa1bb1fd292839 (patch) | |
tree | 9f2b217223475d6cf242146b628720cfe51f0ef9 /build | |
parent | b5761d150b66ee0ae5f1be897d9d7a1abb039884 (diff) | |
download | spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.tar.gz spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.tar.bz2 spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.zip |
[SPARK-13234] [SQL] remove duplicated SQL metrics
For lots of SQL operators, we have metrics for both of input and output, the number of input rows should be exactly the number of output rows of child, we could only have metrics for output rows.
After we improved the performance using whole stage codegen, the overhead of SQL metrics are not trivial anymore, we should avoid that if it's not necessary.
This PR remove all the SQL metrics for number of input rows, add SQL metric of number of output rows for all LeafNode. All remove the SQL metrics from those operators that have the same number of rows from input and output (for example, Projection, we may don't need that).
The new SQL UI will looks like:
![metrics](https://cloud.githubusercontent.com/assets/40902/12965227/63614e5e-d009-11e5-88b3-84fea04f9c20.png)
Author: Davies Liu <davies@databricks.com>
Closes #11163 from davies/remove_metrics.
Diffstat (limited to 'build')
0 files changed, 0 insertions, 0 deletions