[SPARK-13234] [SQL] remove duplicated SQL metrics - spark

diff options

author	Davies Liu <davies@databricks.com>	2016-02-10 23:23:01 -0800
committer	Davies Liu <davies.liu@gmail.com>	2016-02-10 23:23:01 -0800
commit	8f744fe3d931c2380613b8e5bafa1bb1fd292839 (patch)
tree	9f2b217223475d6cf242146b628720cfe51f0ef9 /.gitattributes
parent	b5761d150b66ee0ae5f1be897d9d7a1abb039884 (diff)
download	spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.tar.gz spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.tar.bz2 spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.zip

[SPARK-13234] [SQL] remove duplicated SQL metrics

For lots of SQL operators, we have metrics for both of input and output, the number of input rows should be exactly the number of output rows of child, we could only have metrics for output rows. After we improved the performance using whole stage codegen, the overhead of SQL metrics are not trivial anymore, we should avoid that if it's not necessary. This PR remove all the SQL metrics for number of input rows, add SQL metric of number of output rows for all LeafNode. All remove the SQL metrics from those operators that have the same number of rows from input and output (for example, Projection, we may don't need that). The new SQL UI will looks like: ![metrics](https://cloud.githubusercontent.com/assets/40902/12965227/63614e5e-d009-11e5-88b3-84fea04f9c20.png) Author: Davies Liu <davies@databricks.com> Closes #11163 from davies/remove_metrics.

Diffstat (limited to '.gitattributes')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: