aboutsummaryrefslogtreecommitdiff
path: root/.gitattributes
diff options
context:
space:
mode:
authorDavies Liu <davies@databricks.com>2016-02-10 23:23:01 -0800
committerDavies Liu <davies.liu@gmail.com>2016-02-10 23:23:01 -0800
commit8f744fe3d931c2380613b8e5bafa1bb1fd292839 (patch)
tree9f2b217223475d6cf242146b628720cfe51f0ef9 /.gitattributes
parentb5761d150b66ee0ae5f1be897d9d7a1abb039884 (diff)
downloadspark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.tar.gz
spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.tar.bz2
spark-8f744fe3d931c2380613b8e5bafa1bb1fd292839.zip
[SPARK-13234] [SQL] remove duplicated SQL metrics
For lots of SQL operators, we have metrics for both of input and output, the number of input rows should be exactly the number of output rows of child, we could only have metrics for output rows. After we improved the performance using whole stage codegen, the overhead of SQL metrics are not trivial anymore, we should avoid that if it's not necessary. This PR remove all the SQL metrics for number of input rows, add SQL metric of number of output rows for all LeafNode. All remove the SQL metrics from those operators that have the same number of rows from input and output (for example, Projection, we may don't need that). The new SQL UI will looks like: ![metrics](https://cloud.githubusercontent.com/assets/40902/12965227/63614e5e-d009-11e5-88b3-84fea04f9c20.png) Author: Davies Liu <davies@databricks.com> Closes #11163 from davies/remove_metrics.
Diffstat (limited to '.gitattributes')
0 files changed, 0 insertions, 0 deletions