diff options
author | gatorsmile <gatorsmile@gmail.com> | 2016-03-02 09:59:22 -0800 |
---|---|---|
committer | Davies Liu <davies.liu@gmail.com> | 2016-03-02 09:59:22 -0800 |
commit | 8f8d8a2315514cd1f3609bc06e5cf6e6d06fdd91 (patch) | |
tree | 2f413b57d7688c3af91b926bbdb0a9f545c1fba7 /streaming | |
parent | d8afd45f8949e0914ce4bd56d832b1158e3c9220 (diff) | |
download | spark-8f8d8a2315514cd1f3609bc06e5cf6e6d06fdd91.tar.gz spark-8f8d8a2315514cd1f3609bc06e5cf6e6d06fdd91.tar.bz2 spark-8f8d8a2315514cd1f3609bc06e5cf6e6d06fdd91.zip |
[SPARK-13609] [SQL] Support Column Pruning for MapPartitions
#### What changes were proposed in this pull request?
This PR is to prune unnecessary columns when the operator is `MapPartitions`. The solution is to add an extra `Project` in the child node.
For the other two operators `AppendColumns` and `MapGroups`, it sounds doable. More discussions are required. The major reason is the current implementation of the `inputPlan` of `groupBy` is based on the child of `AppendColumns`. It might be a bug? Thus, will submit a separate PR.
#### How was this patch tested?
Added a test case in ColumnPruningSuite to verify the rule. Added another test case in DatasetSuite.scala to verify the data.
Author: gatorsmile <gatorsmile@gmail.com>
Closes #11460 from gatorsmile/datasetPruningNew.
Diffstat (limited to 'streaming')
0 files changed, 0 insertions, 0 deletions