diff options
author | Nong Li <nong@databricks.com> | 2015-12-29 18:47:41 -0800 |
---|---|---|
committer | Reynold Xin <rxin@databricks.com> | 2015-12-29 18:47:41 -0800 |
commit | b600bccf41a7b1958e33d8301a19214e6517e388 (patch) | |
tree | 93a01d5b6a39d2c2506c5581e6174ab9ffa8ba6d /dev/merge_spark_pr.py | |
parent | 270a659584b6c1c304a9f9a331c56287672e00b0 (diff) | |
download | spark-b600bccf41a7b1958e33d8301a19214e6517e388.tar.gz spark-b600bccf41a7b1958e33d8301a19214e6517e388.tar.bz2 spark-b600bccf41a7b1958e33d8301a19214e6517e388.zip |
[SPARK-12362][SQL][WIP] Inline Hive Parser
This is a WIP. The PR has been taken over from nongli (see https://github.com/apache/spark/pull/10420). I have removed some additional dead code, and fixed a few issues which were caused by the fact that the inlined Hive parser is newer than the Hive parser we currently use in Spark.
I am submitting this PR in order to get some feedback and testing done. There is quite a bit of work to do:
- [ ] Get it to pass jenkins build/test.
- [ ] Aknowledge Hive-project for using their parser.
- [ ] Refactorings between HiveQl and the java classes.
- [ ] Create our own ASTNode and integrate the current implicit extentions.
- [ ] Move remaining ```SemanticAnalyzer``` and ```ParseUtils``` functionality to ```HiveQl```.
- [ ] Removing Hive dependencies from the parser. This will require some edits in the grammar files.
- [ ] Introduce our own context which needs to contain a ```TokenRewriteStream```.
- [ ] Add ```useSQL11ReservedKeywordsForIdentifier``` and ```allowQuotedId``` to the catalyst or sql configuration.
- [ ] Remove ```HiveConf``` from grammar files &HiveQl, and pass in our own configuration.
- [ ] Moving the parser into sql/core.
cc nongli rxin
Author: Herman van Hovell <hvanhovell@questtec.nl>
Author: Nong Li <nong@databricks.com>
Author: Nong Li <nongli@gmail.com>
Closes #10509 from hvanhovell/SPARK-12362.
Diffstat (limited to 'dev/merge_spark_pr.py')
0 files changed, 0 insertions, 0 deletions