author | Josh Rosen <joshrosen@databricks.com> | 2015-05-18 21:53:44 -0700 |
---|---|---|
committer | Reynold Xin <rxin@databricks.com> | 2015-05-18 21:53:52 -0700 |
commit | 99436bd040cf477f475fa14fcf3a730350085c51 (patch) | |
tree | beab5dffc884b35249b7b55734eb620a9984d0ce /streaming | |
parent | 914ecd0504b68f7b01e825d8661fd208d3e40f1a (diff) | |
[SPARK-7687] [SQL] DataFrame.describe() should cast all aggregates to String
In `DataFrame.describe()`, the `count` aggregate produces an integer, the `avg` and `stdev` aggregates produce doubles, and the `min` and `max` aggregates produce values whose type depends on the column they are applied to. We should therefore cast all aggregate results to String so that `describe()`'s output types match its declared output schema.
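The type mismatch described above can be illustrated with a minimal, library-free sketch. This is plain Python, not Spark's implementation: the `describe` function and its row layout are hypothetical and only mimic the idea of computing aggregates of different types and casting each one to a string so that every row fits a single all-string schema.

```python
import statistics


def describe(values):
    """Hypothetical stand-in for a describe()-style summary of one numeric column.

    Each aggregate naturally has a different type, so every result is cast
    to str to keep the output consistent with an all-string schema.
    """
    raw = [
        ("count", len(values)),                # always an int
        ("mean", statistics.fmean(values)),    # always a float
        ("stddev", statistics.stdev(values)),  # sample standard deviation, a float here
        ("min", min(values)),                  # same type as the input elements
        ("max", max(values)),                  # same type as the input elements
    ]
    # Cast every aggregate to a string so all rows share one value type.
    return [(name, str(value)) for name, value in raw]


if __name__ == "__main__":
    for name, value in describe([1, 2, 3, 4]):
        print(name, value)
```

Without the final cast, the `count` row would hold an integer while the `mean` row holds a float, which is the kind of mismatch against a declared string schema that this change addresses.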
Author: Josh Rosen <joshrosen@databricks.com>
Closes #6218 from JoshRosen/SPARK-7687 and squashes the following commits:
146b615 [Josh Rosen] Fix R test.
2974bd5 [Josh Rosen] Cast to string type instead
f206580 [Josh Rosen] Cast to double to fix SPARK-7687
307ecbf [Josh Rosen] Add failing regression test for SPARK-7687
(cherry picked from commit c9fa870a6de3f7d0903fa7a75ea5ffb6a2fcd174)
Signed-off-by: Reynold Xin <rxin@databricks.com>
Diffstat (limited to 'streaming')
0 files changed, 0 insertions, 0 deletions