diff options
author | Liwei Lin <lwlin7@gmail.com> | 2016-07-19 10:24:48 -0700 |
---|---|---|
committer | Reynold Xin <rxin@databricks.com> | 2016-07-19 10:24:48 -0700 |
commit | 0bd76e872b60cb80295fc12654e370cf22390056 (patch) | |
tree | d5377d139b31babe5d6c3bae91f6345201d91831 /data | |
parent | 670891496a82538a5e2bf981a4044fb6f4cbb062 (diff) | |
download | spark-0bd76e872b60cb80295fc12654e370cf22390056.tar.gz spark-0bd76e872b60cb80295fc12654e370cf22390056.tar.bz2 spark-0bd76e872b60cb80295fc12654e370cf22390056.zip |
[SPARK-16620][CORE] Add back the tokenization process in `RDD.pipe(command: String)`
## What changes were proposed in this pull request?
Currently `RDD.pipe(command: String)`:
- works only when the command is specified without any options, such as `RDD.pipe("wc")`
- does NOT work when the command is specified with some options, such as `RDD.pipe("wc -l")`
This is a regression from Spark 1.6.
This patch adds back the tokenization process in `RDD.pipe(command: String)` to fix this regression.
## How was this patch tested?
Added a test which:
- would pass in `1.6`
- _[prior to this patch]_ would fail in `master`
- _[after this patch]_ would pass in `master`
Author: Liwei Lin <lwlin7@gmail.com>
Closes #14256 from lw-lin/rdd-pipe.
Diffstat (limited to 'data')
0 files changed, 0 insertions, 0 deletions