[SPARK-16620][CORE] Add back the tokenization process in `RDD.pipe(command: String)` - spark

diff options

author	Liwei Lin <lwlin7@gmail.com>	2016-07-19 10:24:48 -0700
committer	Reynold Xin <rxin@databricks.com>	2016-07-19 10:24:48 -0700
commit	0bd76e872b60cb80295fc12654e370cf22390056 (patch)
tree	d5377d139b31babe5d6c3bae91f6345201d91831 /data
parent	670891496a82538a5e2bf981a4044fb6f4cbb062 (diff)
download	spark-0bd76e872b60cb80295fc12654e370cf22390056.tar.gz spark-0bd76e872b60cb80295fc12654e370cf22390056.tar.bz2 spark-0bd76e872b60cb80295fc12654e370cf22390056.zip

[SPARK-16620][CORE] Add back the tokenization process in `RDD.pipe(command: String)`

## What changes were proposed in this pull request? Currently `RDD.pipe(command: String)`: - works only when the command is specified without any options, such as `RDD.pipe("wc")` - does NOT work when the command is specified with some options, such as `RDD.pipe("wc -l")` This is a regression from Spark 1.6. This patch adds back the tokenization process in `RDD.pipe(command: String)` to fix this regression. ## How was this patch tested? Added a test which: - would pass in `1.6` - _[prior to this patch]_ would fail in `master` - _[after this patch]_ would pass in `master` Author: Liwei Lin <lwlin7@gmail.com> Closes #14256 from lw-lin/rdd-pipe.

Diffstat (limited to 'data')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: