path: root/dev/sparktestsupport/modules.py
author: Joseph Batchik <josephbatchik@gmail.com> 2015-07-29 23:35:55 -0700
committer: Reynold Xin <rxin@databricks.com> 2015-07-29 23:35:55 -0700
commit: 1221849f91739454b8e495889cba7498ba8beea7
tree: 267b356aaa35da1189874af2a3dd1af3950cdbf4 /dev/sparktestsupport/modules.py
parent: e127ec34d58ceb0a9d45748c2f2918786ba0a83d
[SPARK-8005][SQL] Input file name
Users can now get the file name of the partition being read in. A thread local variable is in `SQLNewHadoopRDD` and is set when the partition is computed. `SQLNewHadoopRDD` is moved to core so that the catalyst package can reach it.

This supports: `df.select(inputFileName())` and `sqlContext.sql("select input_file_name() from table")`

Author: Joseph Batchik <josephbatchik@gmail.com>

Closes #7743 from JDrit/input_file_name and squashes the following commits:

abb8609 [Joseph Batchik] fixed failing test and changed the default value to be an empty string
d2f323d [Joseph Batchik] updates per review
102061f [Joseph Batchik] updates per review
75313f5 [Joseph Batchik] small fixes
c7f7b5a [Joseph Batchik] addeding input file name to Spark SQL
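
The thread-local approach described in the commit message can be illustrated with a small standalone sketch. This is not Spark's code: the names `set_input_file_name`, `unset_input_file_name`, `input_file_name`, `compute_partition`, and `run_partitions` are hypothetical, and resetting the value after the partition finishes is an assumption made for the example. It only shows the general idea: the thread computing a partition records the file it is reading, an expression evaluated on that same thread can read the name back, and the default is an empty string.

```python
import threading

# Hypothetical stand-in for the thread-local holder; not Spark's actual class.
_input_file = threading.local()


def set_input_file_name(name):
    """Record the file being read by the current thread."""
    _input_file.name = name


def unset_input_file_name():
    """Reset to the default (assumed here to happen once the partition is done)."""
    _input_file.name = ""


def input_file_name():
    """Return the current thread's file name, or an empty string if none was set."""
    return getattr(_input_file, "name", "")


def compute_partition(path, rows):
    """Simulate computing one partition: set the name, then tag each row with it."""
    set_input_file_name(path)
    try:
        return [(row, input_file_name()) for row in rows]
    finally:
        unset_input_file_name()


def run_partitions(partitions):
    """Compute each partition on its own thread and collect the tagged rows."""
    results = {}

    def worker(path, rows):
        results[path] = compute_partition(path, rows)

    threads = [
        threading.Thread(target=worker, args=(path, rows))
        for path, rows in partitions.items()
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

Because the value lives in thread-local storage, partitions computed concurrently on different threads each see only their own file name, and a thread that never computed a partition sees the empty-string default.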
Diffstat (limited to 'dev/sparktestsupport/modules.py')
0 files changed, 0 insertions, 0 deletions