aboutsummaryrefslogtreecommitdiff
path: root/python
diff options
context:
space:
mode:
authorDavies Liu <davies@databricks.com>2016-06-24 14:35:34 -0700
committerDavies Liu <davies.liu@gmail.com>2016-06-24 14:35:34 -0700
commitd48935400ca47275f677b527c636976af09332c8 (patch)
treecbdfbdcbeb2b8df6ee9a7e6399bf5220fe31c8af /python
parenta4851ed05053a9b7545a258c9159fd529225c455 (diff)
downloadspark-d48935400ca47275f677b527c636976af09332c8.tar.gz
spark-d48935400ca47275f677b527c636976af09332c8.tar.bz2
spark-d48935400ca47275f677b527c636976af09332c8.zip
[SPARK-16077] [PYSPARK] catch the exception from pickle.whichmodule()
## What changes were proposed in this pull request? In the case that we don't know which module a object came from, will call pickle.whichmodule() to go throught all the loaded modules to find the object, which could fail because some modules, for example, six, see https://bitbucket.org/gutworth/six/issues/63/importing-six-breaks-pickling We should ignore the exception here, use `__main__` as the module name (it means we can't find the module). ## How was this patch tested? Manual tested. Can't have a unit test for this. Author: Davies Liu <davies@databricks.com> Closes #13788 from davies/whichmodule.
Diffstat (limited to 'python')
-rw-r--r--python/pyspark/cloudpickle.py14
1 files changed, 12 insertions, 2 deletions
diff --git a/python/pyspark/cloudpickle.py b/python/pyspark/cloudpickle.py
index e56e22a9b9..822ae46e45 100644
--- a/python/pyspark/cloudpickle.py
+++ b/python/pyspark/cloudpickle.py
@@ -169,7 +169,12 @@ class CloudPickler(Pickler):
if name is None:
name = obj.__name__
- modname = pickle.whichmodule(obj, name)
+ try:
+ # whichmodule() could fail, see
+ # https://bitbucket.org/gutworth/six/issues/63/importing-six-breaks-pickling
+ modname = pickle.whichmodule(obj, name)
+ except Exception:
+ modname = None
# print('which gives %s %s %s' % (modname, obj, name))
try:
themodule = sys.modules[modname]
@@ -326,7 +331,12 @@ class CloudPickler(Pickler):
modname = getattr(obj, "__module__", None)
if modname is None:
- modname = pickle.whichmodule(obj, name)
+ try:
+ # whichmodule() could fail, see
+ # https://bitbucket.org/gutworth/six/issues/63/importing-six-breaks-pickling
+ modname = pickle.whichmodule(obj, name)
+ except Exception:
+ modname = '__main__'
if modname == '__main__':
themodule = None