diff options
author | Davies Liu <davies@databricks.com> | 2016-06-24 14:35:34 -0700 |
---|---|---|
committer | Davies Liu <davies.liu@gmail.com> | 2016-06-24 14:35:34 -0700 |
commit | d48935400ca47275f677b527c636976af09332c8 (patch) | |
tree | cbdfbdcbeb2b8df6ee9a7e6399bf5220fe31c8af /python/pyspark/cloudpickle.py | |
parent | a4851ed05053a9b7545a258c9159fd529225c455 (diff) | |
download | spark-d48935400ca47275f677b527c636976af09332c8.tar.gz spark-d48935400ca47275f677b527c636976af09332c8.tar.bz2 spark-d48935400ca47275f677b527c636976af09332c8.zip |
[SPARK-16077] [PYSPARK] catch the exception from pickle.whichmodule()
## What changes were proposed in this pull request?
In the case that we don't know which module a object came from, will call pickle.whichmodule() to go throught all the loaded modules to find the object, which could fail because some modules, for example, six, see https://bitbucket.org/gutworth/six/issues/63/importing-six-breaks-pickling
We should ignore the exception here, use `__main__` as the module name (it means we can't find the module).
## How was this patch tested?
Manual tested. Can't have a unit test for this.
Author: Davies Liu <davies@databricks.com>
Closes #13788 from davies/whichmodule.
Diffstat (limited to 'python/pyspark/cloudpickle.py')
-rw-r--r-- | python/pyspark/cloudpickle.py | 14 |
1 files changed, 12 insertions, 2 deletions
diff --git a/python/pyspark/cloudpickle.py b/python/pyspark/cloudpickle.py index e56e22a9b9..822ae46e45 100644 --- a/python/pyspark/cloudpickle.py +++ b/python/pyspark/cloudpickle.py @@ -169,7 +169,12 @@ class CloudPickler(Pickler): if name is None: name = obj.__name__ - modname = pickle.whichmodule(obj, name) + try: + # whichmodule() could fail, see + # https://bitbucket.org/gutworth/six/issues/63/importing-six-breaks-pickling + modname = pickle.whichmodule(obj, name) + except Exception: + modname = None # print('which gives %s %s %s' % (modname, obj, name)) try: themodule = sys.modules[modname] @@ -326,7 +331,12 @@ class CloudPickler(Pickler): modname = getattr(obj, "__module__", None) if modname is None: - modname = pickle.whichmodule(obj, name) + try: + # whichmodule() could fail, see + # https://bitbucket.org/gutworth/six/issues/63/importing-six-breaks-pickling + modname = pickle.whichmodule(obj, name) + except Exception: + modname = '__main__' if modname == '__main__': themodule = None |