path: root/python/pyspark/rdd.py
author Davies Liu <davies@databricks.com> 2015-04-02 12:18:33 -0700
committer Josh Rosen <joshrosen@databricks.com> 2015-04-02 12:18:33 -0700
commit 0cce5451adfc6bf4661bcf67aca3db26376455fe
tree 145c47ad89bf382ce722fbfdfc10e16dae0a55ef /python/pyspark/rdd.py
parent 424e987dfebbbaa37f4496d44090d469a931ce76
[SPARK-6667] [PySpark] remove setReuseAddress
Setting the reuse-address option on the server-side socket caused the server to fail to acknowledge incoming connections, so this PR removes it. The PR also retries once after a timeout and adds a timeout on the client side.

Author: Davies Liu <davies@databricks.com>

Closes #5324 from davies/collect_hang and squashes the following commits:

e5a51a2 [Davies Liu] remove setReuseAddress
7977c2f [Davies Liu] do retry on client side
b838f35 [Davies Liu] retry after timeout
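
For illustration only, here is a minimal sketch of the client-side idea described above: set a timeout on the socket before connecting, and retry once if the attempt times out. The helper name `read_from_local_port` and its `timeout` and `retries` parameters are hypothetical and are not part of pyspark. The actual change in this commit is the single `sock.settimeout(3)` line shown in the diff, and where the PR places its retry logic may differ from this sketch.

```python
import socket


def read_from_local_port(port, timeout=3.0, retries=1):
    """Connect to localhost:port and read until EOF, retrying after a timeout.

    Hypothetical helper for illustration; not part of pyspark.
    """
    last_error = None
    for _ in range(retries + 1):
        sock = socket.socket()
        # The timeout applies to connect() and to every later recv() call.
        sock.settimeout(timeout)
        try:
            sock.connect(("localhost", port))
            chunks = []
            while True:
                chunk = sock.recv(65536)
                if not chunk:  # the peer closed the connection
                    break
                chunks.append(chunk)
            return b"".join(chunks)
        except socket.timeout as exc:
            last_error = exc  # abandon this attempt and try again
        finally:
            sock.close()
    # Every attempt timed out; surface the last timeout to the caller.
    raise last_error
```

Without a timeout, a client whose connection is never acknowledged by the server would block indefinitely, which matches the hang the branch name `collect_hang` suggests.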
Diffstat (limited to 'python/pyspark/rdd.py')
-rw-r--r-- python/pyspark/rdd.py 1
1 files changed, 1 insertions, 0 deletions
diff --git a/python/pyspark/rdd.py b/python/pyspark/rdd.py
index c337a43c8a..2d05611321 100644
--- a/python/pyspark/rdd.py
+++ b/python/pyspark/rdd.py
@@ -113,6 +113,7 @@ def _parse_memory(s):
 def _load_from_socket(port, serializer):
     sock = socket.socket()
+    sock.settimeout(3)
     try:
         sock.connect(("localhost", port))
         rf = sock.makefile("rb", 65536)