aboutsummaryrefslogtreecommitdiff
path: root/dev/run-tests
diff options
context:
space:
mode:
authorMark Hamstra <markhamstra@gmail.com>2014-09-08 20:51:56 -0700
committerAndrew Or <andrewor14@gmail.com>2014-09-08 20:51:56 -0700
commit092e2f152fb674e7200cc8a2cb99a8fe0a9b2b33 (patch)
tree4617b2c6247bef0dc434005072e36730b97e5494 /dev/run-tests
parent2b7ab814f9bde65ebc57ebd04386e56c97f06f4a (diff)
downloadspark-092e2f152fb674e7200cc8a2cb99a8fe0a9b2b33.tar.gz
spark-092e2f152fb674e7200cc8a2cb99a8fe0a9b2b33.tar.bz2
spark-092e2f152fb674e7200cc8a2cb99a8fe0a9b2b33.zip
SPARK-2425 Don't kill a still-running Application because of some misbehaving Executors
Introduces a LOADING -> RUNNING ApplicationState transition and prevents Master from removing an Application with RUNNING Executors. Two basic changes: 1) Instead of allowing MAX_NUM_RETRY abnormal Executor exits over the entire lifetime of the Application, allow that many since any Executor successfully began running the Application; 2) Don't remove the Application while Master still thinks that there are RUNNING Executors. This should be fine as long as the ApplicationInfo doesn't believe any Executors are forever RUNNING when they are not. I think that any non-RUNNING Executors will eventually no longer be RUNNING in Master's accounting, but another set of eyes should confirm that. This PR also doesn't try to detect which nodes have gone rogue or to kill off bad Workers, so repeatedly failing Executors will continue to fail and fill up log files with failure reports as long as the Application keeps running. Author: Mark Hamstra <markhamstra@gmail.com> Closes #1360 from markhamstra/SPARK-2425 and squashes the following commits: f099c0b [Mark Hamstra] Reuse appInfo b2b7b25 [Mark Hamstra] Moved 'Application failed' logging bdd0928 [Mark Hamstra] switched to string interpolation 1dd591b [Mark Hamstra] SPARK-2425 introduce LOADING -> RUNNING ApplicationState transition and prevent Master from removing Application with RUNNING Executors
Diffstat (limited to 'dev/run-tests')
0 files changed, 0 insertions, 0 deletions