climateprediction.net home page
Task 16338408

Task 16338408

Name hadam3p_eu_f1t2_2013_1_008548733_0
Workunit 8696245
Created 5 Mar 2014, 16:04:53 UTC
Sent 8 Mar 2014, 12:26:48 UTC
Report deadline 18 Feb 2015, 17:46:48 UTC
Received 20 Mar 2014, 10:38:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1316256
Run time 2 days 17 hours 7 min 50 sec
CPU time 9 hours 48 min 53 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 2.07 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.8.44</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20092, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2068, selfPID=3428, iMonCtr=1
Model crash detected, will try to restart...
01:53:40 (4992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5160, selfPID=4856, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27988, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8164, selfPID=5192, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5572, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Glontrollerrker:: CPDN pess is nos not runnin exiting, bRetVatVal = 1, checkPID, selfPID=5132, iMonCtr=tr=
Mo
del crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4172, selfPID=4344, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5332, selfPID=4772, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=86560, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3808, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=225580, iMonCtr=2
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5984, selfPID=5232, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2464, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5180, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5524, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3076, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2284, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4436, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=5344, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Glontrolobal: CPDN :: ocess is not running, exiting, bRetVal = 1, checkPID=0, selfPID=selfPID=5192, iMo
MCtr=2
odel crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6108, selfPID=1544, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6232, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8972, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1400, selfPID=6628, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Mar 2014 19:32:13 1316256 16338408 hadam3p_eu_f1t2_2013_1_008548733_0 80,736 15,263 0.1890
16 Mar 2014 06:46:45 1316256 16338408 hadam3p_eu_f1t2_2013_1_008548733_0 69,216 155,525 2.2470
16 Mar 2014 06:46:45 1316256 16338408 hadam3p_eu_f1t2_2013_1_008548733_0 57,696 131,824 2.2848
16 Mar 2014 06:46:45 1316256 16338408 hadam3p_eu_f1t2_2013_1_008548733_0 46,176 110,959 2.4030
16 Mar 2014 06:46:45 1316256 16338408 hadam3p_eu_f1t2_2013_1_008548733_0 34,656 89,961 2.5958
10 Mar 2014 18:34:43 1316256 16338408 hadam3p_eu_f1t2_2013_1_008548733_0 23,136 65,040 2.8112
09 Mar 2014 04:34:34 1316256 16338408 hadam3p_eu_f1t2_2013_1_008548733_0 11,616 33,045 2.8448


©2024 cpdn.org