Task 13994232

Name hadam3p_eu_91iz_1969_1_007722035_0
Workunit 7877143
Created 26 Jan 2012, 13:30:34 UTC
Sent 13 Feb 2012, 17:57:10 UTC
Report deadline 25 Jan 2013, 23:17:10 UTC
Received 6 Mar 2012, 22:06:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1164706
Run time 6 days 23 hours 35 min 41 sec
CPU time 4 days 8 hours 38 min 9 sec
Validate state Invalid
Credit 1,194.02
Device peak FLOPS 1.22 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
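
The exit status is listed both as a signed number (-226) and as a hexadecimal value (0xFFFFFF1E). As a minimal sketch, the two forms can be related by treating the hex value as a 32-bit two's-complement integer. The helper names `to_signed32` and `to_unsigned32_hex` are hypothetical and are not part of BOINC.

```python
def to_signed32(value: int) -> int:
    """Interpret an unsigned 32-bit value as a signed two's-complement integer."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value represents a negative number.
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value


def to_unsigned32_hex(value: int) -> str:
    """Format a (possibly negative) integer as an 8-digit 32-bit hex string."""
    return f"0x{value & 0xFFFFFFFF:08X}"


signed_status = to_signed32(0xFFFFFF1E)
hex_status = to_unsigned32_hex(-226)
print(signed_status, hex_status)
```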
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5728, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1996, selfPID=1996, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2120, selfPID=4232, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3584, selfPID=3584, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2236, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4240, selfPID=4644, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2620, selfPID=3472, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=264, selfPID=264, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:05:23 (4572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2196, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=360, selfPID=360, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4904, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=2
Model crash detected, will try to restart...
15:25:16 (880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:25:17 (880): No heartbeat from core client for 30 sec - exiting
15:25:18 (880): No heartbeat from core client for 30 sec - exiting
15:25:19 (880): No heartbeat from core client for 30 sec - exiting
15:25:20 (880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5840, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2236, selfPID=1564, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2552, selfPID=2552, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2272, selfPID=2272, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
GCobal oorkerlle CPDN process is not not running, exiting, bRetVal = Val = 1, checkP selfPIselfPID=4120, iM=nCt
r=2
l crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=200, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1168, selfPID=5260, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3632, selfPID=3632, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1104, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1268, iMonCtr=2
Model crash detected, will try to restart...
09:49:12 (2760): No heartbeat from core client for 30 sec - exiting
09:49:13 (2760): No heartbeat from core client for 30 sec - exiting
09:49:14 (2760): No heartbeat from core client for 30 sec - exiting
09:49:15 (2760): No heartbeat from core client for 30 sec - exiting
09:49:16 (2760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5344, selfPID=5336, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1648, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...

</stderr_txt>
]]>
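
A stderr dump like the one above is easier to read once the repeated messages are tallied. The following is only a sketch: the sample lines are copied from this log, while the category labels and the `tally` helper are assumptions made for illustration, not CPDN or BOINC terminology.

```python
from collections import Counter

# A few lines copied from the stderr output of this task.
SAMPLE = """\
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5728, iMonCtr=2
Model crash detected, will try to restart...
20:05:23 (4572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
"""

# Hypothetical labels mapped to substrings that identify each message type.
MARKERS = {
    "quit_request": "Quit request from BOINC",
    "process_not_running": "CPDN process is not running",
    "restart_attempt": "Model crash detected",
    "no_heartbeat": "No heartbeat from core client",
}


def tally(text: str) -> Counter:
    """Count how many lines contain each marker substring."""
    counts = Counter()
    for line in text.splitlines():
        for label, marker in MARKERS.items():
            if marker in line:
                counts[label] += 1
    return counts


sample_counts = tally(SAMPLE)
print(dict(sample_counts))
```

Running `tally` over the full stderr text instead of `SAMPLE` would give the totals for the whole task.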
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
03 Mar 2012 21:03:14  1164706  13994232   hadam3p_eu_91iz_1969_1_007722035_0  69,216    343,685         4.9654
29 Feb 2012 20:28:13  1164706  13994232   hadam3p_eu_91iz_1969_1_007722035_0  57,696    283,438         4.9126
26 Feb 2012 10:45:22  1164706  13994232   hadam3p_eu_91iz_1969_1_007722035_0  46,176    228,727         4.9534
23 Feb 2012 22:36:30  1164706  13994232   hadam3p_eu_91iz_1969_1_007722035_0  34,656    174,027         5.0216
20 Feb 2012 10:11:09  1164706  13994232   hadam3p_eu_91iz_1969_1_007722035_0  23,136    119,032         5.1449
18 Feb 2012 14:37:08  1164706  13994232   hadam3p_eu_91iz_1969_1_007722035_0  11,616    61,547          5.2985
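
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count; that reading is an assumption, though the listed figures are consistent with it. A short sketch that recomputes the column from the other two (the function name `seconds_per_timestep` is hypothetical):

```python
# (timestep, cpu_time_sec, reported_average) taken from the trickle table.
TRICKLES = [
    (69216, 343685, 4.9654),
    (57696, 283438, 4.9126),
    (46176, 228727, 4.9534),
    (34656, 174027, 5.0216),
    (23136, 119032, 5.1449),
    (11616, 61547, 5.2985),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


computed = [seconds_per_timestep(cpu, ts) for ts, cpu, _ in TRICKLES]
for (ts, cpu, reported), value in zip(TRICKLES, computed):
    print(f"{ts:>6} timesteps: computed {value:.4f}, reported {reported:.4f}")
```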

