Task 14935114

Name hadam3p_eu_a75s_1975_1_008060182_1
Workunit 8215296
Created 18 Jul 2012, 8:48:20 UTC
Sent 23 Jul 2012, 11:14:37 UTC
Report deadline 5 Jul 2013, 16:34:37 UTC
Received 7 Jan 2013, 10:59:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 1 (0x00000001) Unknown error code
Computer ID 1194677
Run time 7 days 21 hours 58 min 54 sec
CPU time 6 days 11 hours 35 min 32 sec
Validate state Invalid
Credit 1,591.48
Device peak FLOPS 1.56 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2968, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5912, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3424, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4132, selfPID=3180, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2540, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2032, selfPID=3408, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2892, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1068, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3444, selfPID=2812, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=980, selfPID=1808, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2168, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1276, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3012, iMonCtr=2
Model crash detected, will try to restart...
18:33:53 (3908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2336, selfPID=2336, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6060, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2964, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2924, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5768, selfPID=2960, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=680, selfPID=4000, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3068, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4644, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4652, selfPID=2844, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2248, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2468, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1116, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2260, selfPID=3020, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2492, selfPID=3504, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3224, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2068, selfPID=2904, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2156, selfPID=3004, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2924, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2564, selfPID=2696, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2368, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4416, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2044, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2852, selfPID=3100, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5364, selfPID=4220, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=2964, iMonCtr=1
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4064, selfPID=4064, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4064, selfPID=3320, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...

</stderr_txt>
]]>
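
The stderr output above repeats a small set of message shapes, so a tally can make the pattern easier to read. The following is a minimal sketch and not part of any CPDN or BOINC tooling: the function name `summarize`, the regular expression, and the `SAMPLE` excerpt (the first five log lines) are assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical pattern for the "process is not running" lines seen in this log.
LINE_RE = re.compile(
    r"^(?P<role>Controller|Global Worker|Regional Worker):: "
    r"CPDN process is not running, exiting, "
    r"bRetVal = (?P<ret>\d+), checkPID=(?P<check>\d+), "
    r"selfPID=(?P<self>\d+), iMonCtr=(?P<ctr>\d+)$"
)


def summarize(stderr_text):
    """Count 'not running' messages per role and the restart attempts."""
    roles = Counter()
    restarts = 0
    for raw in stderr_text.splitlines():
        line = raw.strip()
        match = LINE_RE.match(line)
        if match:
            roles[match.group("role")] += 1
        elif line.startswith("Model crash detected"):
            restarts += 1
    return roles, restarts


# Excerpt copied from the start of the stderr block above.
SAMPLE = """\
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2968, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5912, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3424, iMonCtr=2
Model crash detected, will try to restart...
"""

if __name__ == "__main__":
    roles, restarts = summarize(SAMPLE)
    print(dict(roles), "restart attempts:", restarts)
```

Running the same function over the full stderr text would give the per-role counts for the whole task. The tally only describes the log; it does not explain why the model process stopped.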
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
19 Dec 2012 13:52:52  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  92,256    540,137         5.8548
03 Oct 2012 10:51:53  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  80,738    471,439         5.8391
30 Sep 2012 12:22:14  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  80,736    470,489         5.8275
07 Sep 2012 16:01:28  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  69,216    401,612         5.8023
27 Aug 2012 13:27:25  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  57,702    335,213         5.8094
27 Aug 2012 12:27:13  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  57,698    334,289         5.7938
26 Aug 2012 17:33:18  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  57,696    333,353         5.7777
23 Aug 2012 13:52:33  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  46,177    266,479         5.7708
23 Aug 2012 13:52:33  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  46,176    265,596         5.7518
06 Aug 2012 14:39:41  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  34,656    199,359         5.7525
03 Aug 2012 12:21:58  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  23,136    133,245         5.7592
30 Jul 2012 15:43:10  1194677  14935114   hadam3p_eu_a75s_1975_1_008060182_1  11,616    67,236          5.7882
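
The Average (sec/TS) column appears to be the CPU time divided by the timestep count. That relationship is inferred from the figures in the table, not stated on the page. A minimal sketch that checks it against three of the rows follows; the names `TRICKLES` and `avg_sec_per_timestep` are made up for this example.

```python
# (timestep, cpu_time_sec, listed_average) copied from three trickle rows above.
TRICKLES = [
    (92256, 540137, 5.8548),  # 19 Dec 2012
    (34656, 199359, 5.7525),  # 06 Aug 2012
    (11616, 67236, 5.7882),   # 30 Jul 2012
]


def avg_sec_per_timestep(timestep, cpu_time_sec):
    """Assumed definition: CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_sec, listed in TRICKLES:
        computed = avg_sec_per_timestep(timestep, cpu_sec)
        # Compare with a tolerance, since the page lists four decimal places.
        status = "matches" if abs(computed - listed) < 1e-4 else "differs"
        print(f"{timestep:>6} timesteps: listed {listed}, {status}")
```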

