climateprediction.net home page
Task 12601305

Task 12601305

Name hadam3p_eu_2rh6_1989_1_007169106_0
Workunit 7353946
Created 18 Feb 2011, 18:45:05 UTC
Sent 20 Feb 2011, 9:26:25 UTC
Report deadline 2 Feb 2012, 14:46:25 UTC
Received 20 Apr 2011, 17:48:02 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 1131863
Run time 6 days 3 hours 38 min 9 sec
CPU time 5 days 20 hours 12 min 35 sec
Validate state Workunit error - check skipped
Credit 2,386.39
Device peak FLOPS 2.53 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2412, selfPID=1628, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2304, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3220, selfPID=3304, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3188, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2556, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2384, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3636, iMonCtr=2
Leaving CPDN_Main::Monitor...
Coltroller :: kerDN:p: CPDN procot running, exiting, bRetVal = RetVa1, checkPID=0, self selfPID=4024, iMo2
Ctr=2
 crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1248, selfPID=904, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2612, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
07:20:41 (1588): No heartbeat from core client for 30 sec - exiting
07:20:42 (1588): No heartbeat from core client for 30 sec - exiting
07:20:44 (1588): No heartbeat from core client for 30 sec - exiting
07:20:45 (1588): No heartbeat from core client for 30 sec - exiting
07:20:46 (1588): No heartbeat from core client for 30 sec - exiting
07:20:47 (1588): No heartbeat from core client for 30 sec - exiting
07:20:48 (1588): No heartbeat from core client for 30 sec - exiting
07:20:49 (1588): No heartbeat from core client for 30 sec - exiting
07:20:50 (1588): No heartbeat from core client for 30 sec - exiting
07:20:51 (1588): No heartbeat from core client for 30 sec - exiting
07:20:52 (1588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2112, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=164, iMonCtr=2
Leaving CPDN_Main::Monitor...

zip error: Could not create output file (was replacing the original zip file)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2844, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2912, selfPID=3100, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2076, selfPID=212, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2868, selfPID=2880, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1484, selfPID=3084, iMonCtr=1
Model crash detected, will try to restart...
23:09:40 (880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:09:42 (880): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=520, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1116, selfPID=3940, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
21:42:55 (2236): No heartbeat from core client for 30 sec - exiting
21:42:56 (2236): No heartbeat from core client for 30 sec - exiting
21:42:57 (2236): No heartbeat from core client for 30 sec - exiting
21:42:58 (2236): No heartbeat from core client for 30 sec - exiting
21:42:59 (2236): No heartbeat from core client for 30 sec - exiting
21:43:00 (2236): No heartbeat from core client for 30 sec - exiting
21:43:01 (2236): No heartbeat from core client for 30 sec - exiting
21:43:03 (2236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=720, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1848, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2968, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=296, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1980, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1984, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2504, selfPID=2700, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
ContbalrWollerr::: CDN Nproccess iss not  running, exxitingg, bReVtVa=l , chec= 1, checklPID==0,4 s elofPIrD=7
Mod il crCtrh=2e
ected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2912, selfPID=988, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:35:41 (3504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2792, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2616, selfPID=3212, iMonCtr=1
Model crash detected, will try to restart...
18:55:28 (3240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3348, selfPID=3248, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3120, selfPID=2344, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
21:47:04 (2204): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Apr 2011 19:07:44 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 138,336 503,945 3.6429
10 Apr 2011 07:35:29 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 126,816 462,853 3.6498
05 Apr 2011 19:04:46 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 115,296 423,301 3.6714
02 Apr 2011 15:50:53 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 103,780 383,683 3.6971
02 Apr 2011 11:06:37 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 103,776 383,076 3.6914
29 Mar 2011 06:15:49 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 92,256 342,371 3.7111
28 Mar 2011 18:26:08 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 80,736 301,583 3.7354
24 Mar 2011 11:05:39 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 69,216 261,238 3.7742
24 Mar 2011 00:03:35 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 57,696 221,435 3.8380
10 Mar 2011 02:06:48 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 46,176 181,255 3.9253
08 Mar 2011 22:02:31 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 34,656 137,767 3.9753
08 Mar 2011 22:02:31 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 23,137 92,989 4.0191
08 Mar 2011 22:02:31 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 23,136 92,329 3.9907
22 Feb 2011 09:18:12 1131863 12601305 hadam3p_eu_2rh6_1989_1_007169106_0 11,616 46,707 4.0209


©2024 cpdn.org