climateprediction.net home page
Task 11883975

Task 11883975

Name hadam3p_pnw_v3k0_1959_1_006678238_1
Workunit 6881491
Created 13 Sep 2010, 14:26:03 UTC
Sent 13 Sep 2010, 19:19:41 UTC
Report deadline 27 Aug 2011, 0:39:41 UTC
Received 11 Oct 2010, 13:41:50 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 1048978
Run time 5 days 19 hours 13 min 1 sec
CPU time 4 days 23 hours 5 min 40 sec
Validate state Workunit error - check skipped
Credit 3,005.88
Device peak FLOPS 2.56 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.05
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
18:13:59 (2280): No heartbeat from core client for 30 sec - exiting
18:14:00 (2280): No heartbeat from core client for 30 sec - exiting
18:14:01 (2280): No heartbeat from core client for 30 sec - exiting
18:14:02 (2280): No heartbeat from core client for 30 sec - exiting
18:14:03 (2280): No heartbeat from core client for 30 sec - exiting
18:14:04 (2280): No heartbeat from core client for 30 sec - exiting
18:14:05 (2280): No heartbeat from core client for 30 sec - exiting
18:14:06 (2280): No heartbeat from core client for 30 sec - exiting
18:14:07 (2280): No heartbeat from core client for 30 sec - exiting
18:14:08 (2280): No heartbeat from core client for 30 sec - exiting
18:14:09 (2280): No heartbeat from core client for 30 sec - exiting
18:14:10 (2280): No heartbeat from core client for 30 sec - exiting
18:14:11 (2280): No heartbeat from core client for 30 sec - exiting
18:14:12 (2280): No heartbeat from core client for 30 sec - exiting
18:14:13 (2280): No heartbeat from core client for 30 sec - exiting
18:14:14 (2280): No heartbeat from core client for 30 sec - exiting
18:14:15 (2280): No heartbeat from core client for 30 sec - exiting
18:14:16 (2280): No heartbeat from core client for 30 sec - exiting
18:14:17 (2280): No heartbeat from core client for 30 sec - exiting
18:14:18 (2280): No heartbeat from core client for 30 sec - exiting
18:14:19 (2280): No heartbeat from core client for 30 sec - exiting
18:14:20 (2280): No heartbeat from core client for 30 sec - exiting
18:14:21 (2280): No heartbeat from core client for 30 sec - exiting
18:14:22 (2280): No heartbeat from core client for 30 sec - exiting
18:14:23 (2280): No heartbeat from core client for 30 sec - exiting
18:14:24 (2280): No heartbeat from core client for 30 sec - exiting
18:14:25 (2280): No heartbeat from core client for 30 sec - exiting
18:14:26 (2280): No heartbeat from core client for 30 sec - exiting
18:14:27 (2280): No heartbeat from core client for 30 sec - exiting
18:14:28 (2280): No heartbeat from core client for 30 sec - exiting
18:14:29 (2280): No heartbeat from core client for 30 sec - exiting
18:14:30 (2280): No heartbeat from core client for 30 sec - exiting
18:14:31 (2280): No heartbeat from core client for 30 sec - exiting
18:14:32 (2280): No heartbeat from core client for 30 sec - exiting
18:14:33 (2280): No heartbeat from core client for 30 sec - exiting
18:14:34 (2280): No heartbeat from core client for 30 sec - exiting
18:14:35 (2280): No heartbeat from core client for 30 sec - exiting
18:14:36 (2280): No heartbeat from core client for 30 sec - exiting
18:14:37 (2280): No heartbeat from core client for 30 sec - exiting
18:14:38 (2280): No heartbeat from core client for 30 sec - exiting
18:14:39 (2280): No heartbeat from core client for 30 sec - exiting
18:14:40 (2280): No heartbeat from core client for 30 sec - exiting
18:14:41 (2280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:14:42 (2280): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5496, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5932, selfPID=4336, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5920, selfPID=4104, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6136, selfPID=4352, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1912, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4772, selfPID=5160, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3636, selfPID=4284, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4756, selfPID=2504, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5568, selfPID=1968, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5884, selfPID=4380, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5844, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1284, selfPID=4644, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4360, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5480, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5536, selfPID=4180, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2588, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4192, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5636, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5656, selfPID=4232, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 7
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5412, selfPID=2860, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 7
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2456, selfPID=4740, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5468, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5492, selfPID=3600, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5392, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5460, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5452, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5468, selfPID=2384, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5840, selfPID=4348, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5196, selfPID=4816, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=2
Model crash detected, will try to restart...
GGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4128, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3968, selfPID=4232, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 10
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5448, selfPID=4664, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5760, selfPID=4284, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
20:59:21 (2984): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Oct 2010 18:48:29 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 138,336 427,965 3.0937
10 Oct 2010 11:21:31 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 126,816 391,546 3.0875
05 Oct 2010 16:49:49 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 115,296 355,214 3.0809
03 Oct 2010 13:08:21 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 103,776 319,385 3.0776
30 Sep 2010 16:57:36 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 92,256 284,474 3.0835
27 Sep 2010 19:50:15 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 80,736 250,270 3.0999
26 Sep 2010 14:07:04 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 69,216 214,758 3.1027
24 Sep 2010 19:32:04 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 57,696 180,123 3.1219
23 Sep 2010 13:47:08 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 46,180 144,407 3.1270
22 Sep 2010 19:59:33 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 46,176 143,981 3.1181
20 Sep 2010 17:59:10 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 34,656 107,260 3.0950
18 Sep 2010 16:56:24 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 23,136 71,171 3.0762
16 Sep 2010 16:59:31 1048978 11883975 hadam3p_pnw_v3k0_1959_1_006678238_1 11,616 35,733 3.0762


©2024 cpdn.org