climateprediction.net home page
Task 14577861

Task 14577861

Name hadam3p_pnw_ytkh_1976_1_006883785_1
Workunit 7087101
Created 23 Apr 2012, 11:23:42 UTC
Sent 23 Apr 2012, 11:27:02 UTC
Report deadline 5 Apr 2013, 16:47:02 UTC
Received 6 Jun 2012, 18:47:21 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 1160229
Run time 11 days 6 hours 22 min 42 sec
CPU time 9 days 20 hours 51 min 48 sec
Validate state Workunit error - check skipped
Credit 3,005.88
Device peak FLOPS 2.02 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5776, selfPID=3628, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5396, selfPID=3456, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5456, selfPID=2664, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4632, selfPID=3364, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6116, selfPID=6116, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5756, iMonCtr=2
Model crash detected, will try to restart...
00:43:20 (420): No heartbeat from core client for 30 sec - exiting
00:43:21 (420): No heartbeat from core client for 30 sec - exiting
00:43:22 (420): No heartbeat from core client for 30 sec - exiting
00:43:23 (420): No heartbeat from core client for 30 sec - exiting
00:43:24 (420): No heartbeat from core client for 30 sec - exiting
00:43:25 (420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4140, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3088, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5136, selfPID=5424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=2
Model crash detected, will try to restart...
Colntroller:: CerN process is not running, exiting, bRiting, bRetVal = 1ID=0, skPID=0, selfPID=4656, iM
onCtr cra
sh detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2172, iMonCtr=2
Model crash detected, will try to restart...
Global WorkGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4408, selfPID=1616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3912, selfPID=1784, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1916, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=712, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4548, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4556, selfPID=1248, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4340, selfPID=2364, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2060, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Atmos Restart file copy failed on atmos_restart.day
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5596, selfPID=3548, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1504, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1260, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4596, selfPID=3748, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 7
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2352, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1260, selfPID=1900, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4356, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5276, selfPID=1916, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4848, selfPID=2316, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=3348, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4828, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4768, selfPID=1140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5840, selfPID=2988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4824, selfPID=1084, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5940, selfPID=3608, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4852, selfPID=3576, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2448, selfPID=3600, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3312, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5168, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Jun 2012 17:36:29 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 138,336 851,224 6.1533
03 Jun 2012 16:19:28 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 126,816 779,006 6.1428
01 Jun 2012 17:03:06 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 115,296 707,334 6.1349
25 May 2012 18:37:34 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 103,776 637,499 6.1430
21 May 2012 18:15:25 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 92,257 566,657 6.1422
21 May 2012 03:32:04 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 92,256 565,745 6.1323
19 May 2012 11:53:20 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 80,736 495,645 6.1391
17 May 2012 11:33:43 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 69,216 423,185 6.1140
13 May 2012 18:20:20 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 57,696 352,433 6.1084
12 May 2012 10:13:01 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 46,177 282,088 6.1088
11 May 2012 19:18:37 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 46,176 281,274 6.0913
06 May 2012 12:58:54 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 34,656 213,761 6.1681
27 Apr 2012 17:56:15 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 23,136 147,073 6.3569
25 Apr 2012 15:56:40 1160229 14577861 hadam3p_pnw_ytkh_1976_1_006883785_1 11,616 74,152 6.3836


©2024 cpdn.org