climateprediction.net home page
Task 14630333

Task 14630333

Name hadam3p_pnw_bqxc_1975_1_007921199_1
Workunit 8076311
Created 5 May 2012, 5:31:07 UTC
Sent 5 May 2012, 5:59:23 UTC
Report deadline 17 Apr 2013, 11:19:23 UTC
Received 24 Aug 2012, 12:38:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1175594
Run time 17 days 23 hours 19 min 28 sec
CPU time 12 days 2 hours 53 min 35 sec
Validate state Invalid
Credit 1,504.07
Device peak FLOPS 0.70 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5280, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5776, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3880, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4360, selfPID=2380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2936, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=828, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1868, selfPID=772, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4488, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3516, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3924, selfPID=2856, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Colobal Woroler:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID==3712, iMonCtr=2

Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2956, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4164, iMonCtr=2
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2224, selfPID=2816, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4824, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1080, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2408, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=2
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5528, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2488, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=4680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5036, selfPID=1952, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3688, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2292, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5152, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5520, selfPID=2292, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=2
Model crash detected, will try to restart...
GLeaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
CGlobal Worker:: CPDN processntr llerot CunNing,cexi in , b rtVaing,1,xiting,IbRetVal = ID ch68kPiM=0, s=lf
PID=3540, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4040, iMonCtr=2
13:01:37 (4216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5184, selfPID=4652, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5244, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4836, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Aug 2012 05:15:40 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 69,220 1,037,765 14.9923
17 Aug 2012 01:53:34 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 69,217 1,035,814 14.9647
16 Aug 2012 17:37:52 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 69,216 1,033,855 14.9366
05 Aug 2012 13:42:19 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 57,696 859,832 14.9028
27 Jul 2012 09:15:42 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 46,176 696,100 15.0749
19 Jul 2012 18:42:04 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 34,656 525,148 15.1532
04 Jul 2012 15:18:10 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 23,136 355,712 15.3748
10 Jun 2012 22:46:00 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 11,622 181,576 15.6235
09 Jun 2012 05:12:35 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 11,620 179,470 15.4449
20 May 2012 18:14:51 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 11,617 177,593 15.2873
20 May 2012 05:16:50 1175594 14630333 hadam3p_pnw_bqxc_1975_1_007921199_1 11,616 175,658 15.1221


©2024 climateprediction.net