climateprediction.net home page
Task 14631753

Task 14631753

Name hadam3p_pnw_bv3c_1975_1_007926599_1
Workunit 8081711
Created 5 May 2012, 17:19:09 UTC
Sent 5 May 2012, 18:05:48 UTC
Report deadline 17 Apr 2013, 23:25:48 UTC
Received 8 Jun 2012, 20:49:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 984314
Run time 5 days 21 hours 56 min 29 sec
CPU time 5 days 21 hours 56 min 29 sec
Validate state Invalid
Credit 2,755.56
Device peak FLOPS 2.19 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1736, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4652, selfPID=3904, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4188, selfPID=5592, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4716, selfPID=4996, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5452, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5936, selfPID=5284, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5364, selfPID=5872, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2420, selfPID=5976, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1872, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3036, selfPID=4504, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4988, selfPID=4396, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1092, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5384, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2212, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=168, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4264, selfPID=2192, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 10
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=288, selfPID=2836, iMonCtr=1
Model crash detected, will try to restart...
20:27:32 (4664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3288, selfPID=4432, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Jun 2012 17:35:40 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 126,816 493,300 3.8899
05 Jun 2012 20:04:35 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 115,296 449,054 3.8948
02 Jun 2012 17:57:08 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 103,776 404,783 3.9005
30 May 2012 05:49:29 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 92,256 360,358 3.9061
21 May 2012 19:54:50 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 80,736 316,151 3.9159
20 May 2012 10:54:05 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 69,216 271,836 3.9274
19 May 2012 11:22:42 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 57,696 227,384 3.9411
17 May 2012 13:00:34 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 46,176 182,472 3.9517
13 May 2012 11:05:19 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 34,656 136,342 3.9342
11 May 2012 19:34:09 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 23,136 91,205 3.9421
06 May 2012 20:27:01 984314 14631753 hadam3p_pnw_bv3c_1975_1_007926599_1 11,616 45,998 3.9599


©2024 cpdn.org