climateprediction.net home page
Task 16397085

Name hadam3p_anz_n6g3_2012_1_008582043_0
Workunit 8728555
Created 25 Mar 2014, 19:12:21 UTC
Sent 25 Mar 2014, 19:13:25 UTC
Report deadline 8 Mar 2015, 0:33:25 UTC
Received 8 May 2014, 16:24:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1287911
Run time 10 days 18 hours 36 min 40 sec
CPU time 8 days 9 hours 15 min 3 sec
Validate state Invalid
Credit 5,477.92
Device peak FLOPS 2.88 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 (windows_intelx86)
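The exit status above is shown both as a signed decimal (-226) and as a hexadecimal value (0xFFFFFF1E). The two are the same 32-bit quantity read as signed versus unsigned. A minimal sketch of the conversion follows; the helper names `to_unsigned32_hex` and `to_signed32` are hypothetical, for illustration only, and not part of BOINC.

```python
# Hypothetical helpers illustrating the signed/unsigned 32-bit views
# of an exit status; these are not BOINC functions.

def to_unsigned32_hex(status: int) -> str:
    """Render a signed 32-bit status as an 8-digit hex string."""
    return f"0x{status & 0xFFFFFFFF:08X}"


def to_signed32(hex_text: str) -> int:
    """Interpret a 32-bit hex string as a two's-complement signed integer."""
    value = int(hex_text, 16)
    # Values with the top bit set represent negative numbers.
    return value - (1 << 32) if value >= (1 << 31) else value


print(to_unsigned32_hex(-226))    # 0xFFFFFF1E
print(to_signed32("0xFFFFFF1E"))  # -226
```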
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4992, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9264, selfPID=9264, iMonCtr=2
10:39:19 (3652): No heartbeat from core client for 30 sec - exiting
10:39:20 (3652): No heartbeat from core client for 30 sec - exiting
10:39:21 (3652): No heartbeat from core client for 30 sec - exiting
10:39:22 (3652): No heartbeat from core client for 30 sec - exiting
10:39:24 (3652): No heartbeat from core client for 30 sec - exiting
10:39:25 (3652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6620, selfPID=6620, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3832, selfPID=4200, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3904, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5728, selfPID=4404, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:14:31 (6092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6788, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3524, selfPID=3524, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3144, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1336, selfPID=5832, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4376, iMonCtr=2
08:04:40 (5688): No heartbeat from core client for 30 sec - exiting
08:04:41 (5688): No heartbeat from core client for 30 sec - exiting
08:04:42 (5688): No heartbeat from core client for 30 sec - exiting
08:04:43 (5688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3296, selfPID=3296, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=884, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4604, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7504, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4264, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16800, selfPID=16800, iMonCtr=2
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
06 May 2014 16:44:33  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0  127,019   672,507         5.2945
05 May 2014 08:13:40  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0  115,499   609,459         5.2767
02 May 2014 12:19:48  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0  103,979   548,178         5.2720
23 Apr 2014 21:00:41  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   92,459   486,487         5.2617
21 Apr 2014 16:30:39  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   80,939   426,622         5.2709
15 Apr 2014 20:18:12  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   69,419   365,506         5.2652
11 Apr 2014 12:03:46  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   57,899   305,651         5.2790
09 Apr 2014 10:35:03  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   46,379   245,606         5.2956
07 Apr 2014 06:25:21  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   34,859   184,865         5.3032
03 Apr 2014 19:22:05  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   23,339   124,173         5.3204
30 Mar 2014 17:47:26  1287911  16397085   hadam3p_anz_n6g3_2012_1_008582043_0   11,819    62,614         5.2977
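
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, and successive trickles are 11,520 timesteps apart. The sketch below checks both observations against three rows copied from the table; the variable and function names are hypothetical, and the formula is inferred from the figures rather than taken from project documentation.

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds).
# Names here are illustrative only.
trickles = [
    (127_019, 672_507),
    (115_499, 609_459),
    (11_819, 62_614),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps, to 4 places."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds in trickles:
    print(timestep, seconds_per_timestep(timestep, cpu_seconds))

# Gap between the two most recent trickles, in timesteps.
print(trickles[0][0] - trickles[1][0])
```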

