Task 16415557

Name hadam3p_anz_n88g_2012_1_008598288_0
Workunit 8744800
Created 26 Mar 2014, 18:56:17 UTC
Sent 27 Mar 2014, 23:30:50 UTC
Report deadline 10 Mar 2015, 4:50:50 UTC
Received 9 May 2014, 22:05:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1261147
Run time 7 days 15 hours 59 min 54 sec
CPU time 6 days 16 hours 10 min 59 sec
Validate state Invalid
Credit 4,981.10
Device peak FLOPS 2.81 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
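
The exit status above is listed both as a signed decimal (-226) and as a 32-bit hexadecimal value (0xFFFFFF1E). The following is a minimal sketch of how the two forms relate, assuming the hex form is the 32-bit two's-complement encoding of the signed value. The helper names `to_unsigned32` and `to_signed32` are hypothetical and not part of BOINC.

```python
def to_unsigned32(value: int) -> int:
    """Reinterpret a signed integer as an unsigned 32-bit value (two's complement)."""
    return value & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed integer (two's complement)."""
    value &= 0xFFFFFFFF
    # If the sign bit is set, the value represents a negative number.
    return value - 0x100000000 if value & 0x80000000 else value


# Exit status as reported on this page (assumed to be a 32-bit quantity).
print(f"0x{to_unsigned32(-226):08X}")
print(to_signed32(0xFFFFFF1E))
```

Converting in both directions makes it easy to match the hex value shown here against a signed error constant such as ERR_TOO_MANY_EXITS.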
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1228, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3884, selfPID=3668, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4520, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3996, iMonCtr=2
Model crash detected, will try to restart...
18:02:24 (4844): No heartbeat from core client for 30 sec - exiting
18:02:25 (4844): No heartbeat from core client for 30 sec - exiting
18:02:26 (4844): No heartbeat from core client for 30 sec - exiting
18:02:27 (4844): No heartbeat from core client for 30 sec - exiting
18:02:28 (4844): No heartbeat from core client for 30 sec - exiting
18:02:29 (4844): No heartbeat from core client for 30 sec - exiting
18:02:30 (4844): No heartbeat from core client for 30 sec - exiting
18:02:31 (4844): No heartbeat from core client for 30 sec - exiting
18:02:32 (4844): No heartbeat from core client for 30 sec - exiting
18:02:33 (4844): No heartbeat from core client for 30 sec - exiting
18:02:34 (4844): No heartbeat from core client for 30 sec - exiting
18:02:35 (4844): No heartbeat from core client for 30 sec - exiting
18:02:36 (4844): No heartbeat from core client for 30 sec - exiting
18:02:37 (4844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:02:38 (4844): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5384, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1620, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1496, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=928, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4080, selfPID=2992, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=964, selfPID=5828, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2280, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4056, selfPID=4056, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
09 May 2014 00:54:08  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  115,499   563,664         4.8803
04 May 2014 15:58:16  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  103,979   507,031         4.8763
02 May 2014 23:45:35  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  92,459    450,808         4.8758
25 Apr 2014 01:09:11  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  80,939    394,757         4.8772
18 Apr 2014 22:37:36  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  69,419    338,454         4.8755
14 Apr 2014 01:19:08  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  57,899    282,288         4.8755
11 Apr 2014 23:53:49  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  46,379    226,132         4.8757
06 Apr 2014 23:28:53  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  34,859    169,690         4.8679
05 Apr 2014 00:18:30  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  23,339    113,496         4.8629
30 Mar 2014 23:10:35  1261147  16415557   hadam3p_anz_n88g_2012_1_008598288_0  11,819    57,213          4.8408
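
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. This is inferred from the numbers in the table rather than stated on the page, so the sketch below only checks that reading against three of the rows. The function name `seconds_per_timestep` is hypothetical.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
rows = [
    (115_499, 563_664, 4.8803),
    (103_979, 507_031, 4.8763),
    (11_819, 57_213, 4.8408),
]


def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Average CPU seconds per model timestep (assumed definition of the column)."""
    return cpu_seconds / timesteps


for timesteps, cpu_seconds, reported in rows:
    computed = seconds_per_timestep(cpu_seconds, timesteps)
    print(f"{timesteps:>7} steps: computed {computed:.4f}, reported {reported}")
```

Under this reading, the nearly constant average of about 4.88 sec/TS suggests the model progressed at a steady rate up to the last trickle.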