climateprediction.net home page
Task 18621689

Task 18621689

Name hadam3p_anz_m7nk_2013_1_009946773_0
Workunit 9974135
Created 23 Jun 2015, 22:25:09 UTC
Sent 24 Jun 2015, 12:30:57 UTC
Report deadline 5 Jun 2016, 17:50:57 UTC
Received 13 Jul 2015, 6:05:14 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1316091
Run time 6 days 2 hours 58 min 56 sec
CPU time 5 days 11 hours 45 min 51 sec
Validate state Invalid
Credit 4,981.10
Device peak FLOPS 4.15 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1652, selfPID=7312, iMonCtr=1
Model crash detected, will try to restart...
07:35:31 (7660): No heartbeat from core client for 30 sec - exiting
07:35:32 (7660): No heartbeat from core client for 30 sec - exiting
07:35:33 (7660): No heartbeat from core client for 30 sec - exiting
07:35:34 (7660): No heartbeat from core client for 30 sec - exiting
07:35:35 (7660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3140, selfPID=3140, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4352, selfPID=5196, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
15:38:29 (1748): No heartbeat from core client for 30 sec - exiting
15:38:30 (1748): No heartbeat from core client for 30 sec - exiting
15:38:31 (1748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:58:59 (5432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7852, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7872, selfPID=6676, iMonCtr=1
Model crash detected, will try to restart...
20:00:01 (7864): No heartbeat from core client for 30 sec - exiting
20:00:02 (7864): No heartbeat from core client for 30 sec - exiting
20:00:03 (7864): No heartbeat from core client for 30 sec - exiting
20:00:04 (7864): No heartbeat from core client for 30 sec - exiting
20:00:05 (7864): No heartbeat from core client for 30 sec - exiting
20:00:06 (7864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8156, iMonCtr=2
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3712, selfPID=6264, iMonCtr=1
Model crash detected, will try to restart...
14:00:59 (7896): No heartbeat from core client for 30 sec - exiting
14:01:00 (7896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:54:30 (1772): No heartbeat from core client for 30 sec - exiting
07:54:32 (1772): No heartbeat from core client for 30 sec - exiting
07:54:33 (1772): No heartbeat from core client for 30 sec - exiting
07:54:34 (1772): No heartbeat from core client for 30 sec - exiting
07:54:35 (1772): No heartbeat from core client for 30 sec - exiting
07:54:36 (1772): No heartbeat from core client for 30 sec - exiting
07:54:37 (1772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:01:20 (8116): No heartbeat from core client for 30 sec - exiting
10:01:21 (8116): No heartbeat from core client for 30 sec - exiting
10:01:22 (8116): No heartbeat from core client for 30 sec - exiting
10:01:23 (8116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6388, selfPID=6388, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5320, selfPID=3848, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8184, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=2
09:45:42 (7876): No heartbeat from core client for 30 sec - exiting
09:45:43 (7876): No heartbeat from core client for 30 sec - exiting
09:45:44 (7876): No heartbeat from core client for 30 sec - exiting
09:45:45 (7876): No heartbeat from core client for 30 sec - exiting
09:45:46 (7876): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:08:00 (7276): No heartbeat from core client for 30 sec - exiting
08:08:01 (7276): No heartbeat from core client for 30 sec - exiting
08:08:02 (7276): No heartbeat from core client for 30 sec - exiting
08:08:03 (7276): No heartbeat from core client for 30 sec - exiting
08:08:04 (7276): No heartbeat from core client for 30 sec - exiting
08:08:05 (7276): No heartbeat from core client for 30 sec - exiting
08:08:06 (7276): No heartbeat from core client for 30 sec - exiting
08:08:07 (7276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11808, selfPID=11808, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2352, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
21:47:33 (7924): No heartbeat from core client for 30 sec - exiting
21:47:34 (7924): No heartbeat from core client for 30 sec - exiting
21:47:35 (7924): No heartbeat from core client for 30 sec - exiting
21:47:36 (7924): No heartbeat from core client for 30 sec - exiting
21:47:37 (7924): No heartbeat from core client for 30 sec - exiting
21:47:38 (7924): No heartbeat from core client for 30 sec - exiting
21:47:39 (7924): No heartbeat from core client for 30 sec - exiting
21:47:40 (7924): No heartbeat from core client for 30 sec - exiting
21:47:41 (7924): No heartbeat from core client for 30 sec - exiting
21:47:42 (7924): No heartbeat from core client for 30 sec - exiting
21:47:43 (7924): No heartbeat from core client for 30 sec - exiting
21:47:44 (7924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4308, iMonCtr=2
10:19:06 (6364): No heartbeat from core client for 30 sec - exiting
10:19:07 (6364): No heartbeat from core client for 30 sec - exiting
10:19:08 (6364): No heartbeat from core client for 30 sec - exiting
10:19:09 (6364): No heartbeat from core client for 30 sec - exiting
10:19:10 (6364): No heartbeat from core client for 30 sec - exiting
10:19:11 (6364): No heartbeat from core client for 30 sec - exiting
10:19:12 (6364): No heartbeat from core client for 30 sec - exiting
10:19:13 (6364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8540, selfPID=8540, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Jul 2015 14:19:30 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 115,499 464,849 4.0247
09 Jul 2015 11:50:11 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 103,979 417,636 4.0165
07 Jul 2015 18:43:56 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 92,459 369,118 3.9922
05 Jul 2015 08:46:46 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 80,939 319,333 3.9454
03 Jul 2015 09:30:20 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 69,419 274,172 3.9495
03 Jul 2015 08:39:37 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 57,899 228,749 3.9508
30 Jun 2015 12:51:37 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 46,379 183,772 3.9624
28 Jun 2015 20:28:58 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 34,859 137,656 3.9489
27 Jun 2015 09:43:15 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 23,339 92,601 3.9677
25 Jun 2015 13:58:08 1316091 18621689 hadam3p_anz_m7nk_2013_1_009946773_0 11,819 47,218 3.9951


©2024 cpdn.org