climateprediction.net home page
Task 16418432

Task 16418432

Name hadam3p_anz_naf2_2012_1_008601118_0
Workunit 8747630
Created 26 Mar 2014, 19:21:15 UTC
Sent 27 Mar 2014, 6:35:57 UTC
Report deadline 9 Mar 2015, 11:55:57 UTC
Received 1 May 2014, 16:23:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1218615
Run time 10 days 19 hours 10 min 28 sec
CPU time 10 days 5 hours 48 min 23 sec
Validate state Invalid
Credit 4,981.10
Device peak FLOPS 2.34 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22364, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
06:28:30 (1668): No heartbeat from core client for 30 sec - exiting
06:28:31 (1668): No heartbeat from core client for 30 sec - exiting
06:28:33 (1668): No heartbeat from core client for 30 sec - exiting
06:28:34 (1668): No heartbeat from core client for 30 sec - exiting
06:28:35 (1668): No heartbeat from core client for 30 sec - exiting
06:28:36 (1668): No heartbeat from core client for 30 sec - exiting
06:28:37 (1668): No heartbeat from core client for 30 sec - exiting
06:28:38 (1668): No heartbeat from core client for 30 sec - exiting
06:28:39 (1668): No heartbeat from core client for 30 sec - exiting
06:28:40 (1668): No heartbeat from core client for 30 sec - exiting
06:28:41 (1668): No heartbeat from core client for 30 sec - exiting
06:28:42 (1668): No heartbeat from core client for 30 sec - exiting
06:28:43 (1668): No heartbeat from core client for 30 sec - exiting
06:28:45 (1668): No heartbeat from core client for 30 sec - exiting
06:28:46 (1668): No heartbeat from core client for 30 sec - exiting
06:28:47 (1668): No heartbeat from core client for 30 sec - exiting
06:28:48 (1668): No heartbeat from core client for 30 sec - exiting
06:28:49 (1668): No heartbeat from core client for 30 sec - exiting
06:28:50 (1668): No heartbeat from core client for 30 sec - exiting
06:28:51 (1668): No heartbeat from core client for 30 sec - exiting
06:28:52 (1668): No heartbeat from core client for 30 sec - exiting
06:28:53 (1668): No heartbeat from core client for 30 sec - exiting
06:28:54 (1668): No heartbeat from core client for 30 sec - exiting
06:28:55 (1668): No heartbeat from core client for 30 sec - exiting
06:28:57 (1668): No heartbeat from core client for 30 sec - exiting
06:28:58 (1668): No heartbeat from core client for 30 sec - exiting
06:28:59 (1668): No heartbeat from core client for 30 sec - exiting
06:29:00 (1668): No heartbeat from core client for 30 sec - exiting
06:29:01 (1668): No heartbeat from core client for 30 sec - exiting
06:29:02 (1668): No heartbeat from core client for 30 sec - exiting
06:29:03 (1668): No heartbeat from core client for 30 sec - exiting
06:29:04 (1668): No heartbeat from core client for 30 sec - exiting
06:29:05 (1668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3484, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4404, selfPID=5368, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6032, selfPID=4372, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6016, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1792, selfPID=4852, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3516, selfPID=5244, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
06:17:22 (2564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:02:38 (4648): No heartbeat from core client for 30 sec - exiting
06:02:39 (4648): No heartbeat from core client for 30 sec - exiting
06:02:40 (4648): No heartbeat from core client for 30 sec - exiting
06:02:41 (4648): No heartbeat from core client for 30 sec - exiting
06:02:42 (4648): No heartbeat from core client for 30 sec - exiting
06:02:44 (4648): No heartbeat from core client for 30 sec - exiting
06:02:45 (4648): No heartbeat from core client for 30 sec - exiting
06:02:46 (4648): No heartbeat from core client for 30 sec - exiting
06:02:47 (4648): No heartbeat from core client for 30 sec - exiting
06:02:48 (4648): No heartbeat from core client for 30 sec - exiting
06:02:49 (4648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4684, selfPID=4684, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1968, iMonCtr=2
Model crash detected, will try to rSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5124, selfPID=5080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4036, selfPID=5536, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4128, selfPID=3004, iMonCtr=1
Model crash detected, will try to restart...
05:49:23 (4588): No heartbeat from core client for 30 sec - exiting
05:49:24 (4588): No heartbeat from core client for 30 sec - exiting
05:49:25 (4588): No heartbeat from core client for 30 sec - exiting
05:49:26 (4588): No heartbeat from core client for 30 sec - exiting
05:49:27 (4588): No heartbeat from core client for 30 sec - exiting
05:49:28 (4588): No heartbeat from core client for 30 sec - exiting
05:49:29 (4588): No heartbeat from core client for 30 sec - exiting
05:49:31 (4588): No heartbeat from core client for 30 sec - exiting
05:49:32 (4588): No heartbeat from core client for 30 sec - exiting
05:49:33 (4588): No heartbeat from core client for 30 sec - exiting
05:49:34 (4588): No heartbeat from core client for 30 sec - exiting
05:49:35 (4588): No heartbeat from core client for 30 sec - exiting
05:49:36 (4588): No heartbeat from core client for 30 sec - exiting
05:49:37 (4588): No heartbeat from core client for 30 sec - exiting
05:49:38 (4588): No heartbeat from core client for 30 sec - exiting
05:49:39 (4588): No heartbeat from core client for 30 sec - exiting
05:49:40 (4588): No heartbeat from core client for 30 sec - exiting
05:49:41 (4588): No heartbeat from core client for 30 sec - exiting
05:49:43 (4588): No heartbeat from core client for 30 sec - exiting
05:49:44 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:23:15 (4376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4792, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5716, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4576, selfPID=4272, iMonCtr=1
Model crash detected, will try to restart...
06:28:52 (4956): No heartbeat from core client for 30 sec - exiting
06:28:53 (4956): No heartbeat from core client for 30 sec - exiting
06:28:54 (4956): No heartbeat from core client for 30 sec - exiting
06:28:55 (4956): No heartbeat from core client for 30 sec - exiting
06:28:56 (4956): No heartbeat from core client for 30 sec - exiting
06:28:57 (4956): No heartbeat from core client for 30 sec - exiting
06:28:58 (4956): No heartbeat from core client for 30 sec - exiting
06:28:59 (4956): No heartbeat from core client for 30 sec - exiting
06:29:01 (4956): No heartbeat from core client for 30 sec - exiting
06:29:02 (4956): No heartbeat from core client for 30 sec - exiting
06:29:03 (4956): No heartbeat from core client for 30 sec - exiting
06:29:04 (4956): No heartbeat from core client for 30 sec - exiting
06:29:05 (4956): No heartbeat from core client for 30 sec - exiting
06:29:06 (4956): No heartbeat from core client for 30 sec - exiting
06:29:07 (4956): No heartbeat from core client for 30 sec - exiting
06:29:08 (4956): No heartbeat from core client for 30 sec - exiting
06:29:09 (4956): No heartbeat from core client for 30 sec - exiting
06:29:10 (4956): No heartbeat from core client for 30 sec - exiting
06:29:11 (4956): No heartbeat from core client for 30 sec - exiting
06:29:13 (4956): No heartbeat from core client for 30 sec - exiting
06:29:14 (4956): No heartbeat from core client for 30 sec - exiting
06:29:15 (4956): No heartbeat from core client for 30 sec - exiting
06:29:16 (4956): No heartbeat from core client for 30 sec - exiting
06:29:17 (4956): No heartbeat from core client for 30 sec - exiting
06:29:18 (4956): No heartbeat from core client for 30 sec - exiting
06:29:19 (4956): No heartbeat from core client for 30 sec - exiting
06:29:20 (4956): No heartbeat from core client for 30 sec - exiting
06:29:21 (4956): No heartbeat from core client for 30 sec - exiting
06:29:22 (4956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:29:23 (4956): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5488, selfPID=5176, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Apr 2014 06:33:36 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 115,499 853,531 7.3899
27 Apr 2014 05:28:04 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 103,979 769,178 7.3974
26 Apr 2014 03:47:30 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 92,459 680,615 7.3613
20 Apr 2014 20:57:43 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 80,939 597,363 7.3804
14 Apr 2014 07:45:17 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 69,419 512,860 7.3879
13 Apr 2014 06:05:48 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 57,899 425,894 7.3558
10 Apr 2014 12:51:57 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 46,379 341,119 7.3550
07 Apr 2014 02:34:33 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 34,859 255,398 7.3266
06 Apr 2014 02:02:57 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 23,339 169,733 7.2725
31 Mar 2014 11:44:22 1218615 16418432 hadam3p_anz_naf2_2012_1_008601118_0 11,819 84,216 7.1255


©2024 climateprediction.net