climateprediction.net home page
Task 15917224

Task 15917224

Name hadcm3n_n7ho_1960_40_008400042_1
Workunit 8550898
Created 14 Aug 2013, 15:23:51 UTC
Sent 14 Aug 2013, 15:28:07 UTC
Report deadline 13 Nov 2013, 22:55:18 UTC
Received 24 Aug 2013, 2:09:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1181321
Run time 9 days 1 hours 59 min 18 sec
CPU time 7 days 17 hours 13 min 56 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 2.25 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
06:16:27 (4768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:16:28 (4768): No heartbeat from core client for 30 sec - exiting
06:16:29 (4768): No heartbeat from core client for 30 sec - exiting
06:17:49 (5008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:06:59 (5276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:07:01 (5276): No heartbeat from core client for 30 sec - exiting
08:07:46 (5248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:07:47 (5248): No heartbeat from core client for 30 sec - exiting
20:21:02 (3724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
02:07:04 (352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:17:35 (3716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:17:36 (3716): No heartbeat from core client for 30 sec - exiting
06:18:39 (3544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:00:41 (3748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:00:44 (3748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
20:58:12 (2680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:03:13 (1992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:59:05 (856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:59:07 (856): No heartbeat from core client for 30 sec - exiting
16:23:23 (3192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:23:24 (3192): No heartbeat from core client for 30 sec - exiting
20:27:16 (4656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:27:18 (4656): No heartbeat from core client for 30 sec - exiting
20:28:07 (3016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:06:32 (4744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:06:34 (4744): No heartbeat from core client for 30 sec - exiting
03:06:35 (4744): No heartbeat from core client for 30 sec - exiting
03:10:16 (4664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:43:34 (6056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:43:35 (6056): No heartbeat from core client for 30 sec - exiting
16:43:36 (6056): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Aug 2013 17:33:39 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 311,040 641,658 2.0629
23 Aug 2013 10:32:58 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 285,120 589,431 2.0673
22 Aug 2013 08:39:30 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 259,200 537,903 2.0752
21 Aug 2013 15:53:21 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 233,280 485,508 2.0812
20 Aug 2013 23:53:23 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 207,360 433,599 2.0910
20 Aug 2013 08:58:12 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 181,440 385,350 2.1238
19 Aug 2013 11:40:17 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 155,520 336,307 2.1625
18 Aug 2013 18:26:41 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 129,600 283,669 2.1888
17 Aug 2013 22:07:21 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 103,680 227,189 2.1913
17 Aug 2013 01:53:28 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 77,760 170,003 2.1863
16 Aug 2013 05:48:50 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 51,840 112,668 2.1734
15 Aug 2013 10:21:08 1181321 15917224 hadcm3n_n7ho_1960_40_008400042_1 25,920 55,791 2.1524


©2024 cpdn.org